TeleChat3-36B-Thinking:
✨ Native support for the Ascend + MindSpore ecosystem
✨ Inspired by DeepSeek’s architecture design, bringing training stability and efficiency gains
StepFun has been focused on multimodal AI from the very beginning. Their latest release is a new foundation model: STEP3-VL 🔥
https://huggingface.co/collections/stepfun-ai/step3-vl-10b
✨ 10B - Apache 2.0
✨ Leads in the 10B class and competes with models 10–20× larger
✨ Hybrid architecture: a combined autoregressive + diffusion design delivers strong semantic alignment with high-fidelity detail
✨ Strong performance in long, dense, and multilingual text rendering
✨ MIT licensed (VQ tokenizer & ViT weights under Apache 2.0)
✨ Now live on Hugging Face Inference Providers 🤗 (usage sketch below)
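Since the post doesn't name the exact repo id, here is a minimal sketch of querying a text-to-image model through Hugging Face Inference Providers via huggingface_hub; the `model` value is a placeholder to swap for the actual model id from the release:

```python
# Minimal sketch: calling a text-to-image model on Hugging Face
# Inference Providers. The repo id below is a placeholder (assumption);
# substitute the actual model id from the release collection.
from huggingface_hub import InferenceClient

client = InferenceClient()  # picks up HF_TOKEN from the environment

image = client.text_to_image(
    "A storefront sign reading 'OPEN SOURCE' in dense multilingual script",
    model="org/model-id",  # placeholder repo id
)
image.save("sample.png")  # text_to_image returns a PIL.Image
```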
AgentCPM-Explore 🔥 an on-device agent foundation model released by OpenBMB
openbmb/AgentCPM-Explore
✨ 4B - Apache 2.0
✨ Supports 100+ multi-turn environment interactions with search + verification
✨ Full training/inference stack is openly shared as well (see the loading sketch below)
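A minimal sketch of loading the checkpoint with transformers, assuming a standard causal-LM interface and chat template; the actual agent tooling and prompt format should be taken from the model card:

```python
# Minimal sketch: loading openbmb/AgentCPM-Explore with transformers.
# Assumes a standard causal-LM interface and chat template (assumption);
# consult the model card for the agent-specific environment loop.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "openbmb/AgentCPM-Explore"
tokenizer = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    repo,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)

messages = [{"role": "user", "content": "Find the latest Apache-2.0 4B agent models."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```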
✨ Big wave of foundation models: still scaling, but efficiency, reasoning, and deployment now matter more than size
- DeepSeek-V3.2
- Z.ai GLM-4.7
- MiniMax-M2.1
- Xiaomi: MiMo-V2-Flash
✨ Multimodal reasoning is now the default
- Z.ai GLM-4.6V
- Z.ai AutoGLM-Phone 9B
- Bytedance: Dolphin-v2
Only a year into open source, MiniMax is already making a great impact, not only through solid models and products but also through how well the team uses community platforms like Hugging Face: HF Teams, blogs, Daily Papers, Spaces as project pages, and constant experimentation with new ways to engage. Super impressive!