Current State of Chinese LLMs: Market Leaders, Open Models, and Business Models

This is a summary of the current Chinese LLM scene based on research shared on r/LocalLLaMA. The analysis categorizes major players by their proprietary models, open-weight offerings, and business approaches.
Major Companies and Their Models
ByteDance: Their proprietary model dola-seed (also called Doubao) is described as the current market leader, playing a role similar to OpenAI. They also have an open-source Seed OSS 36B model, but the source notes it doesn't get much discussion.
Alibaba: Their proprietary model Qwen Max is reportedly not widely used. However, Alibaba is noted as strongest in open-weight offerings, especially small models, and leads in text-to-image (T2I) and text-to-video (T2V) capabilities.
Tencent: Their proprietary model Hunyuan is also not widely used. Their T2I and T2V efforts are considered second to Alibaba's.
Baidu: Their proprietary model Ernie is not widely used, with Baidu being stronger in autonomous driving.
Xiaomi: Their proprietary model is Mimo V2 Pro, and they have an open-weight model Mimo V2 Flash 309B-A15B.
DeepSeek: The Innovative Side Project
DeepSeek is described as a side project from an algorithmic trading firm. Current usage in China is reportedly a close second to ByteDance's Doubao, with about half the users. The source highlights DeepSeek as "the most innovative among all Chinese LLM companies," having invented techniques like MLA, MTP, DSA, and GRPO. The analysis suggests its business model might be similar to the 'Six AI Small Tigers,' but speculates the project might be more for attracting investments and gaining political access.
The Six AI Small Tigers
This group is characterized by highly similar business models: releasing big open-weight models to gain recognition while providing cheap inference services. The source questions their long-term viability.
- Zhipu: IPOed in Hong Kong. Their current
GLM-5model is described as a derivative of DeepSeek. - Minimax: IPOed in Hong Kong. They have a proprietary
MiniMax 2.7model and an open-weightMiniMax 2.5model, which is described as a "vanilla MoE 229B-A10B." This architecture reportedly gives them significantly lower inference costs than others. - Moonshot: Their
Kimiopen-weight model is described as a derivative of DeepSeek. - Stepfun: Their open-weight
Step 3.5 flashmodel uses a mixture of full attention and sliding window attention (SWA) layers at a 1:3 ratio. It's described as a 196B-A11B model with a business model similar to Minimax's, though their model is reportedly not as good. - Baichuan: Their
Baichuan-M3 235Bis described as a medical-enhanced open-weight model based onQwen3Moe. - 01 AI: Their last open-weight model was
Yi-34B, published in November 2024. They now reportedly focus on enterprise AI agent systems, making them "irrelevant to people here."
📖 Read the full source: r/LocalLLaMA
👀 See Also

Yann LeCun's AMI raises $1B for AI world models, challenges LLM approach
Yann LeCun's startup AMI raised over $1 billion to develop AI world models that understand the physical world, arguing LLMs alone won't achieve human-level intelligence. The company will build systems with persistent memory, reasoning, and planning capabilities for manufacturing, biomedical, and robotics applications.

Georgia Court Order Contains AI-Hallucinated Legal Citations
A Georgia Supreme Court appeal revealed a trial court order contained at least five citations to nonexistent cases and five more to cases that don't support their cited propositions, with the prosecutor's proposed order containing the same errors.

SCOTUS Declines to Hear AI Copyright Case, Leaving Lower Court Ruling Intact
The U.S. Supreme Court declined to hear a dispute over copyrights for AI-generated material, leaving a lower court ruling that denied copyright protection for works created without human authorship in place.

Reddit user proposes timestamping feature for Claude to address temporal awareness gap
A Reddit user identifies Claude's lack of temporal awareness as a limitation for productivity use cases and proposes an optional timestamping feature that would stamp every response with date and time, persistent across sessions.