This trend is not new, but a surge in announcements over the last few weeks reconfirms how the Chinese models are not only keeping pace with the rest but also reshaping the dynamics of artificial intelligence innovation locally and, in some sense, providing new ideas globally.
While the Western world often basks in the glory of considerable AI advancements, there is also a rapidly unfolding revolution in China—one driven by both ingenuity and necessity.
Let’s list the announcements of the last few weeks to get a measure of the pace:
Company | Model/Product | Date Announced | Key Features | Significance |
Alibaba Cloud | Qwen-72B | Early Nov 2024 | 72B parameters- Open-source- Multilingual- Advanced MoE- State-of-the-art reasoning performance | Reinforces Alibaba’s lead in the LLM space; offers a powerful open-source alternative |
Tencent | HunYuan Video | Nov 28, 2024 | 13B parameters- Text-to-video generation- Contrastive Video-Language Alignment (CVLA)- Open-source | A significant step forward in video generation capabilities, democratizing access through open source |
Baidu | iRAG | Nov 12, 2024 | Text-to-image generation- Minimizes hallucinations using search capabilities | Enhances the accuracy of AI-generated imagery |
DeepSeek | DeepSeek-R1-Lite-Preview | Nov 20, 2024 | Reasoning-focused model- “Chain-of-thought” reasoning- Matches Western models’ performance | Demonstrates a strong focus on reasoning, efficiency, and keeping up with global standards |
Bytedance | Doubao | Ongoing | Doubao-PixelDance and Doubao-Seaweed for video generation- Potential integration with TikTok | Emphasizes Bytedance’s expertise in marrying LLMs with content creation on social media |
JD.com | ChatJD | Sept 2024 | E-commerce optimization- Multimodal support- Sector-specific applications | Drives e-commerce innovation through LLMs focused on consumer needs |
Chinese models have been in lockstep with the West’s for some time, if not right from the start.
Its journey has involved numerous groundbreaking contributions recognized globally much later after becoming a part of more famous Western modelers’ toolkits. Their approach to AI has consistently been about finding new pathways prioritizing efficiency, scalability and practicality.
Wu Dao 2.0, released in May 2021, is a prime example of this early leadership. It showcased China’s capabilities in multimodal AI, integrating text and image processing with a scale of 1.75 trillion parameters—surpassing many Western models of the time.
Beyond Wu Dao, Huawei’s Pangu-α model, introduced in 2021, exemplified China’s focus on developing resource-efficient AI. Pangu-α was among the first to demonstrate that large-scale language models could be optimized for efficiency, paving the way for similar global trends. Baidu also contributed significantly with its early multimodal AI models, which integrated visual and language capabilities. All these were well ahead of similar advancements in the West.
The latest innovations continue this trend of pioneering approaches that will likely be replicated, if not emulated, globally. For instance, Tencent’s Hunyuan Video model utilizes Contrastive Video-Language Alignment (CVLA) to achieve impressive quality in video generation, an area where efficiency is often a key challenge.
Meanwhile, DeepSeek’s “chain-of-thought” reasoning capabilities have significantly improved understanding of complex prompts, underlining the early adoption and refinement of techniques.
Of course, China has lagged in numerous fields. In the patentless world, its model makers have adopted a far greater number of innovations than those pioneered in the US and elsewhere. The important point is that they, too, have innovated, and their innovations need more attention now because of their different focus.
China’s regulatory environment places unique restrictions on AI development, prompting Chinese developers to find creative solutions to meet these demands. Regulations necessitate AI models to control generated content, preventing potentially dangerous or undesirable outputs.
This necessity has led to the development of sophisticated methods like adaptive token dropping, which optimizes model efficiency while effectively managing resource allocation, ensuring that models adhere to regulatory standards.
Another aspect of regulatory compliance is the integration of censorship mechanisms within AI models. These features, while controversial, serve to align AI output with local regulations.
Such mechanisms could be adapted globally for spam filtering, harmful content moderation, and cybersecurity. The Chinese focus on safety and controlled outputs could provide valuable insights into building more secure AI systems worldwide.
Chinese developers have had to contend with restricted access to cutting-edge hardware, such as advanced GPUs. These limitations have driven innovation, pushing companies to achieve more with less. This has led to breakthroughs in efficient hardware utilization and an emphasis on designing software that optimizes available technology.
For example, Baidu adapted its Ernie models to run effectively on Kunlun chips, while Alibaba optimized its Qwen models for Huawei’s Ascend processors. These adaptations demonstrate how careful software-hardware integration can overcome limitations and achieve competitive performance.
This focus on maximizing output from constrained resources showcases ingenuity and aligns with global concerns over the environmental and economic costs of AI development.
Recent innovations in Sparse-Layered Models (SLMs) and Mixture of Experts (MoE) architectures have further enhanced the ability of Chinese AI systems to function efficiently under hardware limitations.
Alibaba’s Qwen-72B, for instance, utilizes an advanced MoE architecture that activates only a subset of model parameters during inference. This approach—supposedly an improvement over MOE efforts globally—reduces computational load while maintaining high performance.
Similarly, agentic developments, such as Baichuan AI’s Baixiaoying assistant, incorporate features that make interactions more efficient by leveraging Sparse-MoE techniques. These models are designed to be both resource-aware and highly responsive, ensuring that limited hardware does not materially compromise the quality of user interactions or computational efficiency.
DeepSeek’s reduction in token processing costs to 1 RMB per million tokens illustrates another aspect of this constraint-driven innovation. Although further validation is needed, this approach points to a promising trend in cost-effective AI, setting benchmarks for affordability that could prove crucial as AI continues to expand globally.
Chinese AI innovations have excelled in focusing on practical applications that directly address market needs. Unlike the Western tendency to expand LLM capabilities for general-purpose use, Chinese companies have concentrated on specific sectors where AI can make an immediate impact. This approach has led to significant mobility, robotics, healthcare, and e-commerce advances.
BYD is integrating LLMs for advanced voice assistants and autonomous driving features, creating more intuitive and personalized driving experiences. In robotics, Chinese LLMs are driving advancements in human-robot interaction. These improvements lead to more effective automated systems that work seamlessly with human operators.
Apart from such cobots, companies like Geek+ and Hai Robotics use AI-powered robots for warehouse automation. Elephant Robotics has developed robots to assist in elderly care. A host of companies are using AI-powered drones and robots for tasks like crop spraying, seeding, and field monitoring apart from harvesting.
For healthcare applications, Baidu’s iRAG tool has shown notable success in improving the reliability of AI in medical imaging by reducing hallucinations—an essential step in enhancing diagnostic accuracy.
Baidu has been actively developing ERNIE Bot 4.0 for medical consultation; this could be an active area for most models globally after the recent NY Times article that showed AI models’ astounding diagnostic accuracy compared to professional doctors, although the regulatory barriers could be lesser for China.
Huawei’s Pangu drug molecule model has learned the chemical structure of 1.7 billion drug-like molecules in the market. Huawei expects the model to function as a virtual chemist, helping researchers design and identify novel molecules likely to interact with drug targets and reduce R&D costs by over 70%.
Chinese models are also notable for their emphasis on accessibility. Tools like Baidu’s Miaoda no-code AI application builder simplify the development of AI-powered solutions, allowing smaller enterprises without specialized technical teams to harness the power of LLMs.
Tencent is leveraging HunYuan Pro’s AI capabilities to enhance game development workflows. The model can assist with tasks like generating game dialogue, creating non-player characters (NPCs) with realistic behaviors, and generating game levels and environments.
AI models and applications are developed globally in all innovation centers. The trajectory of Chinese AI innovation showcases a different route to technological advancement—where constraints fuel creativity, and efficiency becomes the cornerstone of progress.
Chinese AI developers have not merely adapted to limitations in their regulatory environment and resource constraints but have turned them into opportunities. Their emphasis on application, efficiency, and adaptability forms a valuable counterbalance to the Western focus on scaling and capability expansion.
Most importantly, there is a remarkable pick-up in the announcements of late. In end-user-affecting changes like TikTok’s latest features, Tencent changing the accessibility in WeChat, or Baidu’s glasses, Chinese innovations are also reaching end-users rapidly.
None of these are likely to be ignored by markets for long, although reading reports from media and analyst communities, it may appear like nothing is going on with the Chinese tech companies in the AI space.
We will return to the harmful impact of the inability to generate excitement in the future, particularly when we discuss Korea, although this is also true in the case of China.
Nilesh Jasani is a Singapore-based innovation investor for LC GenInnov Fund