LongCat-2.0 Pushes Open Mixture-of-Experts to 1.6 Trillion Parameters
New 48B-active MoE model tests the limits of accessible large-scale architecture
LongCat-2.0 has entered the open-source arena as a 1.6-trillion-parameter mixture-of-experts model with 48 billion active parameters per forward pass. The release marks one of the largest publicly available MoE architectures to date, positioning it alongside proprietary systems that have remained behind API walls.
The model's sparse activation pattern means only a fraction of its total weights engage for any given token, keeping inference costs closer to a dense 48B model while retaining the knowledge capacity of a far larger system. For research teams and startups without hyperscaler budgets, this efficiency curve shifts what's feasible on commodity GPU clusters.
Early benchmarks suggest strong performance on long-context reasoning and multilingual tasks, though comprehensive third-party evaluation remains limited. The weights are available under a permissive license, inviting fine-tuning experiments that were previously restricted to organizations with dedicated training infrastructure.
As MoE architectures mature, the tension between parameter count and operational practicality will define the next wave of deployment strategies. Open releases of this scale force a recalibration of what "open" means when the hardware requirements still exclude most practitioners.
Join the discussion
How should the community evaluate openness when a model's weights are free but its inference demands remain prohibitive?
Loading comments...
More in Artificial Intelligence

OpenAI's Codex Agent Clears Loldle Worlds in Single Attempt
Coding assistant demonstrates unexpected proficiency at League of Legends trivia game

Agentic AI Gains Traction as Enterprises Chase ROI
Tech teams lead adoption while business context gaps remain the primary barrier
Firefly Aerospace Runs NVIDIA Jetson in Lunar Orbit, Proving Edge AI Beyond Earth
First operational test of commercial edge compute in deep space validates autonomous spacecraft architectures
South Korea Bets $1 Trillion on Chips, Data Centers, and Humanoid Robots
A coordinated megaproject aims to secure the physical infrastructure underpinning the next decade of AI