AI Lab @AntGroup, we envision AGI as humanity's shared milestone. Our Language Model @AntLingAGI and LLaDA, Embodied AI @robbyant_brain, OSS projects AReaL etc.
Ring-2.6-1T is now open source! 1T-parameter thinking model with two reasoning gears and SOTA Agent execution. 🚀
🔗 https://t.co/OI8hBuSJEn
🤖 Stronger Agent execution: SOTA on PinchBench (87.60) and ClawEval (63.82), top-tier on TAU2-Bench and GAIA2-search. SWE-Bench Verified 74.00.
🎯 high / xhigh Reasoning Effort: high for production Agent workflows, xhigh for hard reasoning (AIME 26: 95.83, GPQA Diamond: 88.27)
⚡ Async RL + Popsicle algorithm: stable long-cycle RL training at trillion-scale
Compatible with OpenClaw, Claude Code, OpenCode, Kilo Code, and more.
🚀 Ring-2.6-1T is now open source.
A trillion-scale flagship thinking model built for real-world complex tasks: Agent workflows, coding & engineering, long-horizon tasks, complex reasoning, research, and enterprise automation.
It is designed to move beyond “answering” toward execution: understanding context, planning steps, calling tools, and staying stable across long task chains.
Highlights:
- Advanced agentic workflow support.
- Reasoning effort levels: high for agentic tasks, xhigh for complex reasoning.
- Scalable asynchronous RL via the IcePop algorithm, enabling stable, trillion-scale training for long-horizon agentic RL.
Ring-2.6-1T from @AntLingAGI is now live on @OpenRouter with free access available until May 15 PT.
1T total parameters · 63B active parameters
Built for real-world agent workflows:
• Coding agents
• Tool use
• Long-horizon execution
• Lower token overhead with adaptive reasoning
Designed for advanced autonomous systems where capability, latency, and cost efficiency all matter.
Powered by Novita AI
Ring 2.6 1T from @TheInclusionAI is live in Kilo, free for a limited time.
Worth noting: Ling 2.6 1T shot to the top of our leaderboard right after launch.
Both available now in the model picker! 🔥
Ring-2.6-1T is now 1️⃣week free on @OpenRouter!
✅Lower token overhead and rapid multi-step execution
✅Suitable for general and coding agent use cases
✅Designed to deliver reliable execution in production workflows at a rational inference cost
start now🤗 https://t.co/6GABLt5AzK
We are launching Ring-2.6-1T, a trillion-parameter flagship thinking model engineered for real-world complex tasks and production env: 🚀
- Adjustable Thinking Effort: dynamic compute mechanism to flexibly balance cognitive depth, token cost, and execution speed;
- Agent-Optimized: Built for high-frequency workflows, delivering rapid multi-step execution and tool orchestration with SOTA stability;
- Deep Thinking: Unlocks the model's maximum capability ceiling for rigorous mathematical logic and scientific research;
AReaL is joining the @PyTorch Landscape🎉https://t.co/5IvYOhWcYW
Fully async, modular, built for the agentic AI era. AReaL is a scalable, RL infrastructure designed to bridge foundation model training with modern agent-based applications.
Explore AReaL: https://t.co/659iOG4Xbq
Announcing Ling-2.6-1T by inclusionAI, now available on OpenRouter. 🚀
This trillion-parameter flagship instruct model is built for real-world agents. It utilizes a “fast thinking” approach to cut costs by ~75% while maintaining SOTA performance on AIME26 and SWE-bench Verified. Ideal for:
- Advanced coding
- Complex reasoning
- Large-scale agent workflows
So excited to announce Ling-2.6-1T is now live on ModelScope!🔥
This 1T parameter model is built for complex agent workflows, multi-step execution and long-context understanding. It truly delivers in production.
📊The benchmarks speak for themselves:
- AIME26 — leads all non-reasoning models
- SWE-bench Verified, TAU2-Bench, BFCL-V4, PinchBench — first-tier open-source
- ~16M tokens on Artificial Analysis full eval — same efficiency story as Ling-2.6-flash
Works with Claude Code, OpenClaw, OpenCode & CodeBuddy ✅ SGLang & vLLM ready · Open weights available now 🚀
Explore on ModelScope 👇 https://t.co/wBJR1dVNZX
https://t.co/Ylvkzfgfqs
Ling-2.6-1T, designed for precise instruct task execution is now #OpenSource🥂Built for the Agentic era, Ling-2.6-1T brings about a significant capability upgrade↗️ and top-tier performance.#LLM
🤗 Hugging Face: https://t.co/cxqRwUV5Uo
📷 ModelScope: https://t.co/c5poIGOkNY
Last week, we introduced Ling-2.6-1T. Today, Ling-2.6-1T is officially an open model~ 🤗
1T total parameters · 63B active parameters
We bring values to developers by making it easier to test, deploy, customize, and build.
It is optimized to be "token efficiency" for real production needs:
• Lower token overhead: strong intelligence without long reasoning traces
• Reliable multi-step execution: better instruction, tool, context, and workflow control
• Production-ready deployment: from code generation to bug fixing, with broad agent framework compatibility
A sneak pick into the agentic capability in @opencode
📣Fast generation, high token efficiency, real task execution and improved experience, Ling-2.6-flash is now opensource!
🔧Available in BF16, FP8, and INT4 variants for different deployment needs.
Hugging Face: https://t.co/pLOHIj7o8L
ModelScope: https://t.co/dmk4lqAlg9
Ling-2.6-flash is now officially open-sourced!
A fast, token-efficient Instruct model built for real-world agent workflows.
104B total parameters · 7.4B active parameters
Available in BF16, FP8, and INT4 variants for different deployment needs.
Key strengths:
- Fast generation: 215 tokens/s on Artificial Analysis Output Speed
- High token efficiency: only 15M tokens on the full AA Intelligence Index evaluation
- Real task execution: strong performance across coding, document processing, and lightweight agent workflows
- Improved experience: better Chinese-English switching and smoother compatibility with mainstream coding frameworks
📢 Ling-2.6-1T from @TheInclusionAI is live on @ZenMuxAI 🔥
Ant Group's trillion-param MoE flagship. 50B active per token. Open-source SOTA on SWE-bench Verified, AIME & agent benchmarks — no thinking tokens needed ⚡️
🔗 https://t.co/NO5Qjbny7R
✨ Inclusion AI's LLaDA2.0-Uni is open-source! A single MoE-based diffusion LLM that unifies visual understanding and image generation — natively, in one model.
Download on ModelScope 👉 https://t.co/3CJDxI8GQ1
Built on a single Mask Token Prediction paradigm, LLaDA2.0-Uni handles:
🖼️ Text-to-image synthesis at 1024×1024, with the option to "think" before drawing
🔍 Visual question answering, captioning, and document understanding on par with dedicated VLMs
✏️ Instruction-driven image editing — single or multi-reference, faithful to original details
🎨 Interleaved text-image reasoning, opening the door to a new class of multimodal chains
Released under Apache 2.0 — paper, code, and weights all open.
📄 https://t.co/4mwDOznE4j
🔗 https://t.co/90FxClxuqQ
LLaDA2.0-Uni from Inclusion AI
Unified diffusion LLM for multimodal understanding and generation. Features MoE backbone with SigLIP-VQ tokenizer enabling 8-step image generation and native interleaved reasoning.
DR-Venus: a 4B parameter deep research agent trained on only 10K open data
Achieves frontier performance on edge-scale devices, outperforming prior 9B agents and narrowing the gap to 30B-class systems through agentic SFT and RL with information gain rewards.
Built for stable, high-speed execution in complex, real-world environments, meet Ling-2.6-1T!🥳Massive leap& top-tier performance, and 🛠️ Engineering-Task-Friendly.
We’re unlocking 1 week of free API access. Start testing now at @OpenRouter https://t.co/eEE1QvZXNY
Open-source model is on the way! Keep an eye out—more details dropping soon. 👀
#inclusionAI #LLM
🚀 Today, we are launching Ling-2.6-1T, a trillion-parameter flagship model designed for precise instruct task execution. By prioritizing a "Fast-Thinking" mechanism, it delivers SOTA intelligence with ultra-low token overhead, making token efficiency a first-class citizen.
Brand new release from our team! 🆕DR-Venus — exploring new possibilities for edge-scale deep research with just 10K open data.
🛠️ Built with a two-stage recipe:
1️⃣ Agentic SFT: strict trajectory cleaning + long-horizon resampling for maximum data value
2️⃣ IGPO RL: turn-level rewards via information gain & format-aware regularization for stable long-horizon execution
#inclusionAI #deepresearch #AIAgent
Edge-ready GGUF format — deploy on your Mac today.
🐙 Code: https://t.co/2gQzYZC3iC
📄 Paper: https://t.co/DoQglP9Kmw
🤗 Models: https://t.co/2WzxZQyAUo
We’re excited to introduce LLaDA2.0-Uni, the 💥first unified multimodal model in the LLaDA2.0 series.
Highlights:
🧠 One paradigm to rule all – With unified block-wise mask token prediction, LLaDA2.0-Uni achieves top-tier performance across visual understanding, high-fidelity image generation, and single/multi-reference image editing.
⚡ Efficient inference – A novel decoding strategy in the dLLM backbone, together with an 8-step distilled diffusion decoder, enables highly efficient inference. (SGLang support soon for even faster inference.)
🔄 Interleaved, intelligent, infinite – Unified discrete representations enable seamless interleaved generation and advanced interleaved reasoning capabilities.
After two months of teamwork, we’re excited to share our team’s latest achievement — LLaDA2.0-Uni, InclusionAI’s first multimodal LLaDA.
A unified discrete diffusion LLM built for both understanding and generation across text and images.
Highlights:
● One paradigm for VQA, doc understanding, and image generation
● Efficient inference with a new decoding strategy + 8-step distilled decoder
● Interleaved text-image generation enabled by unified discrete representations
(SGLang support soon)
🤗 Hugging Face: https://t.co/bDiucSDEN7
📷 ModelScope: https://t.co/qnztdVyl7U
Ant Group's Ling 2.6 Flash scores 26 on the Artificial Analysis Intelligence Index, a 10-point jump from Ling-flash-2.0. It is one of few recent open weights releases focused on non-reasoning capabilities and focuses on a reasonable cost to intelligence ratio.
Ling 2.6 Flash is a non-reasoning model from Ant Group's @TheInclusionAI lab. Ant Group's model family comprises three series: Ling (non-reasoning), Ring (reasoning), and Ming (multimodal). Ling-flash-2.0 was the previous flash-tier non-reasoning model. Ling 2.6 Flash is expected to be open weights shortly after release, but as of today the weights have not been released on Hugging Face.
Key takeaways:
➤ At 104B total parameters with 7.4B active parameters, Ling 2.6 Flash (26) sits in intelligence near GPT-5.4 nano (Non-Reasoning, 24) and Gemma 4 26B A4B (Non-reasoning, 27), both models with comparable active parameter counts. However, at 18 points behind GLM-5.1 (Non-reasoning, 44), there remains a gap to frontier non-reasoning open weights models
➤ Ling 2.6 Flash is comparatively token efficient, using ~15M output tokens to run the Intelligence Index. This is comparable to Gemma 4 26B A4B (~14M) but a fraction of Qwen3.5 9B (~78M). Compared to models in the similar intelligence tier, Ling 2.6 Flash represents a reasonable efficiency tradeoff, which has positive effects on cost when deployed on larger workloads. At a price of $0.1 / million input tokens and $0.3 / million output tokens, Ling 2.6 Flash costs only ~$23 to run the full Artificial Analysis Intelligence Index.
➤ Gains from Ling-flash-2.0 were driven mostly by improvements agentic capabilities and instruction following. τ²-Bench jumped from 21% to 86% (+65 points), IFBench from 34% to 57% (+23 points), and GDPval-AA Elo from 425 to 783 (+84%). Conversely, GPQA Diamond fell from 66% to 59% (-6 points) and SciCode from 29% to 27% (-2 points).
➤ AA-Omniscience performance is at -66 with 15% accuracy and 96% hallucination rate. This is consistent with the model's small 7.4B active parameter count. Knowledge recall benefits from larger parameter counts, and sub-10B active-parameter models systematically underperform on this metric.
Additional model details:
➤ Architecture: MoE, 104B total parameters, 7.4B active parameters
➤ Context window: 262K tokens (doubled from 128K for Ling-flash-2.0)
➤ Pricing: $0.10 / $0.30 per 1M input/output tokens (via Novita API)
➤ License: Weights not yet released ➤ Availability: Third party API through @novita_labs