inclusionAI Releases Ling-2.6-1T: Hybrid Architecture Trillion-Parameter Model with Sparse Attention and Fast Thinking

inclusionAI has open-sourced Ling-2.6-1T, a trillion-parameter flagship model from the Ling family, targeting complex real-world tasks. The model introduces a hybrid architecture combining Multi-head Latent Attention (MLA) and Linear Attention to improve inference efficiency, lowering latency and VRAM usage for long contexts while keeping expressivity.
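To make the long-context memory argument concrete, here is a minimal sketch of generic linear attention in its recurrent form. It is not Ling-2.6-1T's implementation: the `elu(x) + 1` feature map, the class and function names, and the toy dimensions are all assumptions chosen for illustration. The point it shows is that the linear-attention state has a fixed size, while a plain (uncompressed) key-value cache grows with every token.

```python
import math


def feature_map(x):
    # Assumed positive feature map: elu(x) + 1, so every entry is > 0.
    return [xi + 1.0 if xi > 0 else math.exp(xi) for xi in x]


class LinearAttentionState:
    """Recurrent linear attention for one head (illustrative only)."""

    def __init__(self, d_k, d_v):
        # S accumulates outer products phi(k) v^T; z accumulates phi(k).
        self.S = [[0.0] * d_v for _ in range(d_k)]
        self.z = [0.0] * d_k

    def step(self, q, k, v):
        # Fold the new key/value pair into the fixed-size state.
        phi_k = feature_map(k)
        for i, ki in enumerate(phi_k):
            self.z[i] += ki
            row = self.S[i]
            for j, vj in enumerate(v):
                row[j] += ki * vj
        # Read out: (phi(q)^T S) / (phi(q)^T z).
        phi_q = feature_map(q)
        denom = sum(qi * zi for qi, zi in zip(phi_q, self.z))
        return [
            sum(phi_q[i] * self.S[i][j] for i in range(len(phi_q))) / denom
            for j in range(len(v))
        ]

    def num_floats(self):
        # Size of the state; independent of how many tokens were seen.
        return len(self.S) * len(self.S[0]) + len(self.z)


def kv_cache_floats(seq_len, d_k, d_v):
    # A plain KV cache stores one key and one value per token.
    return seq_len * (d_k + d_v)
```

With `d_k = d_v = 4`, the state holds 20 floats however many tokens are processed, whereas `kv_cache_floats(100, 4, 4)` is 800 and keeps growing with sequence length. Hybrid designs typically keep some full-attention layers for expressivity and use layers like this one to bound memory.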
Fast Thinking via Reward Strategy
Post-training uses a Contextual Process Redundancy Suppression reward strategy that encourages shorter, more direct outputs. This "fast thinking" mechanism reduces reliance on verbose chains of thought, cutting token overhead while maintaining performance.
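The source does not describe how this reward is computed. As a rough illustration of the general idea only, the sketch below shows a generic length-penalized reward, which is one common way to discourage verbose outputs. The function name, the `token_budget` and `penalty_weight` parameters, and the formula itself are hypothetical and should not be read as inclusionAI's method.

```python
def length_penalized_reward(is_correct, num_tokens,
                            token_budget=512, penalty_weight=0.5):
    """Hypothetical reward: task score minus a capped verbosity penalty."""
    # Base reward from task success (assumed binary here).
    base = 1.0 if is_correct else 0.0
    # Tokens spent beyond the budget count as redundancy.
    overflow = max(0, num_tokens - token_budget)
    # Penalty grows linearly with overflow and is capped at penalty_weight.
    penalty = penalty_weight * min(1.0, overflow / token_budget)
    return base - penalty
```

Under these assumed defaults, a correct answer within the budget keeps the full reward, while an equally correct answer that runs well past the budget scores lower, so a policy trained on such a signal is pushed toward shorter responses.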
Benchmark SOTA
Ling-2.6-1T achieves open-source SOTA on execution-heavy benchmarks:
- AIME26 (reasoning)
- SWE-bench Verified (software engineering)
- BFCL-V4 (function calling)
- TAU2-Bench (task completion)
- IFBench (instruction following)
Agent Integration
The model is designed for end-to-end engineering workflows — from code generation to bug fixing — and integrates with mainstream agent frameworks including Claude Code, OpenClaw, OpenCode, and CodeBuddy. It handles multi-tool, multi-step constraints in enterprise environments.
📖 Read the full source: r/LocalLLaMA