Today we release LFM2.5-1.2B-Thinking, a reasoning model that runs entirely on-device. What needed a data center two years ago now runs on any phone with 900 MB of memory.
> Trained specifically for concise reasoning
> Generates internal thinking traces before producing answers
> Enables systematic problem-solving at edge-scale latency
> Shines on tool use, math, and instruction following
This is insane: a 10b (!) model outperforms 10x bigger models, and even the still very good Gemini 2.5 pro.
Those guys are cooking. China is delivering again every hour today!
A year ago, we verified a preview of an unreleased version of @OpenAI o3 (High) that scored 88% on ARC-AGI-1 at est. $4.5k/task
Today, we’ve verified a new GPT-5.2 Pro (X-High) SOTA score of 90.5% at $11.64/task
This represents a ~390X efficiency improvement in one year