๐ LightOnOCR-2-1B ๐ฆ is out, a major update to LightOnOCR.
1B parameters, end-to-end multilingual OCR, and it beats models 9ร larger on OlmOCR-Bench while being much faster.
PDF/page in, clean ordered Markdown out, with optional image localization (bbox variants).
Do you like the open-source models we keep shipping at @LightOnIO? ๐
Now you can actually *build* with them!!
We're launching LightOn Console ๐ฎ: three endpoints (Parse, Extract, Search) so you can run our models on your own documents without building the plumbing yourself!
๐งต
Today, we're introducing LightOn Console.
โ๏ธ Three endpoints:
/Parse any documents
/Extract structured data
/Search enterprise knowledge with citations
๐ Built-in connectors. MCP-ready. Governance enforced at the chunk level.
No infrastructure. No pipeline maintenance. No dedicated retrieval team required.
Make your enterprise knowledge agent-readable now!
Read the launch announcement: https://t.co/LcxXqyOgo5
Test it now: https://t.co/RNJQKEHzQ2
@skalskip92 if qwen3.5 122B-A10B is only 5 tests away from gemini 3,5 then the 397B-A17B should be even closer.
plus it completely destroys everyone on speed?
@RfK_001 FlexiViT did train with different patch sizes w/ weight sharing. But it was fixed resolution ViTs, i am not sure it's the same if you vary input resolution as they become very similar(either make the image bigger or make the patch smaller).
created a simple tool to help me in my daily workflow: hit a hotkey, talk, the transcript pastes itself into Claude Code/Codex/wherever I'm typing. fully local, runs on my mac.