Everyone is sleeping on this new OCR model!
- 85.9% (SOTA) on olmOCR-Bench
- Support for 90+ languages, with benchmarks
- 4B model (down from 9B)
- Full layout information
- Extracts + captions images and diagrams
- Strong handwriting, math, form, table support
100% open-source.
🚨 BREAKING: A new 33-page PDF demystifying how hedge funds create bias-free signals
This is what you need to know (Number 2 is the most important finding): 🧵