Ravi Theja @ravithejads - Twitter Profile

Pinned Tweet

almost 2 years ago

🔥 Releasing BRAG: High-Performance RAG Model Trained In $25 🚀 We’re thrilled to announce the launch of our RAG models, collectively known as BRAG—a series of SLMs (3 SLMs and 1 Ultra SLM) specifically trained for Retrieval Augmented Generation (RAG). 1️⃣ ⁠BRAG-Qwen2-7b-v0.1 2️⃣ ⁠BRAG-Llama-3.1-8b-v0.1 3️⃣ ⁠BRAG-Llama-3-8b-v0.1 4️⃣ ⁠BRAG-Qwen2-1.5b-v0.1 🌟 Our models outperform Cohere’s Command R+, Qwen2, Llama3.1, and Llama3 Instruct models and closely matched the performance of GPT-4-Turbo and Nvidia’s ChatQA-1.5-8B on ChatRAG-Bench. 💵 Each model was trained in less than $25. 🔍 More interesting details on datasets and training procedure in our release technical report: https://t.co/taUmXKQduS 🙌 A huge thank you to @HamelHusain, @dan_s_becker, and @charles_irl for the credits on @modal_labs via the LLM Fine-tuning course. This work is done in collaboration with @nlpguy_ under https://t.co/QgmPlZjZf3

ravithejads's tweet photo. 🔥 Releasing BRAG: High-Performance RAG Model Trained In $25

🚀 We’re thrilled to announce the launch of our RAG models, collectively known as BRAG—a series of SLMs (3 SLMs and 1 Ultra SLM) specifically trained for Retrieval Augmented Generation (RAG).

1️⃣ ⁠BRAG-Qwen2-7b-v0.1
2️⃣ ⁠BRAG-Llama-3.1-8b-v0.1
3️⃣ ⁠BRAG-Llama-3-8b-v0.1
4️⃣ ⁠BRAG-Qwen2-1.5b-v0.1

🌟 Our models outperform Cohere’s Command R+, Qwen2, Llama3.1, and Llama3 Instruct models and closely matched the performance of GPT-4-Turbo and Nvidia’s ChatQA-1.5-8B on ChatRAG-Bench.

💵 Each model was trained in less than $25.

🔍 More interesting details on datasets and training procedure in our release technical report: https://t.co/taUmXKQduS

🙌 A huge thank you to @HamelHusain, @dan_s_becker, and @charles_irl for the credits on @modal_labs via the LLM Fine-tuning course.

This work is done in collaboration with @nlpguy_ under https://t.co/QgmPlZjZf3

18

440

82

450

66K

ravithejads retweeted

Jerry Liu

@jerryjliu0

about 2 hours ago

We're presenting ParseBench at CVPR 2026! ParseBench is the most comprehensive document understanding benchmark for VLMs. ✅ It contains 2k pages of real-world enterprise documents ✅ It has comprehensive evaluation metrics around tables, charts, visual grounding, semantic formatting, and content faithfulness The core goal is measuring whether models can semantically interpret a document in the right way, without having models overfit to our precise benchmark. Parsing 100% of PDFs to 100% accuracy is the final boss for document OCR. In general, the latest frontier models have been tuned for coding, math, and scientific reasoning as opposed to precise visual understanding; hope more benchmarks that these will encourage overall progress towards solving this problem! Poster is below. If you want to learn more come check out our site or 30-page ArXiv paper: ParseBench: https://t.co/PWczfhp0OX ArXiv: https://t.co/2dEJIaBBkr

jerryjliu0's tweet photo. We're presenting ParseBench at CVPR 2026!

ParseBench is the most comprehensive document understanding benchmark for VLMs.
✅ It contains 2k pages of real-world enterprise documents
✅ It has comprehensive evaluation metrics around tables, charts, visual grounding, semantic formatting, and content faithfulness

The core goal is measuring whether models can semantically interpret a document in the right way, without having models overfit to our precise benchmark.

Parsing 100% of PDFs to 100% accuracy is the final boss for document OCR.
In general, the latest frontier models have been tuned for coding, math, and scientific reasoning as opposed to precise visual understanding; hope more benchmarks that these will encourage overall progress towards solving this problem!

Poster is below. If you want to learn more come check out our site or 30-page ArXiv paper:

ParseBench: https://t.co/PWczfhp0OX
ArXiv: https://t.co/2dEJIaBBkr

5

32

8

15

3K

ravithejads retweeted

Kunal Bhatia

@kunalbhatia91

7 days ago

Superintelligence will be built on Self Improvement. Today @hexoai, we’re excited to release ‘SIA’ - an open-source Self-Improving AI, to achieve any goal through recursive self improvement. While trying to solve a problem, SIA doesn't just improve it's abilities by updating it's harness, it updates it's own weights as well.

122

644

139

406

497K

Ravi Theja

@ravithejads

11 days ago

In another decade, this might be a robot. For now, Indian unit economics favor the human version :)

Khush Mahajan

@YesKhush_5

12 days ago

In Lajpat, you can now pay ₹149/hr for someone to carry your bags, wait in food queues, walk you to the metro, find you a place to sit, and even set up a foldable chair. interesting biz !!

YesKhush_5's tweet photo. In Lajpat, you can now pay ₹149/hr for someone to carry your bags, wait in food queues, walk you to the metro, find you a place to sit, and even set up a foldable chair.

interesting biz !! https://t.co/zS5lmi5E4w

632

6K

653

942

1M

1

2

0

781

Who to follow

Eden Marco

@EdenEmarco177

LLMs @google cloud | Best-seller @udemy Instructor | Opinions stated here are my own, not those of my company

Nous Research

@NousResearch

World-class open source AI https://t.co/vrD0aDJeto

Anil Chandra Naidu Matcha

@matchaman11

CTO @VadooAI ⚡️ Youtube : https://t.co/MRA3d9h6pv 🌐 AI apps : https://t.co/Qpxd3qhdOk

ravithejads retweeted

Pratyush Choudhury (PC)

@177pc

17 days ago

For a while, @aakrit & I've quietly done this at Activate: spend time with exceptional operators, CTOs & senior builders before there’s a deck, company, or fully formed idea. Now opening it up Mixture of Experts - a small private dinner for people at the edge of founding First edition tomorrow in Bengaluru Request below on the Luma to join or DM if someone exceptional should be in the room. https://t.co/E2mUP845Z2

6

93

4

30

19K

ravithejads retweeted

tokenbender

@tokenbender

28 days ago

Ever wondered if you could extract capabilities and behaviors from neural networks and reuse/update/route it as needed? We introduce low-rank circuit conditioning, a novel approach that preserves the model's output behavior while reshaping how an existing capability is represented. In the base model, standard compact recovery stalls at 29%. After conditioning, the same extraction pipeline reaches 91.33% autoregressive full-answer recovery from 5.05% of MLP channels. The evidence points to a possibility of extracting and using isolated capabilities saving cost, latency and high adaptability. Read our work to understand more - https://t.co/ufTg1cVN1M

28

391

66

287

42K

ravithejads retweeted

Pratyush Choudhury (PC)

@177pc

28 days ago

India’s AI future is being built by those who ship, not just study. Today I’m announcing Activate Fellows - a summer program for 15 of India’s (and the world’s) best student builders to work inside the country’s leading AI startups Only 15 spots. And 8 days left to apply. Details + link 🧵

177pc's tweet photo. India’s AI future is being built by those who ship, not just study.

Today I’m announcing Activate Fellows - a summer program for 15 of India’s (and the world’s) best student builders to work inside the country’s leading AI startups

Only 15 spots. And 8 days left to apply. Details + link 🧵

10

182

19

127

15K

Ravi Theja

@ravithejads

30 days ago

@GenAI_is_real Congratulations.

0

1

0

144

ravithejads retweeted

Kaggle @kaggle

about 1 month ago

ParseBench is now live on Kaggle Benchmarks! 🚀 Developed by @llama_index, this benchmark evaluates PDF-to-structured-data conversion, featuring ~2k human-verified pages from real enterprise docs across 5 capability dimensions. 🥇Gemini 3 Flash: 79.3% 🥈GPT 5.4: 72.9% 🥉Gemma 4 31B: 66.4%

5

118

20

48

16K

ravithejads retweeted

Jerry Liu

@jerryjliu0

about 2 months ago

We’re open sourcing the first document OCR benchmark for the agentic era, ParseBench. Document parsing is the foundation of every AI agent that works with real-world files. ParseBench is a benchmark that measures parsing quality specifically for agent knowledge work: ✅ It optimizes for semantic correctness (instead of exact similarity) ✅ It has the most comprehensive distribution of real-world enterprise documents It contains ~2,000 human-verified enterprise document pages with 167,000+ test rules across five dimensions that matter most: tables, charts, content faithfulness, semantic formatting, and visual grounding. We benchmarked 14 known document parsers on ParseBench, from frontier/OSS VLMs to specialized parsers to LlamaParse. Here are some of our findings: 💡 Increasing compute budget yields diminishing returns - Gemini/gpt-5-mini/haiku gain 3-5 points from minimal to high thinking, at 4x the cost. 💡 Charts are the most polarizing dimension for evaluation. Most specialized parsers score below 6%, while some VLM-based parsers do a bit better. 💡 VLMs are great at visual understanding but terrible at layout extraction. GPT-5-mini/haiku score below 10% on our visual grounding task, all specialized parsers do much better. 💡 No method crushes all 5 dimensions at once, but LlamaParse achieves the highest overall score at 84.9%, and is the leader in 4 out of the 5 dimensions. This is by far the deepest technical work that we’ve published as a company. I would encourage you to start with our blog and explore our links to Hugging Face to GitHub. All the details are in our full 35-page (!!) ArXiv whitepaper. 🌐: Blog: https://t.co/57OHkx0pQW 📄 Paper: https://t.co/Ho2oH2xEAM 💻 Code: https://t.co/6P7UxqOZYA 📊 Dataset: https://t.co/YguIXWm41j 🎥 YouTube: https://t.co/6Fh1Nsk9ei

31

521

80

549

107K

ravithejads retweeted

LlamaIndex 🦙

@llama_index

2 months ago

LlamaIndex is proud to be named to the 2026 Enterprise Tech 30, #3 in the Early Stage category. The ET30 is an annual list by @Wing_VC and Eric Newcomer, voted on by 90+ leading investors and corporate development leaders. It recognizes the private companies wi th the most potential to shape the future of enterprise technology. Thank you to Wing Venture Capital and Eric Newcomer, and congratulations to all the companies honored this year.

llama_index's tweet photo. LlamaIndex is proud to be named to the 2026 Enterprise Tech 30, #3 in the Early Stage category.

The ET30 is an annual list by @Wing_VC and Eric Newcomer, voted on by
90+ leading investors and corporate development leaders. It recognizes the private companies wi th the most potential to shape the future of enterprise technology.

Thank you to Wing Venture Capital and Eric Newcomer, and congratulations to all the companies honored this year.

3

24

7

1

6K

ravithejads retweeted

Boris Cherny

@bcherny

2 months ago

I wanted to share a bunch of my favorite hidden and under-utilized features in Claude Code. I'll focus on the ones I use the most. Here goes.

550

23K

3K

52K

4M

ravithejads retweeted

Mistral AI

@MistralAI

3 months ago

Mistral AI is headed to San Jose for @NVIDIA GTC! 🚀 We’re demoing our newest frontier models, sharing our vision for the future of enterprise AI, and unveiling some big news you won't want to miss. 📍 Visit us to see the latest innovations in action. 🗓️ Check out our sessions and book a meeting: link in 🧵

MistralAI's tweet photo. Mistral AI is headed to San Jose for @NVIDIA GTC! 🚀
We’re demoing our newest frontier models, sharing our vision for the future of enterprise AI, and unveiling some big news you won't want to miss.

📍 Visit us to see the latest innovations in action.
🗓️ Check out our sessions and book a meeting: link in 🧵

11

217

25

15

22K

ravithejads retweeted

Pratyush Kumar

@pratykumar

3 months ago

📢 Open-sourcing the Sarvam 30B and 105B models! Trained from scratch with all data, model research and inference optimisation done in-house, these models punch above their weight in most global benchmarks plus excel in Indian languages. Get the weights at Hugging Face and AIKosh. Thanks to the good folks at SGLang for day 0 support, vLLM support coming soon. Links, benchmark scores, examples, and more in our blog - https://t.co/DcCG3zlN8p

206

7K

1K

745K

ravithejads retweeted

Ramsri Goutham Golla

@ramsri_goutham

3 months ago

Navarasa in Deepmind's Gemmaverse 🚀 The work @ravithejads and I did in building Navarasa, an Indic instruction-finetuned model on top of Google's Gemma catering to 15 Indian languages, has been featured in Deepmind's Gemmaverse!

ramsri_goutham's tweet photo. Navarasa in Deepmind's Gemmaverse 🚀

The work @ravithejads and I did in building Navarasa, an Indic instruction-finetuned model on top of Google's Gemma catering to 15 Indian languages, has been featured in Deepmind's Gemmaverse! https://t.co/nDYsd8cuof

5

22

2

4

1K

ravithejads retweeted

Tanishq Kumar

@tanishqkumar07

3 months ago

I've been working on a new LLM inference algorithm. It's called Speculative Speculative Decoding (SSD) and it's up to 2x faster than the strongest inference engines in the world. Collab w/ @tri_dao @avnermay. Details in thread.

135

4K

453

3K

612K

Ravi Theja

@ravithejads

3 months ago

Looks solid and properly grounded in sources. Congratulations, team @SarvamAI.

Pratyush Kumar

@pratykumar

3 months ago

Drop 14a/14: Over the past few days, we’ve been putting our models, products, and research out there. The positive feedback has helped more people agree with our long held belief that #IndiaCan be a builder in this space. Today, we are rolling out Indus - a chat interface to experience Sarvam 105B. Here is what all you can do:

263

3K

664

230

499K

3

38

3

1

2K

ravithejads retweeted

Pratyush Kumar

@pratykumar

4 months ago

Drop 13/14: The 30B and 105B models, benchmarks, and HF links will all come. But today it is a drop about people. About how our team of just 15 folks gave it their all to do what many doubted as not doable - ie train usefully large, globally competitive models from scratch in India. This team of 15 has now firmly launched @sarvam into its second innings. Yes, we can! @_mohit_singla @anand_404 @kediaharshit9 @AashaySachdeva @sumanthd17 @ArpitDwivedi100 @HarveenChadha @rkal4 @sushil_khyalia @ManavSinghal157 @sohampetkar missing in the pictuere - @selfawareatom @AnnaUpreti Anand @MeghMakwan33973 Utkarsh

pratykumar's tweet photo. Drop 13/14: The 30B and 105B models, benchmarks, and HF links will all come. But today it is a drop about people. About how our team of just 15 folks gave it their all to do what many doubted as not doable - ie train usefully large, globally competitive models from scratch in India. This team of 15 has now firmly launched @sarvam into its second innings. Yes, we can!

@_mohit_singla
@anand_404
@kediaharshit9
@AashaySachdeva
@sumanthd17
@ArpitDwivedi100
@HarveenChadha
@rkal4
@sushil_khyalia
@ManavSinghal157
@sohampetkar
missing in the pictuere -
@selfawareatom
@AnnaUpreti
Anand
@MeghMakwan33973
Utkarsh

198

5K

725

315

310K

ravithejads retweeted

Pratyush Kumar

@pratykumar

4 months ago

Drop 12/14: Models, products, impact - today something different, very different. Launching Sarvam Kaze, our foray into getting our models into the your hands with our devices - designed and built here in India!

127

3K

555

363

349K

ravithejads retweeted

Pratyush Kumar

@pratykumar

4 months ago

Drop 9/14: Today we are introducing Sarvam Studio, our product to help creators go multilingual. One piece of content, every corner of India. With AI video dubbing, Studio generates high-fidelity dubs in 11 Indian languages. In an expert study, participants preferred Sarvam Studio for overall quality and production readiness. With agentic document translation, Studio excels in contextually translating long-form content across genres. Our evaluations demonstrate that readers strongly preferred the output from Studio across different genres.