Avinash U @ummavi - Twitter Profile

ummavi retweeted

2 months ago

India enters the open-weights AI race with its largest models pre-trained from scratch: Sarvam 105B and Sarvam 30B @SarvamAI's Sarvam 105B and Sarvam 30B score 18 and 12 on the Artificial Analysis Intelligence Index respectively. Announced at the India AI Impact Summit 2026 and open-sourced under Apache 2.0, both are Mixture-of-Experts models trained entirely in India using compute provided under the IndiaAI Mission (@OfficialINDIAai). Both support reasoning and non-reasoning modes. These are an improvement from Sarvam's previous model, Sarvam M (8 on Intelligence Index, 23.6B parameters), which was based on Mistral Small rather than pre-trained from scratch. Sarvam 105B has 106B total parameters with ~10B active per token and a 128K context window. Sarvam 30B has 32B total parameters with ~2.4B active per token and a 65K context window. Alongside the text models, Sarvam also announced Saaras v3 (Speech to Text) and Bulbul v3 (Text to Speech) with a focus on Indic languages. Key takeaways in reasoning mode: ➤ Sarvam 105B scores 18 on the Intelligence Index. Among ~100B-class open-weights reasoning models, it trails GLM-4.5-Air (23), INTELLECT-3 (22), Mistral Small 4 (27), and gpt-oss-120B (High, 33). All four peers also activate more parameters per token ➤ Sarvam 30B scores 12 on the Intelligence Index. Among ~30B-class open-weights reasoning models, it trails GLM-4.7-Flash (30), Nemotron Cascade 2 30B A3B (28), Qwen3 30B A3B 2507 (22), and Qwen3 32B (17). Sarvam 30B activates fewer parameters than these peers. ➤ Sarvam 105B's relative strength is in select agentic tasks. Its agentic index of 25 places it ahead of INTELLECT-3 (20) and GLM-4.5-Air (21) despite trailing both on overall intelligence. Its GDPval index of 773 also edges ahead of GLM-4.5-Air (665). Both new models are a large step up from Sarvam M (Reasoning), which scored 8 on the Intelligence Index. ➤ Compared to peers, both models score lower on TerminalBench Hard (Agentic Coding & Terminal Use) and AA-Omniscience. Sarvam 105B scored 1.5% and Sarvam 30B scored 2.3% on TerminalBench Hard, compared to GLM-4.5-Air (20.5%) and INTELLECT-3 (9.1%). The AA-Omniscience Index is -60 for Sarvam 105B and -72 for Sarvam 30B. Both models have high hallucination rates relative to their accuracy, and both attempt to answer far more questions rather than abstaining, which drives the negative scores. Key model details: ➤ Modality: Text input and output only. ➤ Context window: 128K tokens (Sarvam 105B) and 65K tokens (Sarvam 30B). ➤ Pricing: Currently free on Sarvam's first-party API. ➤ License: Apache 2.0. ➤ Availability: Sarvam's first-party API; weights available on @huggingface and AIKosh.

ArtificialAnlys's tweet photo. India enters the open-weights AI race with its largest models pre-trained from scratch: Sarvam 105B and Sarvam 30B

@SarvamAI's Sarvam 105B and Sarvam 30B score 18 and 12 on the Artificial Analysis Intelligence Index respectively. Announced at the India AI Impact Summit 2026 and open-sourced under Apache 2.0, both are Mixture-of-Experts models trained entirely in India using compute provided under the IndiaAI Mission (@OfficialINDIAai). Both support reasoning and non-reasoning modes.

These are an improvement from Sarvam's previous model, Sarvam M (8 on Intelligence Index, 23.6B parameters), which was based on Mistral Small rather than pre-trained from scratch. Sarvam 105B has 106B total parameters with ~10B active per token and a 128K context window. Sarvam 30B has 32B total parameters with ~2.4B active per token and a 65K context window. Alongside the text models, Sarvam also announced Saaras v3 (Speech to Text) and Bulbul v3 (Text to Speech) with a focus on Indic languages.

Key takeaways in reasoning mode:

➤ Sarvam 105B scores 18 on the Intelligence Index. Among ~100B-class open-weights reasoning models, it trails GLM-4.5-Air (23), INTELLECT-3 (22), Mistral Small 4 (27), and gpt-oss-120B (High, 33). All four peers also activate more parameters per token

➤ Sarvam 30B scores 12 on the Intelligence Index. Among ~30B-class open-weights reasoning models, it trails GLM-4.7-Flash (30), Nemotron Cascade 2 30B A3B (28), Qwen3 30B A3B 2507 (22), and Qwen3 32B (17). Sarvam 30B activates fewer parameters than these peers.

➤ Sarvam 105B's relative strength is in select agentic tasks. Its agentic index of 25 places it ahead of INTELLECT-3 (20) and GLM-4.5-Air (21) despite trailing both on overall intelligence. Its GDPval index of 773 also edges ahead of GLM-4.5-Air (665). Both new models are a large step up from Sarvam M (Reasoning), which scored 8 on the Intelligence Index.

➤ Compared to peers, both models score lower on TerminalBench Hard (Agentic Coding & Terminal Use) and AA-Omniscience. Sarvam 105B scored 1.5% and Sarvam 30B scored 2.3% on TerminalBench Hard, compared to GLM-4.5-Air (20.5%) and INTELLECT-3 (9.1%). The AA-Omniscience Index is -60 for Sarvam 105B and -72 for Sarvam 30B. Both models have high hallucination rates relative to their accuracy, and both attempt to answer far more questions rather than abstaining, which drives the negative scores.

Key model details:
➤ Modality: Text input and output only.
➤ Context window: 128K tokens (Sarvam 105B) and 65K tokens (Sarvam 30B).
➤ Pricing: Currently free on Sarvam's first-party API.
➤ License: Apache 2.0.
➤ Availability: Sarvam's first-party API; weights available on @huggingface and AIKosh.

12

379

39

43

26K

ummavi retweeted

JAPAN AI株式会社【広報】｜そのタスク「AI社員」に任せませんか？

@JAPAN_AI_pr

4 months ago

📢 プレスリリース国産AIエージェント基盤「JAPAN AI Code」、ソフトウェア開発ベンチマークSWE-bench Verifiedにおいて解決率80.2%を達成〜国内開発のAIエージェント技術として世界最高水準の性能を実証〜 https://t.co/joZGTkZco5

0

23

12

3

7K

ummavi retweeted

Jeffrey J. Hall 🇯🇵🇺🇸

@mrjeffu

9 months ago

A team of Japanese researchers led by Kojima Tomoki won the Ig Nobel prize for their research paper "Cows painted with zebra-like striping can avoid biting fly attack." They had a great sense of humor about how to act at the award ceremony.

3

433

77

59

95K

ummavi retweeted

PLaMo @PLaMoLLM

9 months ago

これまでPFNのtech blogで紹介してきたPLaMo 2に関するものをまとめてarXivに公開しました。多くの方に利用していただいているPLaMo翻訳も、このPLaMo 2ベースのモデルとなっています。ご興味のある方は是非ご覧ください。論文へのリンクはリプライ欄へ👇 #PLaMo #国産LLM

PLaMoLLM's tweet photo. これまでPFNのtech blogで紹介してきたPLaMo 2に関するものをまとめてarXivに公開しました。

多くの方に利用していただいているPLaMo翻訳も、このPLaMo 2ベースのモデルとなっています。

ご興味のある方は是非ご覧ください。

論文へのリンクはリプライ欄へ👇

#PLaMo #国産LLM https://t.co/qpZXs7mwNJ

1

211

58

84

27K

Who to follow

Anton Osokin

@aosokin_ml

Machine Learning at @IsomorphicLabs, Ex @bayesgroup, @inria_paris, @CS_HSE, @YandexResearch. Opinions are my own

Jun Yamada

@junjungoal

Postdoc @ Amazon Robotics / DPhil @ University of Oxford, A2I Lab (@a2i_oxford) | ex-intern@NVIDIA | Robot Learning, Manipulation

Funabashi / 船橋

@funabashihand

Lecturer at Waseda Univ., Japan/PM at FingerVision/ACT-I Acceleration/MIT ex-intern/DL, Tactile Sensing, Multi-Fingered Manipulation ヘッダーは某原塚先生の仕業

ummavi retweeted

mooopan @mooopan

about 1 year ago

My new preprint on RL is out: Experience Replay with Random Reshuffling https://t.co/9NOPA4IUJ5

1

20

5

2K

ummavi retweeted

PLaMo @PLaMoLLM

over 1 year ago

PLaMo™のフラッグシップモデルPLaMo PrimeのWeb APIを本日提供開始しました🧑‍💻 対話型AIアシスタントPLaMo Chatは期間限定で無料開放‼️β版からの進化点は👇 🔤 文章生成、要約、翻訳、テキスト分析性能を向上 💬 コンテキスト長を拡大 🚀 RAG (検索拡張生成) の精度向上ご利用登録はリプライから！

1

22

11

4

5K

ummavi retweeted

Kuniyuki Takahashi @kuniyuki_taka

over 1 year ago

本日の10:00-12:00のIROSのポスターセッションU(7-8で展示)で共著1件（主著：@ummavi）の発表を行います。 NeRFを用いた透明物体の深度情報の再構成に焦点を当てており、Visual Foundation Modelsでセグメンテーション情報を与えることで、性能を大幅に改善して、ロボット把持を実現しています！

0

11

2

1

1K

ummavi retweeted

Kuniyuki Takahashi @kuniyuki_taka

over 1 year ago

7.8でポスター発表 @ IROS2024をします。

0

6

1

664

ummavi retweeted

Kuniyuki Takahashi @kuniyuki_taka

almost 2 years ago

IROS2024に採択：ロボットの指先に4軸の受動的適応機構を搭載したグリッパー「FAAF Hand」を開発しました。位置ずれがあっても、正方形・三角形の棒や実験器具の蓋をスムーズに挿入可能。単純な制御でも高精度な作業ができる次世代ロボットハンド。 https://t.co/yKO7wWmRJB https://t.co/dp6XZsMvAr

0

38

9

6

5K

ummavi retweeted

Masaki Saito @rezoolab

almost 2 years ago

Preferred Networksを退職し、8/1から独立研究者（フリーランス）として活動いたします。様々な産業分野のお仕事を経験しつつ、研究活動を行なっていくのが理想です。業務委託・コンサル等のお問い合わせにつきましては、以下のURLからお気軽にご連絡ください: https://t.co/YknHFbpyw2

22

566

76

111

149K

ummavi retweeted

Bartłomiej Cupiał @CupiaBart

about 2 years ago

So here's a story of, by far, the weirdest bug I've encountered in my CS career. Along with @maciejwolczyk we've been training a neural network that learns how to play NetHack, an old roguelike game, that looks like in the screenshot. Recenlty, something unexpected happened.

CupiaBart's tweet photo. So here's a story of, by far, the weirdest bug I've encountered in my CS career.

Along with @maciejwolczyk we've been training a neural network that learns how to play NetHack, an old roguelike game, that looks like in the screenshot. Recenlty, something unexpected happened. https://t.co/AFTgRm1gtv

134

8K

1K

3K

2M

ummavi retweeted

Aravind @the_aravind

about 2 years ago

Can you know when your robot policy works without running it? "MORALS: Analysis of High-Dimensional Robot Controllers via Topological Tools in a Latent Space" (nominated for Best Automation Paper @ieee_ras_icra), does just that! Here is all you need to know 📷 (1/n)

the_aravind's tweet photo. Can you know when your robot policy works without running it?

"MORALS: Analysis of High-Dimensional Robot Controllers via Topological Tools in a Latent Space" (nominated for Best Automation Paper @ieee_ras_icra), does just that! Here is all you need to know 📷 (1/n) https://t.co/w7ubeR1bi8

1

23

6

24

3K

ummavi retweeted

Preferred Networks @PreferredNet

about 2 years ago

PFN's Robotics Research Team is looking for a Japan-based part-time engineer/researcher for R&D in object manipulation tasks using a robotic arm with several sensors. https://t.co/9RAtNv0Y6T

0

1

2K

ummavi retweeted

Kuniyuki Takahashi @kuniyuki_taka

about 2 years ago

NeRFを用いて透明物体の深度情報を補完する手法の論文を公開しました。通常、透明物体はNeRFでも難しいです。この提案手法では、Visual Foundation Models (VFMs)を使って、透明物体のセグメンテーションの情報を与えることで、解決しています。 https://t.co/DIItW9CwQ7 https://t.co/B7jSUYnJGe 1/n

1

24

3

12

4K

Avinash U @ummavi

about 2 years ago

We evaluate on the ClearPose dataset & perform robotic grasping expts. for 10 diverse objects and find remarkable performance across the board, even when placed on challenging unpatterned, glossy white tables w/ harsh lights Video: https://t.co/kl0TFzZxTs

0

53

Avinash U @ummavi

about 2 years ago

Introducing Segmentation-AIDed NeRF (SAID-NeRF) for depth completion of transparent objects. NeRFs can capture specular surface effects of transparent objs but struggle to recover underlying geometry. We exploit segmentation VFMs like SAM to overcome this https://t.co/AxxMjyv4ud

1

7

6

1

2K

Avinash U @ummavi

about 2 years ago

Idea: By jointly constructing a semantic field, we force estimated densities to be concentrated on the object resulting in more coherent depth. We use a heuristic to derive label-free masks & extensions to NeRF for fast, few-view estimations in complex settings

ummavi's tweet photo. Idea: By jointly constructing a semantic field, we force estimated densities to be concentrated on the object resulting in more coherent depth. We use a heuristic to derive label-free masks & extensions to NeRF for fast, few-view estimations in complex settings https://t.co/SZCWDZg92h

1

0

89

ummavi retweeted

Omega Crafter (オメガクラフター)

@omegacrafter_jp

about 2 years ago

🆕最新トレーラー公開🆕 『#OmegaCrafter / オメガクラフター』早期アクセス版リリース告知トレーラーが完成️🎉 ⏰リリース時刻：3月29日(金) 正午 🛍️販売価格：2,800円(税込) 【セール情報】リリース後1週間は10%OFF(2,520円)🔥 ウィッシュリスト登録はこちら👇 https://t.co/shJMkuzMfp