Rupesh Srivastava @rupspace - Twitter Profile

Pinned Tweet

6 months ago

Update: new gig, and I'm hiring! I recently joined the Institute of Foundation Models in the SF Bay Area! Our goal is to train large-scale FULLY open-source LLMs at and beyond the frontier, from scratch, with open science, open data and open checkpoints. We are hiring across the training stack. Further, I'm building a new team to advance open agentic LLMs, and hiring researchers/engineers on-site. Send me a DM or email if you are interested! I'll also be at #NeurIPS2025 in San Diego this week to talk to potential candidates for internships and FT positions.

rupspace's tweet photo. Update: new gig, and I'm hiring!
I recently joined the Institute of Foundation Models in the SF Bay Area! Our goal is to train large-scale FULLY open-source LLMs at and beyond the frontier, from scratch, with open science, open data and open checkpoints.

We are hiring across the training stack. Further, I'm building a new team to advance open agentic LLMs, and hiring researchers/engineers on-site. Send me a DM or email if you are interested! I'll also be at #NeurIPS2025 in San Diego this week to talk to potential candidates for internships and FT positions.

16

224

21

168

37K

Rupesh Srivastava @rupspace

1 day ago

Love it when Jürgen puts things in perspective! 🙂

Jürgen Schmidhuber

@SchmidhuberAI

1 day ago

Tera IPOs coming! $1T sounds like a lot. But $1T is just a 7-m-wide gold cube, thanks to massive inflation since 1971 when $ and gold decoupled. A little house full of gold. To put things in perspective: the 2017 neutron star merger GW170817 produced several earth masses of gold.

SchmidhuberAI's tweet photo. Tera IPOs coming! $1T sounds like a lot. But $1T is just a 7-m-wide gold cube, thanks to massive inflation since 1971 when $ and gold decoupled. A little house full of gold. To put things in perspective: the 2017 neutron star merger GW170817 produced several earth masses of gold. https://t.co/pfLbHerJ3P

10

153

11

22

26K

0

2

0

137

rupspace retweeted

Mingkai Deng

@mdeng34

13 days ago

Frontier LLMs are converging on efficient, adaptive reasoning. Opus 4.7 lets the model decide how deeply to reason. GPT-5.5 achieves strong results with fewer reasoning tokens. We study a related but more structural question: what 𝗸𝗶𝗻𝗱 𝗼𝗳 𝗿𝗲𝗮𝘀𝗼𝗻𝗶𝗻𝗴 should we adapt? Last year in SiRA (upper figure), we showed that simulative reasoning (System II), which uses a 𝘄𝗼𝗿𝗹𝗱 𝗺𝗼𝗱𝗲𝗹 to evaluate consequences of actions, yields up to 124% improvement over reactive baselines (System I), and that strong reasoning models (o1, o3-mini) fail as planners without this structure. In our new paper SR²AM (lower figure), we add a learned 𝗰𝗼𝗻𝗳𝗶𝗴𝘂𝗿𝗮𝘁𝗼𝗿 (System III) that self-regulates when to simulate, how far ahead, and when to skip planning entirely. Efficient reasoning is not just shorter reasoning: it is better allocation of simulation.

mdeng34's tweet photo. Frontier LLMs are converging on efficient, adaptive reasoning. Opus 4.7 lets the model decide how deeply to reason. GPT-5.5 achieves strong results with fewer reasoning tokens.

We study a related but more structural question: what 𝗸𝗶𝗻𝗱 𝗼𝗳 𝗿𝗲𝗮𝘀𝗼𝗻𝗶𝗻𝗴 should we adapt?

Last year in SiRA (upper figure), we showed that simulative reasoning (System II), which uses a 𝘄𝗼𝗿𝗹𝗱 𝗺𝗼𝗱𝗲𝗹 to evaluate consequences of actions, yields up to 124% improvement over reactive baselines (System I), and that strong reasoning models (o1, o3-mini) fail as planners without this structure.

In our new paper SR²AM (lower figure), we add a learned 𝗰𝗼𝗻𝗳𝗶𝗴𝘂𝗿𝗮𝘁𝗼𝗿 (System III) that self-regulates when to simulate, how far ahead, and when to skip planning entirely.

Efficient reasoning is not just shorter reasoning: it is better allocation of simulation.

4

278

47

273

61K

Rupesh Srivastava @rupspace

20 days ago

@tw_killian @BYU @BYUCS

0

1

0

21

Who to follow

Corey Lynch

@coreylynch

Director of AI at @figure_robot, building Helix 🧬

Ben Poole

@poolio

research scientist at google brain. phd in neural nonsense from stanford.

Brandon Amos

@brandondamos

🧙 RL @Reflection_AI past: @MetaAi @GoogleDeepmind @SCSatCMU @Cornell_Tech

Rupesh Srivastava @rupspace

20 days ago

@agarwl_ @BlackHC That idea came from von Malsburg. Hinton and Plaut even cited him in the paper for this, but his influence is sadly forgotten.

0

2

0

34

rupspace retweeted

Jeff Clune

@jeffclune

22 days ago

Thrilled to share that we founded Recursive to create AI that safely conducts experiments on how to improve itself in an open-ended process of endless, automated scientific discovery. As I wrote in my 2019 AI-generating algorithms paper, this will likely be the fastest path to superintelligence. Our work since has shown the power of this approach. Excited to scale up and improve upon ideas like the Darwin Gödel Machine, HyperAgents, ADAS, OMNI, ALMA, The AI Scientist, PromptBreeder, Rainbow Teaming, Automated Capability Discovery, and other work on open-ended and AI-generating algorithms. We’ve assembled a dream team of researchers and significant resources to pursue this vision. My amazing co-founders are pictured here, and we have an all-star team of founding members (we’re over 25 and growing). Please join us if you are interested! Follow our progress @Recursive_SI

jeffclune's tweet photo. Thrilled to share that we founded Recursive to create AI that safely conducts experiments on how to improve itself in an open-ended process of endless, automated scientific discovery. As I wrote in my 2019 AI-generating algorithms paper, this will likely be the fastest path to superintelligence. Our work since has shown the power of this approach. Excited to scale up and improve upon ideas like the Darwin Gödel Machine, HyperAgents, ADAS, OMNI, ALMA, The AI Scientist, PromptBreeder, Rainbow Teaming, Automated Capability Discovery, and other work on open-ended and AI-generating algorithms. We’ve assembled a dream team of researchers and significant resources to pursue this vision. My amazing co-founders are pictured here, and we have an all-star team of founding members (we’re over 25 and growing).

Please join us if you are interested! Follow our progress @Recursive_SI

50

613

44

170

117K

Rupesh Srivastava @rupspace

about 1 month ago

Did he just ... wow @fredagainagain1 thank you so much! https://t.co/uEcym5Yf6n

0

178

Rupesh Srivastava @rupspace

about 1 month ago

Yes!

Susan Zhang

@suchenzang

about 1 month ago

@charuman wasn't meant as sarcasm it's always nice to see a lab so confident/secure in their capabilities that they can openly publish all their struggles

1

45

0

3

3K

0

2

0

279

rupspace retweeted

Loren Lugosch @lorenlugosch

about 1 month ago

In this paper, we ask: 𝘏𝘰𝘸 𝘤𝘢𝘯 𝘸𝘦 𝘤𝘭𝘶𝘮𝘴𝘪𝘭𝘺 𝘳𝘦𝘧𝘰𝘳𝘮𝘶𝘭𝘢𝘵𝘦 𝘵𝘩𝘦 𝘤𝘢𝘱𝘢𝘣𝘪𝘭𝘪𝘵𝘺 𝘸𝘦 𝘪𝘮𝘱𝘭𝘦𝘮𝘦𝘯𝘵𝘦𝘥 𝘪𝘯 𝘵𝘩𝘦 𝘧𝘰𝘳𝘮 𝘰𝘧 𝘢 𝘲𝘶𝘦𝘴𝘵𝘪𝘰𝘯?

0

14

1

4

2K

Rupesh Srivastava @rupspace

about 2 months ago

@finbarrtimbers I think this is likely a difference of scale mainly. If there's enough filtered data to train on, then use that. If there's limited data, train on all.

0

1

0

226

rupspace retweeted

Shibo Hao

@Ber18791531

about 2 months ago

🍫 CocoaBench v1.0 is out! CocoaBench is a benchmark for unified digital agents, built around open-world tasks that require composing 💻 coding, 👀 vision, 🌐 search. Since our first research preview last December, we have expanded the benchmark substantially with community contributed tasks, and spent months testing and refining the tasks, evaluations, and agent runs. Some takeaways: • Even the best agent system reaches only 45.1% on CocoaBench v1.0. • Coding agents like Codex are already surprisingly strong on general tasks beyond software engineering. • Stronger agents tend to push more of the work into code. • Open source models still lag behind leading frontier models on these general tasks. 👇More on the website and in the paper #AI #Agents #LLM #Benchmark #CocoaBench

2

79

34

19

12K

rupspace retweeted

Institute of Foundation Models

@IFM_MBZUAI

about 2 months ago

A visually convincing rollout is not the same thing as a useful world model. WR-Arena is built to test the harder question: can a model simulate futures well enough to support action, planning, and reasoning? That’s the shift from simple next-state prediction to realistic world simulation grounded in real-world utility. Paper + code are live. https://t.co/x4zQfpHzKt https://t.co/FVvnKQpCdd #AI #WorldModels #Benchmarking #EmbodiedIntelligence #PhysicalAI #MachineLearning

0

46

10

41

5K

Rupesh Srivastava @rupspace

2 months ago

@Grad62304977 @kalomaze All networks are mixtures of experts, just gated at unit level :) https://t.co/RJ0KMZi1Gv

0

2

0

73

rupspace retweeted

Alex Shaw

@alexgshaw

2 months ago

The Harbor registry is getting an upgrade. Now, anyone can publish to the registry to make their dataset available to every Harbor user:

alexgshaw's tweet photo. The Harbor registry is getting an upgrade.

Now, anyone can publish to the registry to make their dataset available to every Harbor user: https://t.co/N5PM6m34Wj

4

38

5

4

5K

rupspace retweeted

Institute of Foundation Models

@IFM_MBZUAI

2 months ago

Back in beautiful New Haven this weekend for YHack. We’ll be there with K2 Think V2, a fully open-source reasoning system. Hackers! Dig into how it works: https://t.co/xALonGPL6n

IFM_MBZUAI's tweet photo. Back in beautiful New Haven this weekend for YHack.

We’ll be there with K2 Think V2, a fully open-source reasoning system.

Hackers! Dig into how it works: https://t.co/xALonGPL6n https://t.co/qNitkTX82p

0

7

3

0

601

rupspace retweeted

Lucas Beyer (bl16)

@giffmana

3 months ago

Yes and no. Very often it turns out that what you think solves the problem is not what actually solves it, and this you only find out by not moving on, but making sure you have experiments that back up the *exact* statement you make removing all reasonable confounders. And that, you get from one of: - public review - extremely strict colleagues - insane self discipline

1

162

6

21

9K

rupspace retweeted

Seungwook Han

@seungwookh

3 months ago

Can language models learn useful priors without ever seeing language? We pre-pre-train transformers on neural cellular automata — fully synthetic, zero language. This improves language modeling by up to 6%, speeds up convergence by 40%, and strengthens downstream reasoning. Surprisingly, it even beats pre-pre-training on natural text! Blog: https://t.co/Pni0RsIcxL (1/n)

seungwookh's tweet photo. Can language models learn useful priors without ever seeing language?

We pre-pre-train transformers on neural cellular automata — fully synthetic, zero language. This improves language modeling by up to 6%, speeds up convergence by 40%, and strengthens downstream reasoning.

Surprisingly, it even beats pre-pre-training on natural text!

Blog: https://t.co/Pni0RsIcxL

(1/n)

47

2K

259

1K

254K

rupspace retweeted

Subham Sahoo

@ssahoo_

3 months ago

📢@CVPR 2026: first-ever tutorial dedicated to DISCRETE DIFFUSION 🔥 Part I: Consistency Models + Flow Maps - @JCJesseLai Part II: Discrete Diffusion - by me. ✨Few-step gen + inference-time scaling + live demos Co-orgs: @StefanoErmon @DrYangSong @mittu1204 @gimdong58085414 Full schedule + details👇 (1/3)

ssahoo_'s tweet photo. 📢@CVPR 2026: first-ever tutorial dedicated to DISCRETE DIFFUSION 🔥

Part I: Consistency Models + Flow Maps - @JCJesseLai
Part II: Discrete Diffusion - by me.

✨Few-step gen + inference-time scaling + live demos

Co-orgs: @StefanoErmon @DrYangSong @mittu1204 @gimdong58085414

Full schedule + details👇
(1/3)

5

324

41

198

21K

Rupesh Srivastava @rupspace

3 months ago

@kalomaze @teortaxesTex @kzkirie 👀

0

1

0

33

Rupesh Srivastava @rupspace

3 months ago

@eliebakouch Congrats on a great run!

0

1

0

57

Rupesh Srivastava

@rupspace

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users