Sapient Intelligence

@Sapient_Int

Building efficient & powerful general intelligence through brain-inspired architecture

Palo Alto, CA

Joined July 2024

2 Following

5.3K Followers

32 Posts

Pinned Tweet

Sapient Intelligence @Sapient_Int

17 days ago

Introducing HRM-Text. An ultra-lean 1B-parameter reasoning language model designed to deliver strong general performance with a fraction of the data, compute, and infrastructure. Trained on just 40B structured tokens, HRM-Text achieves competitive performance while using ~1/1000 of the training data of comparable models. The kicker? The full model trains in roughly one day on a $1,000 budget. This opens the door to a new generation of AI that is powerful, accessible, and radically easier to adapt. Theories and research concepts once deemed too expensive to test are officially back in the game. Sapient Intelligence invites you to help us shape a new paradigm for general intelligence.

160

3K

269

2K

505K
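
The figures quoted in the pinned post can be cross-checked with simple arithmetic. The sketch below only rearranges the numbers stated in the post (1B parameters, 40B tokens, ~1/1000 of the data, ~1 day, $1,000); the function and key names are hypothetical, and the derived values are rough implications rather than anything Sapient has published.

```python
# Back-of-the-envelope figures implied by the announcement (approximate, not official).
PARAMS = 1_000_000_000          # 1B parameters (from the post)
TRAIN_TOKENS = 40_000_000_000   # 40B training tokens (from the post)
DATA_RATIO = 1000               # comparable models use ~1000x the data (from the post)
BUDGET_USD = 1_000              # "$1,000 budget" (from the post)
TRAIN_HOURS = 24                # "roughly one day", assumed here to mean 24 hours


def implied_figures():
    """Return quantities that follow directly from the quoted numbers."""
    return {
        # Training tokens seen per model parameter.
        "tokens_per_parameter": TRAIN_TOKENS / PARAMS,
        # Data volume implied for the "comparable models" in the post.
        "comparable_model_tokens": TRAIN_TOKENS * DATA_RATIO,
        # Average spend per hour of training.
        "usd_per_hour": BUDGET_USD / TRAIN_HOURS,
        # Cost per billion training tokens.
        "usd_per_billion_tokens": BUDGET_USD / (TRAIN_TOKENS / 1_000_000_000),
    }


if __name__ == "__main__":
    for name, value in implied_figures().items():
        print(f"{name}: {value:,.2f}")
```

Read this way, the claim implies a comparison against models trained on roughly 40 trillion tokens, at an average spend of about $42 per hour.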

Sapient Intelligence @Sapient_Int

9 days ago

Excited to see our community building with HRM-Text! Can’t wait to see where you take it next.🚀 Whether you’re reproducing the benchmarks, testing it out in your own field, or building something entirely new, we’d love to hear about it. Drop your ideas, questions, and experiments below, and tag @Sapient_Int so we can follow along and cheer you on!🎉

7

41

3

7

3K

Sapient Intelligence @Sapient_Int

14 days ago

Paper: https://t.co/MgnPJVldX2 GitHub: https://t.co/GKR8vFJZND Hugging Face: https://t.co/X7DW812tq2

0

4

1

0

535

Sapient Intelligence @Sapient_Int

16 days ago

In this benchmark deep-dive, Sapient’s founders William and Guan are joined by research team members Changling and Yasin to unpack HRM-Text’s performance across MATH, DROP, ARC-Challenge, and MMLU. 📊 Beyond the scores, they discuss what each benchmark measures, how HRM-Text compares with larger models, and why efficiency matters. Watch the full discussion to learn more about HRM-Text and Sapient’s leaner path toward general intelligence.

59

250

21

121

239K

Sapient Intelligence @Sapient_Int

14 days ago

Paper: https://t.co/MgnPJVldX2

0

3

0

0

724

Sapient_Int retweeted makingAGI

15 days ago

The HRM-Text paper is now available 🎉 HRM-Text explores a different approach to language model pretraining: hierarchical recurrent computation, task-completion training, and latent-space reasoning. At just 1B parameters, HRM-Text achieves competitive performance with dramatically lower training cost and data requirements.

1B parameters
40B unique tokens
~1 day of pretraining
~$1000 training cost

25

756

104

603

87K

Sapient Intelligence @Sapient_Int

17 days ago

Download HRM-Text🔗 GitHub: https://t.co/GKR8vFJZND Hugging Face: https://t.co/X7DW812tq2

7

52

7

28

3K

Sapient Intelligence @Sapient_Int

17 days ago

HRM-Text 101 is here. This tutorial takes you from zero to one: from setup to fine-tuning to evaluation. Download the base checkpoint. Fine-tune it on a real task. Evaluate the results. End to end, on a single GPU. Watch the tutorial and start building with HRM-Text.

37

492

46

298

187K

Sapient Intelligence @Sapient_Int

17 days ago

Download HRM-Text 🔗 GitHub: https://t.co/GKR8vFJZND Hugging Face: https://t.co/X7DW812tq2

12

270

20

258

35K

Sapient Intelligence @Sapient_Int

18 days ago

Tomorrow, we will unveil a new path to general intelligence. Lean. Powerful. Efficient. The countdown is on⏳

39

595

28

218

45K

Sapient Intelligence @Sapient_Int

20 days ago

It is time to liberate reasoning from language! HRM (Hierarchical Reasoning Model) takes a simple idea from the brain: separating reasoning (thinking) from language. When we think, our brains process information in high-dimensional, abstract streams--deep, instantaneous, and unbounded. Only after we formulate an idea do we compress it into concrete, low-dimensional language for communication. Current LLMs do most of their "thinking" in the token space via Chain-of-Thought. The results are fascinating, but structurally shallow and highly resource-intensive. HRM changes this. By reasoning natively in a dedicated latent space, it unlocks a massive internal "scratchpad." It thinks deeper and is unconstrained by tokens, only translating to language when the thought is fully formed. Deeper reasoning. Way fewer tokens.

2

32

2

8

3K

Sapient Intelligence @Sapient_Int

22 days ago

Is bigger always better in AI? 🧠 We've reached incredible SOTAs by brute-forcing scale with astronomical token counts. But consider the human brain: running on just 20W of power and trained on ~1B language tokens, it's still making miracles happen. That efficiency is inspiring. There are smarter paths toward smarter models, and much smarter ways to scale, and this will be the next breakthrough.

1

16

0

1

2K

Sapient Intelligence @Sapient_Int

about 1 month ago

At Sapient Intelligence, we enable deep, efficient reasoning with our Hierarchical Reasoning Model (HRM)—a brain-inspired, latent-space architecture that moves beyond traditional, data-heavy AI. By decoupling the cognitive load, HRM uses a Slower Controller to guide abstract, deliberate reasoning and a Faster Processor to handle detailed computations. This dual-stream design allows systems to reason, plan, and converge on solutions within latent space.

1

44

4

27

3K
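
The Slower Controller / Faster Processor split described in the post above can be pictured as two nested loops running at different rates. The sketch below is a toy illustration only, not Sapient's implementation: the function name, the scalar states (standing in for latent vectors), and the update coefficients are all assumptions chosen for readability.

```python
import math


def hierarchical_steps(x, slow_cycles=3, fast_steps=4):
    """Toy two-timescale recurrence.

    The fast state updates several times inside each cycle; the slow state
    updates once per cycle, after the fast loop has finished.
    Scalars stand in for latent vectors; coefficients are arbitrary.
    """
    slow, fast = 0.0, 0.0
    trace = []  # records which module ran, and in which cycle
    for cycle in range(slow_cycles):
        for _ in range(fast_steps):
            # Fast processor: refines detail, conditioned on the input
            # and on the current (fixed) slow state.
            fast = math.tanh(0.5 * fast + 0.3 * slow + x)
            trace.append(("fast", cycle))
        # Slow controller: one update per cycle, driven by the fast result.
        slow = math.tanh(0.5 * slow + 0.7 * fast)
        trace.append(("slow", cycle))
    return slow, trace


if __name__ == "__main__":
    final_state, trace = hierarchical_steps(0.2)
    print(f"final slow state: {final_state:.4f}")
    print(f"updates performed: {len(trace)}")
```

With the default settings the fast state is updated twelve times and the slow state three times, and nothing is decoded to text along the way, which is the sense in which the reasoning stays in latent space.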

Sapient Intelligence @Sapient_Int

about 1 month ago

Behind the code, there is a specific kind of expertise. We are a team of researchers and engineers rooted in the labs of Tsinghua University, University of Cambridge, University of Alberta, Carnegie Mellon University, and Peking University—with experience at DeepMind, DeepSeek, xAI, and more. We've seen the limits of the current AI architectures firsthand from within the organizations that scaled them. Now, across three countries, we are building an alternative. We aren't just shipping another wrapper; we are shipping a new fundamental architecture.

11

358

34

147

48K

Sapient Intelligence @Sapient_Int

4 months ago

We were honored to support the global AI community as a Gold Sponsor of the #AAAI26 Conference on Artificial Intelligence. It was truly inspiring to connect with so many brilliant minds across the industry. The future of AGI isn’t just being imagined; it is being built.

0

19

1

1

3K

Sapient Intelligence @Sapient_Int

4 months ago

Our Staff Research Scientist and Tech Lead, Yasin Abbasi Yadkori, will be giving a presentation in HALL 4 at 11am. Come join us in discussing the path to AGI 👏 #AAAI2026 #SINGAPOREEXPO #sapientintelligence #HRM

0

14

1

0

2K

Sapient Intelligence @Sapient_Int

4 months ago

Join us at Booth A17, HALL 2 at the AAAI conference from Jan 22-Jan 25🔥 See HRM reason live! #AAAI2026 #SINGAPOREEXPO #sapientintelligence #HRM

1

16

0

1

2K

Sapient Intelligence @Sapient_Int

7 months ago

Thank you @Bloomberg for featuring us! We are guided by the belief that brain-inspired reasoning is the road to AGI, and we continue to advance this vision with unwavering determination🚀 https://t.co/RFZLdHNy0q

3

18

5

3

2K

Sapient Intelligence @Sapient_Int

7 months ago

Proud to share that TRM, derived from our HRM model, is highlighted in Nature! 🎉🎉🎉 This marks an important step forward for HRM-based reasoning systems, demonstrating the strength of small, structured models in complex reasoning tasks.💡

2

12

1

2

1K
