WeiboLLM @WeiboLLM - Twitter Profile

WeiboLLM @WeiboLLM

about 13 hours ago

WeiboLLM's tweet photo. https://t.co/I7ryj2pgdQ

0

9

0

1

3K

WeiboLLM @WeiboLLM

about 13 hours ago

⭐ VibeThinker-3B is released — a dense 3B model for frontier-level verifiable reasoning. 🚀 Reasoning: 94.3 on AIME’26, 76.4 on IMO-AnsBench, and 80.2 Pass@1 on LCB v6; with CLR, AIME‘26 improves to 97.1 and IMO-AnsBench to 80.6. 💻 OOD Coding: On recent unseen LeetCode weekly contests, VibeThinker-3B passes 123/128 (96.1%) first-attempt Python submissions. ⚡ Efficiency: Only 3B parameters, yet reaching the performance range of much larger top-tier reasoning models. 🧠 Perspective: Small models are not just cheaper substitutes. In parameter-dense domains with clear verification signals, SLMs offer a path to frontier-level reasoning that complements traditional Scaling Law. Model : https://t.co/94A14zpqCV Github: https://t.co/32so5P6C7L Paper: https://t.co/UDd264RsZb #AI #LLM #Reasoning #OpenSource #SmallModel

WeiboLLM's tweet photo. ⭐ VibeThinker-3B is released — a dense 3B model for frontier-level verifiable reasoning.

🚀 Reasoning: 94.3 on AIME’26, 76.4 on IMO-AnsBench, and 80.2 Pass@1 on LCB v6; with CLR, AIME‘26 improves to 97.1 and IMO-AnsBench to 80.6.

💻 OOD Coding: On recent unseen LeetCode weekly contests, VibeThinker-3B passes 123/128 (96.1%) first-attempt Python submissions.

⚡ Efficiency: Only 3B parameters, yet reaching the performance range of much larger top-tier reasoning models.

🧠 Perspective: Small models are not just cheaper substitutes. In parameter-dense domains with clear verification signals, SLMs offer a path to frontier-level reasoning that complements traditional Scaling Law.

Model : https://t.co/94A14zpqCV
Github: https://t.co/32so5P6C7L
Paper: https://t.co/UDd264RsZb

#AI #LLM #Reasoning #OpenSource #SmallModel

37

838

103

627

57K

WeiboLLM @WeiboLLM

about 13 hours ago

WeiboLLM's tweet photo. https://t.co/kGaXwL0nWm

0

10

0

1

3K

WeiboLLM @WeiboLLM

about 13 hours ago

WeiboLLM's tweet photo. https://t.co/8lfHYyUnZW

0

12

0

2

3K

WeiboLLM @WeiboLLM

7 months ago

@FGuzmanAI And with only ~1.5GB of RAM for such high-performance reasoning—simply incredible!

0

4

0

60

WeiboLLM @WeiboLLM

7 months ago

@FGuzmanAI Thrilled to see the model running on iPhone with 4-bit quantization and MLX! The community has been waiting for this. Fantastic work! 🔥

1

5

0

435

WeiboLLM @WeiboLLM

7 months ago

@MaziyarPanahi Thanks for the shout-out! It's great to see your work on quantizing and running VibeThinker-1.5B so smoothly on device. Massive respect!

1

8

0

1

635

WeiboLLM @WeiboLLM

7 months ago

VibeThinker-1.5B hit #1 on @huggingface ’s trending models today! 🔥 Huge thank you to our amazing community for the love, downloads, and priceless feedback.❤️

WeiboLLM's tweet photo. VibeThinker-1.5B hit #1 on @huggingface ’s trending models today! 🔥
Huge thank you to our amazing community for the love, downloads, and priceless feedback.❤️ https://t.co/ivXBYjwDlJ

1

19

2

5

997

WeiboLLM @WeiboLLM

7 months ago

@hrdkbhatnagar @ahochlehnert @vishaal_urao @ameyaprabhu Thank you! Independent evaluations like this are really valuable and help make open-source better for everyone. We truly appreciate your support!

0

5

0

130

WeiboLLM @WeiboLLM

7 months ago

@ahochlehnert Thank you so much! Independent evaluations like this make the open-source world better for everyone. Grateful for the love and support!

0

2

0

1

63

WeiboLLM @WeiboLLM

7 months ago

@MaziyarPanahi @lmstudio Thanks for the support and all the recommendations! Glad we could help. We’ll keep improving and would love to hear your thoughts anytime — together we go further!

1

0

325

WeiboLLM @WeiboLLM

7 months ago

I strongly agree with your perspective. We recently open-sourced a 1.5B small model, which performs well on competition-level math and code problems. On benchmarks like AIME and HMMT, it even surpasses deepseekr1-0120, and its cost is less than $8,000. We are looking forward to your thoughts on this model. https://t.co/gXedxAgukO

WeiboLLM @WeiboLLM

7 months ago

⭐ VibeThinker-1.5B — SOTA reasoning in a tiny model. 🚀 Performance: Highly competitive on AIME24/25 & HMMT25 — surpasses DeepSeek R1-0120 on math, and outperforms same-size models in competitive coding. ⚡ Efficiency: Only 1.5B params — 100-600× smaller than giants like Kimi K2 & DeepSeek R1. 💰 Cost: Full post-training for just $7.8K — 30-60× cheaper than DeepSeek R1 or MiniMax-M1. 🧠 Innovation: Powered by our Spectrum-to-Signal Principle (SSP) and MGPO framework. Model : https://t.co/G2aSB9MInX Github: https://t.co/32so5P64id Arxiv : https://t.co/GsN3ya0QX9 #AI #LLM #Reasoning #OpenSource #SmallModel

WeiboLLM's tweet photo. ⭐ VibeThinker-1.5B — SOTA reasoning in a tiny model.
🚀 Performance: Highly competitive on AIME24/25 & HMMT25 — surpasses DeepSeek R1-0120 on math, and outperforms same-size models in competitive coding.
⚡ Efficiency: Only 1.5B params — 100-600× smaller than giants like Kimi K2 & DeepSeek R1.
💰 Cost: Full post-training for just $7.8K — 30-60× cheaper than DeepSeek R1 or MiniMax-M1.
🧠 Innovation: Powered by our Spectrum-to-Signal Principle (SSP) and MGPO framework.
Model : https://t.co/G2aSB9MInX
Github: https://t.co/32so5P64id
Arxiv : https://t.co/GsN3ya0QX9
#AI #LLM #Reasoning #OpenSource #SmallModel

28

382

58

194

111K

0

2

0

40

WeiboLLM @WeiboLLM

7 months ago

@gm8xx8 Curious to get your thoughts on our new 1.5B model, VibeThinker. We're seeing it challenge scaling laws: it outperforms a 671B model on AIME math, and was trained for only $7.8k using our "Spectrum-to-Signal Principle." It's open-source, details below: https://t.co/gXedxAgukO

WeiboLLM @WeiboLLM

7 months ago

⭐ VibeThinker-1.5B — SOTA reasoning in a tiny model. 🚀 Performance: Highly competitive on AIME24/25 & HMMT25 — surpasses DeepSeek R1-0120 on math, and outperforms same-size models in competitive coding. ⚡ Efficiency: Only 1.5B params — 100-600× smaller than giants like Kimi K2 & DeepSeek R1. 💰 Cost: Full post-training for just $7.8K — 30-60× cheaper than DeepSeek R1 or MiniMax-M1. 🧠 Innovation: Powered by our Spectrum-to-Signal Principle (SSP) and MGPO framework. Model : https://t.co/G2aSB9MInX Github: https://t.co/32so5P64id Arxiv : https://t.co/GsN3ya0QX9 #AI #LLM #Reasoning #OpenSource #SmallModel

28

382

58

194

111K

0

2

0

131

WeiboLLM @WeiboLLM

7 months ago

@rasbt @rasbt Agree, Kimi K2 thinking is a massive leap for open weights! 🔥 But what if I told you a 1.5B model can beat a 671B giant on Olympiad-level problems like AIME and HMMT? We just open-sourced VibeThinker-1.5B, details below: https://t.co/HLO4Uef9JF

WeiboLLM @WeiboLLM

7 months ago

⭐ VibeThinker-1.5B — SOTA reasoning in a tiny model. 🚀 Performance: Highly competitive on AIME24/25 & HMMT25 — surpasses DeepSeek R1-0120 on math, and outperforms same-size models in competitive coding. ⚡ Efficiency: Only 1.5B params — 100-600× smaller than giants like Kimi K2 & DeepSeek R1. 💰 Cost: Full post-training for just $7.8K — 30-60× cheaper than DeepSeek R1 or MiniMax-M1. 🧠 Innovation: Powered by our Spectrum-to-Signal Principle (SSP) and MGPO framework. Model : https://t.co/G2aSB9MInX Github: https://t.co/32so5P64id Arxiv : https://t.co/GsN3ya0QX9 #AI #LLM #Reasoning #OpenSource #SmallModel

28

382

58

194

111K

1

0

40

WeiboLLM @WeiboLLM

7 months ago

@reach_vb Curious to get your thoughts on our new 1.5B model, VibeThinker. We're seeing it challenge scaling laws: it outperforms a 671B model on AIME math, and was trained for only $7.8k using our "Spectrum-to-Signal Principle." It's open-source, details below: https://t.co/gXedxAgukO

WeiboLLM @WeiboLLM

7 months ago

⭐ VibeThinker-1.5B — SOTA reasoning in a tiny model. 🚀 Performance: Highly competitive on AIME24/25 & HMMT25 — surpasses DeepSeek R1-0120 on math, and outperforms same-size models in competitive coding. ⚡ Efficiency: Only 1.5B params — 100-600× smaller than giants like Kimi K2 & DeepSeek R1. 💰 Cost: Full post-training for just $7.8K — 30-60× cheaper than DeepSeek R1 or MiniMax-M1. 🧠 Innovation: Powered by our Spectrum-to-Signal Principle (SSP) and MGPO framework. Model : https://t.co/G2aSB9MInX Github: https://t.co/32so5P64id Arxiv : https://t.co/GsN3ya0QX9 #AI #LLM #Reasoning #OpenSource #SmallModel

28

382

58

194

111K

0

1

0

89

WeiboLLM @WeiboLLM

7 months ago

@_akhaliq Curious to get your thoughts on our new 1.5B model, VibeThinker. We're seeing it challenge scaling laws: it outperforms a 671B model on AIME math, and was trained for only $7.8k using our "Spectrum-to-Signal Principle." It's open-source, details below: https://t.co/gXedxAgukO

WeiboLLM @WeiboLLM

7 months ago

⭐ VibeThinker-1.5B — SOTA reasoning in a tiny model. 🚀 Performance: Highly competitive on AIME24/25 & HMMT25 — surpasses DeepSeek R1-0120 on math, and outperforms same-size models in competitive coding. ⚡ Efficiency: Only 1.5B params — 100-600× smaller than giants like Kimi K2 & DeepSeek R1. 💰 Cost: Full post-training for just $7.8K — 30-60× cheaper than DeepSeek R1 or MiniMax-M1. 🧠 Innovation: Powered by our Spectrum-to-Signal Principle (SSP) and MGPO framework. Model : https://t.co/G2aSB9MInX Github: https://t.co/32so5P64id Arxiv : https://t.co/GsN3ya0QX9 #AI #LLM #Reasoning #OpenSource #SmallModel

28

382

58

194

111K

0

1

0

21

WeiboLLM @WeiboLLM

7 months ago

1st, we encourage it to explore manypossible answers (Spectrum Phase). Then, we teach it to identify & amplify the bestones (Signal Phase) This "explore then focus" method is key to its strong reasoning.

WeiboLLM's tweet photo. 1st, we encourage it to explore manypossible answers (Spectrum Phase). Then, we teach it to identify & amplify the bestones (Signal Phase)
This "explore then focus" method is key to its strong reasoning. https://t.co/LKEsksQS3f

2

22

0

2

3K

WeiboLLM @WeiboLLM

7 months ago

⭐ VibeThinker-1.5B — SOTA reasoning in a tiny model. 🚀 Performance: Highly competitive on AIME24/25 & HMMT25 — surpasses DeepSeek R1-0120 on math, and outperforms same-size models in competitive coding. ⚡ Efficiency: Only 1.5B params — 100-600× smaller than giants like Kimi K2 & DeepSeek R1. 💰 Cost: Full post-training for just $7.8K — 30-60× cheaper than DeepSeek R1 or MiniMax-M1. 🧠 Innovation: Powered by our Spectrum-to-Signal Principle (SSP) and MGPO framework. Model : https://t.co/G2aSB9MInX Github: https://t.co/32so5P64id Arxiv : https://t.co/GsN3ya0QX9 #AI #LLM #Reasoning #OpenSource #SmallModel

28

382

58

194

111K

WeiboLLM @WeiboLLM

7 months ago

more benchmark results and comparisons with other models

1

21

0

4

4K

WeiboLLM

@WeiboLLM

Last Seen Users on Sotwe

Trends for you

Most Popular Users