Saurabh Singh @saurasingh - Twitter Profile

Pinned Tweet

23 days ago

Incredibly proud of the team’s work on this. Recursive self-improvement and automated optimization at inference time yields SOTA on LCB Pro—zero fine-tuning required.

Poetiq

@poetiq_ai

23 days ago

Poetiq's Meta-System built its own coding harness from scratch. It got SOTA on LiveCodeBench Pro. No fine-tuning, no special model access. Just standard APIs. Using Gemini 3.1 Pro, it made a harness that beat all frontier models we tested.

poetiq_ai's tweet photo. Poetiq's Meta-System built its own coding harness from scratch. It got SOTA on LiveCodeBench Pro.

No fine-tuning, no special model access. Just standard APIs. Using Gemini 3.1 Pro, it made a harness that beat all frontier models we tested. https://t.co/v575oUYJeH

43

551

58

237

2M

1

3

0

1

325

Saurabh Singh

@saurasingh

23 days ago

What was the penalty for the same offense if LLM was not involved? Was it the same? Or are these new kinds of infractions that did not happen before?

Thomas G. Dietterich @tdietterich

23 days ago

Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/

140

7K

925

1K

1M

0

86

Saurabh Singh

@saurasingh

3 months ago

This is exactly the kind of paper I desk reject from my reading list. Reliance on too much jargon from other remote fields, makes it only harder for the readers and not worth the effort. Often, terminology from within the field is sufficient.

Dr. Datta M.D. (Radiology) M.B.B.S. 🇮🇳

@DrDatta_AIIMS

4 months ago

How do they produce such crazy papers?

65

2K

299

2K

232K

1

3

0

358

Saurabh Singh

@saurasingh

4 months ago

Ace'd the class ;)

Poetiq

@poetiq_ai

4 months ago

Following up on our SOTA results on ARC-AGI, we’re excited to share new SOTA results on Humanity’s Last Exam (both with and without tools) and SimpleQA! On HLE, Poetiq’s meta-system created multiple new SOTA configurations, going all the way up to 55%.

poetiq_ai's tweet photo. Following up on our SOTA results on ARC-AGI, we’re excited to share new SOTA results on Humanity’s Last Exam (both with and without tools) and SimpleQA!

On HLE, Poetiq’s meta-system created multiple new SOTA configurations, going all the way up to 55%. https://t.co/PT1os7oZTi

12

180

32

56

65K

0

2

0

193

Who to follow

Tanmay Gupta

@tanmay2099

Senior Research Scientist @allen_ai (Ai2) | Building multimodal agents | MolmoWeb | CVPR’23 Best Paper | Prev: PhD @ UIUC & UG @ IIT Kanpur

Sean Welleck

@wellecks

Assistant Professor at CMU. Marathoner, @thesisreview.

Gintare Karolina Dziugaite

@gkdziugaite

Sr Research Scientist at Google DeepMind, Toronto. Member, Mila. Adjunct, McGill CS. PhD Machine Learning & MASt Applied Math (Cambridge), BSc Math (Warwick).

Saurabh Singh

@saurasingh

4 months ago

Proud to be part of the team!

Poetiq

@poetiq_ai

4 months ago

We’re thrilled to announce a new chapter for Poetiq: We have closed $45.8M in Seed funding. It’s a privilege to build alongside partners who understand the scale of our vision, including Surface, FYRFLY, @ycombinator, 468, Operator Collective, NeuronVC, and HICO.

7

152

14

64

146K

0

5

0

189

Saurabh Singh

@saurasingh

5 months ago

Very interesting take. Doesn’t account for regulatory moves though.

sysls

@systematicls

5 months ago

https://t.co/FQe5bCBqW1

1K

26K

5K

47K

16M

0

1

0

137

saurasingh retweeted

Massimo

@Rainmaker1973

6 months ago

Points of view

173

22K

4K

6K

897K

saurasingh retweeted

Steven Pinker

@sapinker

5 months ago

We are primates, with a third of our brain dedicated to vision. Graphics are the most effective way to communicate quantitative, spatial, and causal information, but they are as hard to master as clear and stylish prose. Here's an excellent guide to data visualization by Saloni Dattani @salonium https://t.co/tnXYMKKRYA

1

1K

258

1K

141K

Saurabh Singh

@saurasingh

5 months ago

Awesome to see such fast progress 🚀

Poetiq

@poetiq_ai

5 months ago

We finally had a moment to run our system with GPT-5.2 X-High on ARC-AGI-2! Using the same Poetiq harness as before, we saw results as high as 75% at under $8 / problem using GPT-5.2 X-High on the full PUBLIC-EVAL dataset. This beats the previous SOTA by ~15 percentage points.

poetiq_ai's tweet photo. We finally had a moment to run our system with GPT-5.2 X-High on ARC-AGI-2!

Using the same Poetiq harness as before, we saw results as high as 75% at under $8 / problem using GPT-5.2 X-High on the full PUBLIC-EVAL dataset. This beats the previous SOTA by ~15 percentage points. https://t.co/9XNdequRy5

123

2K

276

534

992K

0

8

0

482

Saurabh Singh

@saurasingh

6 months ago

🥇

Emirhan Erkan

@permaximum88

6 months ago

We have verified Poetiq's solution and it's indeed SotA on ARC-AGI-2! Huge congrats!

2

7

0

1

809

0

6

0

429

Saurabh Singh

@saurasingh

6 months ago

@tbenst Prompt optimization is a reasonable first step and cool! Our released model demonstrates much more than that -- test time reasoning loop. Learned test time reasoning is cooler 😉!

1

4

0

122

Saurabh Singh

@saurasingh

6 months ago

@MLStreetTalk Our results are now officially verified! 🥇 https://t.co/3yuQliGNHO

Poetiq

@poetiq_ai

6 months ago

Poetiq has officially shattered the ARC-AGI-2 SOTA 🚀 @arcprize has officially verified our results: - 54% Accuracy – first to break the 50% barrier! - $30.57 / problem – less than half the cost of the previous best! We are now #1 on the leaderboard for ARC-AGI-2!

poetiq_ai's tweet photo. Poetiq has officially shattered the ARC-AGI-2 SOTA 🚀

@arcprize has officially verified our results:
- 54% Accuracy – first to break the 50% barrier!
- $30.57 / problem – less than half the cost of the previous best!

We are now #1 on the leaderboard for ARC-AGI-2!

110

2K

257

618

472K

2

13

1

2K

Saurabh Singh

@saurasingh

6 months ago

@sachinpeak @arcprize Our results are now officially verified! Though, plot on the official website presents it in a confusing way. https://t.co/3yuQliGNHO

Poetiq

@poetiq_ai

6 months ago

Poetiq has officially shattered the ARC-AGI-2 SOTA 🚀 @arcprize has officially verified our results: - 54% Accuracy – first to break the 50% barrier! - $30.57 / problem – less than half the cost of the previous best! We are now #1 on the leaderboard for ARC-AGI-2!

110

2K

257

618

472K

0

5

0

177

Saurabh Singh

@saurasingh

6 months ago

@dileeplearning Now we are officially verified! 🥇 https://t.co/3yuQliGNHO

Poetiq

@poetiq_ai

6 months ago

Poetiq has officially shattered the ARC-AGI-2 SOTA 🚀 @arcprize has officially verified our results: - 54% Accuracy – first to break the 50% barrier! - $30.57 / problem – less than half the cost of the previous best! We are now #1 on the leaderboard for ARC-AGI-2!

110

2K

257

618

472K

0

2

0

109

Saurabh Singh

@saurasingh

6 months ago

@post555s @poetiq_ai @arcprize Fortunately no: https://t.co/x7ECrN3rmm

0

63

Saurabh Singh

@saurasingh

6 months ago

@_vincentpaul_ @poetiq_ai @arcprize @jm_alexia The results are now validated: https://t.co/x7ECrN3rmm

0

1

0

48

Saurabh Singh

@saurasingh

6 months ago

@apples_jimmy @poetiq_ai @arcprize @GregKamradt The results are now officially verified! Thanks for your interest. https://t.co/x7ECrN3rmm

0

2

0

62

Saurabh Singh

@saurasingh

6 months ago

@giffmana I think people really need to read -- The Structure of Scientific Revolutions by Thomas S. Kuhn. A "Hero Scientist", sole inventor etc., is mostly a myth and science advances by communication, collaboration and resulting improvements of ideas.

saurasingh's tweet photo. @giffmana I think people really need to read -- The Structure of Scientific Revolutions by Thomas S. Kuhn. A "Hero Scientist", sole inventor etc., is mostly a myth and science advances by communication, collaboration and resulting improvements of ideas. https://t.co/REEkVlaKiU

0

58

Saurabh Singh

@saurasingh

6 months ago

Read more on the #ARCPRIZE blogpost about our results: https://t.co/XnDYqGaTPZ

0

3

0

101

Saurabh Singh

@saurasingh

6 months ago

Our ground breaking results on ARCAGI have now been officially verified!🥇

Poetiq

@poetiq_ai

6 months ago

Poetiq has officially shattered the ARC-AGI-2 SOTA 🚀 @arcprize has officially verified our results: - 54% Accuracy – first to break the 50% barrier! - $30.57 / problem – less than half the cost of the previous best! We are now #1 on the leaderboard for ARC-AGI-2!

110

2K

257

618

472K

2

8

1

0

2K

Saurabh Singh

@saurasingh

6 months ago

Quite confusing wording on the official plot though. We are not just a provider. Good luck getting this level of performance without our system!

saurasingh's tweet photo. Quite confusing wording on the official plot though. We are not just a provider. Good luck getting this level of performance without our system! https://t.co/vqfSySEdq8

1

0

122

Saurabh Singh

@saurasingh

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users