Ron Arel @ronusedh - Twitter Profile

Pinned Tweet

8 months ago

Feel the rain on your skin No one else can feel it for you Only you can let it in No one else, no one else Can speak the words on your lips

3

5

0

1K

ronusedh retweeted

Intology

@intology

25 days ago

Can coding agents do research? We release NanoGPT-Bench, an internal eval we’ve used to test agents on an AI R&D problem with months of human progress Codex, Claude Code, Autoresearch recover only 9.3% of human progress, mostly tuning hyperparams & ignoring algorithmic research NanoGPT-Bench is built on the NanoGPT Speedrun, a popular LLM pretraining competition to minimize the training time of a GPT-2 style model. Existing human submissions constitute nearly 2 years of work. To control for dependencies and contamination in frontier models, we standardize evaluation to a 5-month window of world records. Evaluation is fully autonomous and end-to-end, with no human intervention or internet access. 🧵

intology's tweet photo. Can coding agents do research?

We release NanoGPT-Bench, an internal eval we’ve used to test agents on an AI R&D problem with months of human progress

Codex, Claude Code, Autoresearch recover only 9.3% of human progress, mostly tuning hyperparams & ignoring algorithmic research

NanoGPT-Bench is built on the NanoGPT Speedrun, a popular LLM pretraining competition to minimize the training time of a GPT-2 style model. Existing human submissions constitute nearly 2 years of work. To control for dependencies and contamination in frontier models, we standardize evaluation to a 5-month window of world records. Evaluation is fully autonomous and end-to-end, with no human intervention or internet access. 🧵

22

281

61

173

145K

Ron Arel

@ronusedh

25 days ago

"Human" Record btw

Intology

@intology

25 days ago

World record #31 (out of the 33 reference records) was achieved by Locus, our Artificial Scientist, on January 16th, 2026 Locus implemented a fused triton kernel for the softcapped multi-token prediction cross entropy step. Several future records by humans have built further on the kernel. GitHub: https://t.co/2dtR04TnTR

2

18

2

6

6K

0

4

0

122

Ron Arel

@ronusedh

25 days ago

@intology "Human" Record 😁

0

1

0

45

ronusedh retweeted

Sam Rodriques

@SGRodriques

25 days ago

We live in a golden age of biology. So why are people still dying from disease? Because discovery and development move slower than they should. Today, we’re partnering with Incyte to change that. Kosmos is now the first agent that can compress months of drug development into weeks, from the earliest stages of scientific discovery through to FDA approval. @Incyte will be the first company to deploy it across their pipeline. Work that used to take a team of scientists months now happens in weeks. Patients can't wait, and neither can we.

116

2K

491

1K

7M

Ron Arel

@ronusedh

25 days ago

We evaluated Codex, Claude Code, and @karpathy's Autoresearch on their ability to reproduce human progress on NanoGPT Speedrun! Shoutout @kellerjordan0 & @classiclarryd

Intology

@intology

25 days ago

Can coding agents do research? We release NanoGPT-Bench, an internal eval we’ve used to test agents on an AI R&D problem with months of human progress Codex, Claude Code, Autoresearch recover only 9.3% of human progress, mostly tuning hyperparams & ignoring algorithmic research NanoGPT-Bench is built on the NanoGPT Speedrun, a popular LLM pretraining competition to minimize the training time of a GPT-2 style model. Existing human submissions constitute nearly 2 years of work. To control for dependencies and contamination in frontier models, we standardize evaluation to a 5-month window of world records. Evaluation is fully autonomous and end-to-end, with no human intervention or internet access. 🧵

22

281

61

173

145K

0

7

0

385

ronusedh retweeted

Rebecca Wang

@rbccawang

about 2 months ago

agi-timeline gap relationship

9

366

20

23

28K

Ron Arel

@ronusedh

4 months ago

Genuinely horrific way to experience life and the people around you. People don’t try to make new friends because it’s “what you are supposed to do”. Life is given meaning by the wonderful people you meet along the way and choose to spend time with. You don’t have to be friends with everyone, but to shut off the possibility of new human connection due to a lack of calculated “ROI” is fundamentally misunderstanding of the human experience.

0

1

0

212

4 months ago

4 months ago

elon's mistake on xai optics was setting an engineering culture in a field bottlenecked by alchemy

2

75

0

5

12K

0

1

0

267

Ron Arel

@ronusedh

5 months ago

Goodfire casually doing the coolest things ever

Goodfire

@GoodfireAI

5 months ago

We've identified a novel class of biomarkers for Alzheimer's detection - using interpretability - with @PrimaMente. How we did it, and how interpretability can power scientific discovery in the age of digital biology: (1/6)

GoodfireAI's tweet photo. We've identified a novel class of biomarkers for Alzheimer's detection - using interpretability - with @PrimaMente.

How we did it, and how interpretability can power scientific discovery in the age of digital biology: (1/6) https://t.co/SHBawjo7qi

50

2K

221

925

397K

0

1

0

207

ronusedh retweeted

Justin Cho

@HJCH0

5 months ago

I've joined @intology! I'm excited to push the boundaries of AI-accelerated scientific discovery with an incredibly driven and talented team. Looking forward to dive deep into research on AI-driven automation and creativity!

2

11

4

1

2K

ronusedh retweeted

Rishub Tamirisa

@rishub_t

5 months ago

Locus achieves a new WR on the nanogpt speedrun by developing a fused kernel:

0

2

1

0

587

ronusedh retweeted

Larry Dial

@classiclarryd

5 months ago

New NanoGPT Speedrun WR at 105.9s (-1.0s) from @soren_dunn_ , with a triton kernel to fuse the logit softcap and multi-token prediction cross entropy calc. Interestingly, Soren mentioned that their autonomous system Locus at Intology discovered and implemented the improvement. https://t.co/eU5UZT3nYJ

0

139

12

53

20K

Ron Arel

@ronusedh

5 months ago

You'll still be standing next to me You could be my luck Even if we're six feet underground I know that we'll be safe and sound

0

67

ronusedh retweeted

no context memes

@nocontextmemes

5 months ago

38

18K

954

591

210K

Ron Arel

@ronusedh

5 months ago

Cause you're hot, then you're cold You're yes, then you're no You're in, then you're out You're up, then you're down

0

57

ronusedh retweeted

mr. joshua

@pants

5 months ago

87

42K

2K

1K

716K

ronusedh retweeted

Nick Dobos

@NickADobos

6 months ago

Programmer is out Shoggoth pilot is in

16

1K

63

104

64K

Ron Arel

@ronusedh

6 months ago

@akuhayum Grok propaganda

0

28

0

8K

Ron Arel

@ronusedh

6 months ago

0

2

0

80

Ron Arel

@ronusedh

6 months ago

Awesome results by the @poetiq_ai team!

Poetiq

@poetiq_ai

6 months ago

We finally had a moment to run our system with GPT-5.2 X-High on ARC-AGI-2! Using the same Poetiq harness as before, we saw results as high as 75% at under $8 / problem using GPT-5.2 X-High on the full PUBLIC-EVAL dataset. This beats the previous SOTA by ~15 percentage points.

poetiq_ai's tweet photo. We finally had a moment to run our system with GPT-5.2 X-High on ARC-AGI-2!

Using the same Poetiq harness as before, we saw results as high as 75% at under $8 / problem using GPT-5.2 X-High on the full PUBLIC-EVAL dataset. This beats the previous SOTA by ~15 percentage points. https://t.co/9XNdequRy5

123

2K

275

533

992K

0

1

0

157

Ron Arel

@ronusedh

Last Seen Users on Sotwe

Trends for you

Most Popular Users