Nick @nickwal - Twitter Profile

about 1 month ago

@RhysSullivan Depends on the model usually no but in our benchmarks there are models where it helps a ton but they usually have weaker reasoning capabilities

0

1

0

276

Nick

@nickwal

about 2 months ago

@benhylak oh yes!!!

0

2

0

25

Nick

@nickwal

about 2 months ago

@leedsharkey @GoodfireAI this is so cool! great work

0

1

0

411

Nick

@nickwal

2 months ago

Oh my god the whale is back

0

2

0

160

Who to follow

Kalyan Kumar Pichuka

@PichukaKumar

Senior Data Scientist. Passionate about solving complex problems. I am one who thinks Maths is fun

ayameRushia

@ayameRushia

https://t.co/8mIh2cGTXn

nickwal retweeted

3 months ago

Human-in-the-loop RL is necessarily done at group size 1; you cannot do a group of rollouts with only one human. i.e. there is no baseline for you to subtract for each input prompt. This is by far the most interesting and under-discussed part of this announcement. The same was true for their tab-completions model. From the wording in their posts, it sounds like they are using plain REINFORCE (no mention of value functions) with a large batch size + re-evaluating each checkpoint to guard against high variance. Cursor is implicitly revealing an important empirical result: with a large enough batch size, simple REINFORCE just works, no baseline needed. In other words, large scale continual learning is solved.

12

254

23

221

41K

Nick

@nickwal

3 months ago

@leonardtang_ sick

0

1

0

205

nickwal retweeted

Bartosz Naskręcki

@nasqret

4 months ago

It finally happened-my personal move 37 or more. I am deeply impressed. The solution is very nice, clean, and feels almost human. While testing new models in the last few weeks, I felt this coming, but it's an eerie feeling to see an algorithm solve a task one has curated for about 20 years. But at least I have gained a tool that understands my idea on par with the top experts in the field. And I am now working on a completely new level. My singularity has just happened… and there is life on the other side, off to infinity!

102

4K

447

1K

1M

Nick

@nickwal

4 months ago

@xeophon @PrimeIntellect congrats!

0

1

0

12

Nick

@nickwal

4 months ago

@si_pbc sick

0

65

Nick

@nickwal

4 months ago

@jxnlco @OpenAI @romainhuet @OpenAIDevs let’s go!!! congrats

0

1

0

34

Nick

@nickwal

4 months ago

@ProximalHQ Congrats!

0

57

nickwal retweeted

David

@DavidSHolz

4 months ago

5 million humanoid robots working 24/7 can build Manhattan in ~6 months. now just imagine what the world looks like when we have 10 billion of them by 2045. now imagine the year 2100.

3K

13K

1K

2K

25M

Nick

@nickwal

5 months ago

@GoodfireAI this is so cool

0

2

0

704

Nick

@nickwal

5 months ago

very inspiring vision for the future of research the hosted training has been incredible to iterate with

Prime Intellect @PrimeIntellect

5 months ago

Introducing Lab: A full-stack platform for training your own agentic models Build, evaluate and train on your own environments at scale without managing the underlying infrastructure. Giving everyone their own frontier AI lab.

139

3K

294

2K

3M

0

19

2

1

1K

nickwal retweeted

Prime Intellect @PrimeIntellect

5 months ago

Introducing Lab: A full-stack platform for training your own agentic models Build, evaluate and train on your own environments at scale without managing the underlying infrastructure. Giving everyone their own frontier AI lab.

139

3K

294

2K

3M

Nick

@nickwal

5 months ago

@thdxr providing*

0

49

Nick

@nickwal

5 months ago

@thdxr I believe this is regarding intra-turn prefill where you precondition the response by proving the first few tokens for that turn of the assistant response I believe this is unrelated to prior turns in the chat format

2

0

2K