Mann Patel @punsbymann - Twitter Profile

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://t.co/AFJZ5kH7Ku

464

16K

2K

12K

8M

punsbymann retweeted

Will Held @WilliamBarrHeld

about 1 month ago

To train better open models, we need predictable scaling. Delphi is Marin’s first step: we pretrained many small models with one recipe, then extrapolated 300× to predict a 25B-param / 600B-token run with just 0.2% error. Getting there took some work 🧵

14

459

78

329

138K

Mann Patel @punsbymann

about 1 month ago

everyone benchmarks pass@k. everyone deploys pass^k.

0

35

Mann Patel @punsbymann

about 2 months ago

@finbarrtimbers same

0

88

punsbymann retweeted

finbarr

@finbarrtimbers

about 2 months ago

The best part of my job is I get to play with GPUs all day and someone else pays for it

7

218

6

13

11K

punsbymann retweeted

Jamie Simon @learning_mech

about 2 months ago

1/ Deep learning is going to have a scientific theory. We can see the pieces starting to come together, and it's looking a lot like physics! We're releasing a paper pulling together these emerging threads and giving them a name: learning mechanics. 🔨 https://t.co/92nSIHameW 🔧

learning_mech's tweet photo. 1/ Deep learning is going to have a scientific theory. We can see the pieces starting to come together, and it's looking a lot like physics!

We're releasing a paper pulling together these emerging threads and giving them a name: learning mechanics.

🔨 https://t.co/92nSIHameW 🔧 https://t.co/3cshMD33bl

54

2K

292

2K

305K

punsbymann retweeted

Thoughtful @thoughtfullab

about 2 months ago

Model shaping is still a craft of a few. That's what AI agents are for: learning it and doing it for everyone else. As a part of FrontierSWE benchmark we built a 20-hour post-training task on @tinkerapi and found the real bottleneck is research intuition.

11

515

53

552

214K

Mann Patel @punsbymann

about 2 months ago

@littewhite16806 most interesting part of this little project was writing a selenium verifier to 16personalities site and letting llm answer 5 scale questions haha

0

1

0

21

Mann Patel @punsbymann

about 2 months ago

@littewhite16806 your work is totally unique and kind of opposite to mine! while you prune and prove personas already exist in your model (which sounds about right), i tried to train Lora’s that show composition of persona😄

0

1

0

17

punsbymann retweeted

Psyho

@FakePsyho

about 2 months ago

the benchmark game has entered its IPO era

30

9K

488

528

358K

punsbymann retweeted

Michael Lee

@ChiahsuanL

2 months ago

🧵 Decomposing the Delta: What Do Models Actually Learn from Preference Pairs? 1/n 💡 Why do methods like DPO and KTO actually improve reasoning? In standard alignment, we use preference pairs, but we don't fully understand what properties of the data drive downstream gains. We investigate two distinct notions of quality: Generator-level Delta - the capability gap between the models producing the chosen vs. rejected traces Sample-level Delta - The fine-grained quality difference within a single pair (Factuality, Coherence, Precision) 👇

2

7

3

2

350

Mann Patel @punsbymann

2 months ago

@f14bertolotti @ChiahsuanL @MingyangKevinZh 🤩

0

2

0

94

punsbymann retweeted

Francesco Bertolotti @f14bertolotti

2 months ago

This paper digs into what actually makes delta learning work. The authors show that it’s not just about the chosen trajectory being correct relative to the rejected one. What drives the gains is the coherence of reasoning across the trajectory. 🔗https://t.co/TSEllrOdxN

f14bertolotti's tweet photo. This paper digs into what actually makes delta learning work. The authors show that it’s not just about the chosen trajectory being correct relative to the rejected one. What drives the gains is the coherence of reasoning across the trajectory.

🔗https://t.co/TSEllrOdxN https://t.co/5hpyCrVs1c

1

43

12

33

4K

Mann Patel @punsbymann

2 months ago

@ChangJonathanC i might have good results on this one :)

0

1

0

56

Mann Patel

@punsbymann

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users