Kyle Wong @ewveggies - Twitter Profile

Pinned Tweet

6 months ago

Excited to share what our lab has been baking: Amazon Nova Act! Trained with large scale RL on diverse web gyms, Nova Act achieves SOTA on multiple public web agent benchmarks. Check it out!🚀 https://t.co/Ut8ruzDclF

0

12

1

3

11K

Kyle Wong

@ewveggies

about 8 hours ago

@_Suresh2 Their collapse doesn’t seem reward model driven. Fig 15 shows instability on aime and lcb, and the report mentions for stem/code they only do RLVR

0

49

Kyle Wong

@ewveggies

1 day ago

Beautiful tech report, perhaps the best western model report I’ve ever read. Lots of great insights: no synthetic data in midtrain, teacher models are RLed directly on top of midtrain, and adaptive clip higher. But still seems like they didn’t fully nail true on-policy as they admit their RL stage is unstable, leading to a hacky self-distillation stage (imo)

elie

@eliebakouch

1 day ago

WOW microsoft new "MAI Thinking 1" model comes with a 109 page tech report that looks REALLY detailed, this is amazing

23

926

116

651

179K

4

111

8

50

12K

Kyle Wong

@ewveggies

about 8 hours ago

@TechnologyPat ironically msft dropped 😭

0

46

Kyle Wong

@ewveggies

about 19 hours ago

Idk if Claudes/GPTs use synthetic data since they don't share anything. But if you want to train the BEST model, and you are confident that your model has the 'best pre-training' (most world knowledge) and 'best mid-training' (best domain specific capabilities and behaviors), then it doesn't make sense to distill off-policy data from other models, since: 1. Those models have less world and domain knowledge 2. Lots of SFT on off-policy synthetic data pulls your model into a more narrow distribution

1

2

1

0

243

Kyle Wong

@ewveggies

about 20 hours ago

@truthixifi taking notes indeed

1

0

236

Kyle Wong

@ewveggies

2 days ago

Internally we have a model scoring 4 points on ARC-AGI-3 But we won’t release it out of respect for Chet Holmgren’s legendary game 7 performance

ARC Prize

@arcprize

2 days ago

Anthropic Opus 4.8 is new SOTA on ARC-AGI-3 Score: 1.5%, ~$10K ARC-AGI-3 analysis notes: * Opus 4.8 read the environment an abstraction *above* Opus 4.7, as objects & systems, not pictures * Opus 4.8 succeeded on early levels, but still committed to a wrong sub-goal

arcprize's tweet photo. Anthropic Opus 4.8 is new SOTA on ARC-AGI-3

Score: 1.5%, ~$10K

ARC-AGI-3 analysis notes:
* Opus 4.8 read the environment an abstraction *above* Opus 4.7, as objects & systems, not pictures
* Opus 4.8 succeeded on early levels, but still committed to a wrong sub-goal https://t.co/PkQQ1u8NaX

53

1K

116

166

124K

0

3

0

618

Kyle Wong

@ewveggies

3 days ago

Gotta love SF poker: Some absolute degen pre-flop 5 bet jams with 34 offsuit, gets called by pocket queens and pocket aces. Flop comes 2 5 6, flopped the absolute nuts and won $500 pot. Sickest hand I’ve ever seen.

ewveggies's tweet photo. Gotta love SF poker:
Some absolute degen pre-flop 5 bet jams with 34 offsuit, gets called by pocket queens and pocket aces.

Flop comes 2 5 6, flopped the absolute nuts and won $500 pot. Sickest hand I’ve ever seen. https://t.co/77qBESfkXR

Corgi @UseCorgi

3 days ago

So much fun hosting a poker night with our friends @SignalFire, packed with founders and operators from the ecosystem. Reminder that we have a space in the heart of SF for community events like this. The whole point is creating room for people to meet and for serendipity to do its thing.

UseCorgi's tweet photo. So much fun hosting a poker night with our friends @SignalFire, packed with founders and operators from the ecosystem.

Reminder that we have a space in the heart of SF for community events like this. The whole point is creating room for people to meet and for serendipity to do its thing.

11

50

2

13

92K

0

4

0

744

Kyle Wong

@ewveggies

4 days ago

@sanjayramesh64 Yooo howd you find my twitter

0

44

Kyle Wong

@ewveggies

4 days ago

To my grand total of 3 followers who are gonna see this, the 1000 Leetcode streak has been achieved

2

14

0

2

470

Kyle Wong

@ewveggies

4 days ago

@lukas_hellesch @UCSB @McDonalds @BurgerKing dropout mentality

0

1

0

168

Kyle Wong

@ewveggies

5 days ago

hey everyone! i’m kyle - new grad 2025 @UCSB - no prev internships - no prev research - no employment ever - bottom 5th percentile in math/coding contests - 2 stars on github class projects - turned down competitve offers @McDonalds and @BurgerKing to hustle on my own - got kicked out of parents basement yesterday; disowned - staying in sf for a few days! looking to raise for my neolab hmu if interested!

ewveggies's tweet photo. hey everyone! i’m kyle

- new grad 2025 @UCSB
- no prev internships
- no prev research
- no employment ever
- bottom 5th percentile in math/coding contests
- 2 stars on github class projects
- turned down competitve offers @McDonalds and @BurgerKing to hustle on my own
- got kicked out of parents basement yesterday; disowned
- staying in sf for a few days! looking to raise for my neolab

hmu if interested!

Samuel Zhang

@samuelxzhang

6 days ago

hey everyone! i'm samuel - 2nd year cs @uwaterloo - prev eng @memories_ai, ai research @uwaterloo - 99th percentile in multiple national math/coding contests - prev national level fencer; 275lbs max bench - 1.2k+ stars on github projects - turned down swe offers @Gemini @openart_ai and @ yc startups this summer to build smth of my own - got flown out for yc s26 interview yesterday; rejected - staying in sf for a few more days; looking to raise from other investors hmu if interested!