Mithil Vakde @evilmathkid - Twitter Profile

Pinned Tweet

3 months ago

44% on ARC-AGI-1 in 67 cents!
Trained from scratch in 2hrs on a 5090

Matches TRM, beats HRM and is way faster & cheaper
No recursion, just a transformer

Also, 7% on ARC-2 🧵 https://t.co/FqJAtFHUPg

30

685

72

348

57K

Mithil Vakde

@evilmathkid

7 days ago

@ergodicthought @CSProfKGD @ylecun Actually lecun says exactly that in the slide before this

I assume the subtext is "for human level ai", build these types of world models. https://t.co/dunKIvWBBM

0

2

0

43

Mithil Vakde

@evilmathkid

8 days ago

@ergodicthought @CSProfKGD @ylecun irl in a separate interaction I mean

also found a video of this talk. He says "world models should be ..." https://t.co/UAd9uQJR0h

1

0

1

38

Mithil Vakde

@evilmathkid

8 days ago

@ergodicthought @CSProfKGD @ylecun I think he is just saying "build world models with these characteristics". That's how it came across irl and in his paper

Idk why Kosta called it "his definition". I'm pretty sure Lecun knows they are all models of the world

1

0

27

Mithil Vakde

@evilmathkid

8 days ago

@ergodicthought @CSProfKGD @ylecun i think the prediction space in those 2 are designed and aim for full fidelity while lecun wants to predict representations that are learnt and intentionally lossy (ignore details irrelevant to task)

1

0

42

evilmathkid retweeted

Adithya Dsilva @AdithyaDsilva

20 days ago

Hiring IC+ level computer engineers with an interest in building large scale, dynamic scrapers that work autonomously. In person @ indiranagar, Bangalore. (IC - someone confident to the point of being the authoritative say in the team. Not bothered by age / exp)

16

107

15

42

34K

Mithil Vakde

@evilmathkid

about 1 month ago

@o_v_shake friend needed it back

1

0

138

Mithil Vakde

@evilmathkid

about 1 month ago

Anyone have a used mac i can buy? I don't have a laptop anymore

Older ones like M1/M2 also work

3

6

0

2

1K

Mithil Vakde

@evilmathkid

about 1 month ago

@mike64_t @willdepue @francoisfleuret what's SPD btw? chatgpt is confused too

1

0

78

Mithil Vakde

@evilmathkid

about 1 month ago

the most entertaining outcome strikes again

0

1

0

368

Mithil Vakde

@evilmathkid

about 1 month ago

@gauravisnotme Nobody thinks we can catch up. It's worse
- Those who understand this is existential have given up
- Those who don't, think AI is a gimmick

Capital exists but not a single serious effort has been made so it has gone to grifters and the incompetent

0

4

0

284

Mithil Vakde

@evilmathkid

about 1 month ago

@plugyawn oh yeah but i think vie doesn't think big enough

why only scaling or RL breakthroughs?

1

0

49

Mithil Vakde

@evilmathkid

about 1 month ago

@pranesh @nileshtrivedi @NandanNilekani good job with the sneaky ad hominem. it distracts from the fact that your frame and understanding of my pos are both wrong

happy to continue the discussion if we stick to arguments

1

0

27

Mithil Vakde

@evilmathkid

about 1 month ago

> preventing modification is not the goal of alignment
??? what's the point of alignment if i can modify and de-align it

> no amount of "alignment" can prevent modifications
today, partially true. but you assume that future models will look like the LLMs of today. you are also making other assumptions that are unlikely like "open models are gonna be as capable as frontier ones"

1

0

29

Mithil Vakde

@evilmathkid

about 1 month ago

@pranesh @nileshtrivedi @NandanNilekani You sure we'll have capabilities to modify powerful aligned models a few years in the future? One of the most important goals of alignment is to prevent modification

1

0

25

Mithil Vakde

@evilmathkid

about 1 month ago

This is extremely short sighted. No reason why this will continue to be true.

Steelman: As alignment capabilities improve, frontier open models will be aligned to the home countries. In this future India's entire tech stack will be rendered useless, forcing us to depend on foreign tech for everything https://t.co/nQGoaIi6D9

1

0

29

Mithil Vakde

@evilmathkid

about 1 month ago

This argument and Sutton's Big world hypothesis is why I'm very optimistic about humanity's future

Bring on the all-encompassing automators. We'll still find new things to do

Dwarkesh Patel

@dwarkesh_sp

about 1 month ago

.@michael_nielsen made an incredibly interesting argument about why the tech tree is actually way larger than we realize, and how our descendants will get to explore very little of it.

12

284

29

159

107K

0

8

0

3

729

Mithil Vakde

@evilmathkid

about 1 month ago

finally got some time to try openai's parameter golf. only 2 days left tho and the top scores are extremely optimised

let's see if i can work some magic

Mithil Vakde

@evilmathkid

3 months ago

Reduce kolmogorov complexity in a ~turing machine defined by 8xH100s, PTX, a CPU and 10min while also optimising the hell out of the code execution

Love it

2

81

1

55

12K

1

5

0

1

700

Mithil Vakde

@evilmathkid

about 1 month ago

Feeling FOMO over all the cool work on optimisers. How hard can the linal be? was my fav course in 1st year

0

7

0

1

754

Mithil Vakde

@evilmathkid

about 1 month ago

@pfau @leothecurious Then the award loses meaning

0

104
