Madhav Kumar @mkumar_ish - Twitter Profile

7 days ago

@econoadabsurdam @TomRed43 This doesn’t disprove OP’s post. I personally know people who have negatively geared so aggressively to tip their >$400k salaried job to being net negative. (I mean, you can find examples of articles on the same)

1

0

50

Madhav Kumar @mkumar_ish

8 days ago

@damian_b @bernhardsson if one is really maxxing out the usage of the 1m context window frontier models, and has like the fifteen claude code sessions running at once, sub-agents etc etc i feel like it's not that hard

0

1

0

53

Madhav Kumar @mkumar_ish

8 days ago

as if it's not difficult enough to read already

0

7

Madhav Kumar @mkumar_ish

8 days ago

will we lose the ability to read ulysses in a generation's time because of AI

1

0

8

Who to follow

Go Green! Go White! Go Pack go!

baysii

@konpeitoez

slick biscuits

Madhav Kumar @mkumar_ish

9 days ago

unironically cheap gg

Tero Kuittinen @teroterotero

10 days ago

When I tell an American woman this is a Finnish luxury kitchen they start convulsing physically. They can’t help it.

17

914

9

191

135K

0

14

mkumar_ish retweeted

Prompter

@PromptLLM

11 days ago

Insane idea from Opus 4.8

195

9K

526

4K

587K

mkumar_ish retweeted

Jonathon Belotti

@jonobelotti_IO

14 days ago

This is the opposite of what we've seen at https://t.co/jfUt9MiLQ7. Our most experience SWE is taking most advantage. It's also in disagreement with the Hotz post linked: "A trait you find in all high performing people is the ability to error correct, and they have mostly been good at seeing when slop is slop."

1

44

2

5

1K

mkumar_ish retweeted

Quantіan

@quantian1

14 days ago

(The data for this analysis was gathered by Claude. Normally I would disclaim this and suggest you be skeptical, but VC data is sufficiently hallucinated already that it really can't be much worse to have an LLM make it up instead of some guy at Pitchbook)

3

138

7

5

8K

mkumar_ish retweeted

oxcrow @oxcrowx

4 months ago

Anon, Do you know who you're replying to?

18

992

13

145

164K

mkumar_ish retweeted

Brendan Dolan-Gavitt

@moyix

18 days ago

All AI can do is plagiarize, here we see it regurgitating one of the proofs from The Book

5

561

27

48

69K

mkumar_ish retweeted

Bryan Johnson

@bryan_johnson

24 days ago

can you get a perfect score?

317

12K

221

1K

2M

Madhav Kumar @mkumar_ish

24 days ago

i love the idea of being a modern day anthropologist -- i feel like see so much good stuff online totally by chance (e.g. a chinese cheat cheat on the pubbattlegrounds subreddit)

0

5

mkumar_ish retweeted

staysaasy

@staysaasy

27 days ago

It’s 2018 and your coworker just sent you a 400 line pull request. You get a cup of coffee and sit down to review it. It’s beautiful. Elegant micro-refactors. Crispy method names. You catch a few things, but that’s ok. It’s part of the dance. They didn’t consider extensibility on part of their API. Here’s a comment buddy. They respond in an hour saying they think we should do one piece differently than your comment. Hey let’s jump into a room and figure it out. We can’t just agree to disagree, this code is too important. The PR merges and goes to prod. You feel a shared sense of ownership and accomplishment. That night you go to sleep and dream of that code. You can still see the shapes of it on the backs of your eyelids, your IDE syntax highlighting sparking neurons in your reptile brain. You go to work the next day ready to go. You understand the system. N is your foundation. Time to build n+1.

144

10K

432

1K

958K

Madhav Kumar @mkumar_ish

about 1 month ago

i am so uncreative

Riley Walz

@rtwlz

about 1 month ago

10% of AMC movie showings sell no tickets at all. I made a site that finds them. Go enjoy your private theater

319

40K

1K

17K

7M

0

17

mkumar_ish retweeted

Ricardo Olmedo @rdolmedo_

about 1 month ago

Surely a model pre-trained on the web would fare much better? Yes, and no. We also fine-tune their web-retrained model, and observe a modest +1% solve-rate on SWE-bench, achieving 5.7% pass@1 compared to 4.5% Surprisingly little seems to be lost by throwing away the internet.

rdolmedo_'s tweet photo. Surely a model pre-trained on the web would fare much better?

Yes, and no. We also fine-tune their web-retrained model, and observe a modest +1% solve-rate on SWE-bench, achieving 5.7% pass@1 compared to 4.5%

Surprisingly little seems to be lost by throwing away the internet. https://t.co/t1vY68OviF

3

263

19

51

79K

Madhav Kumar @mkumar_ish

about 1 month ago

i think there needs to be an opencode like thing that is totally irresponsible with tokens to make full use of these things

Jia-Bin Huang

@jbhuang0604

about 1 month ago

Keep getting rate-limited by Claude, so I tried out DeepSeek V4 for the first time. After 10M+ tokens, holy crap the cost is ... 🤯

245

6K

250

1K

746K

0

1

0

49

Madhav Kumar @mkumar_ish

about 1 month ago

Anthropic’s mech interp team vs OpenAI’s:

OpenAI

@OpenAI

about 1 month ago

We solved the goblin mystery—with the help of Codex. The culprit: Nerdy personality (RIP).

37

1K

37

84

159K

0

19

mkumar_ish retweeted

Nick

@nickcammarata

about 1 month ago

spent a few years reading through leonardo's notebooks and have often wondered what he'd be doing if he were around today whatever kat is doing is my best guess

24

8K

363

4K

419K

mkumar_ish retweeted

Zain Shah

@zan2434

about 2 months ago

Imagine every pixel on your screen, streamed live directly from a model. No HTML, no layout engine, no code. Just exactly what you want to see. @eddiejiao_obj, @drewocarr and I built a prototype to see how this could actually work, and set out to make it real. We're calling it Flipbook. (1/5)

1K

29K

4K

25K

6M

mkumar_ish retweeted

Bryan Cheong @bryancsk

about 2 months ago

It's kinda sad how DeepSeek saved everyone billions and billions of dollars by inventing GRPO and captured exactly 0 of that value. Maybe open source is unsustainable

47

2K

72

316

255K

Madhav Kumar

@mkumar_ish

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users