Arun Krishnamurthy @arun279 - Twitter Profile

11 days ago

perhaps you should have devs use regular $100/$200 usage plans part of the time @bcherny @lydiahallie @trq212 having unlimited toks internally at all times disconnects devs from users Like Facebook in 2015 when Zuck made devs use 2g on “2g tuesdays”, which led to Lite products

0

1

0

34

Arun Krishnamurthy @arun279

4 months ago

@avilesrafa @mattpocockuk cool idea!

0

41

Arun Krishnamurthy @arun279

4 months ago

@mattpocockuk I’m currently working on a hook to grab used context & tokens to compaction into the context after every PostToolCall. I want to instruct it to write out notes for itself before compaction.

0

18

Arun Krishnamurthy @arun279

4 months ago

@HeyBilt I was told this but I never got any emails. This is hilariously bad service.

0

16

Who to follow

🚀Kristy Eisele✨💫

@razemfrazem

More fun, less shun!!! -EM ⚡️Space nerd 🚗 🚀❤️ Tesla long💪⚡️✨

This too shall pass. #MUFC is home.

Arun Krishnamurthy @arun279

5 months ago

Trying to get a human to respond from @HeyBilt is a great way to waste time. Someone has hard coded “59 minutes” to the support service and it has no relationship to reality.

arun279's tweet photo. Trying to get a human to respond from @HeyBilt is a great way to waste time. Someone has hard coded “59 minutes” to the support service and it has no relationship to reality. https://t.co/eQHHeLe7oA

1

2

0

111

Arun Krishnamurthy @arun279

5 months ago

only one team in the entire premier league right now has more than 2 wins in the last 5. how common is that?

0

24

Arun Krishnamurthy @arun279

12 months ago

.@stevenbjohnson please consider making a @NotebookLM API! there are a lot of interesting use cases that can be unlocked with this.

0

1

0

40

arun279 retweeted

Andrej Karpathy

@karpathy

over 1 year ago

New 3h31m video on YouTube: "Deep Dive into LLMs like ChatGPT" This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It is covers the full training stack of how the models are developed, along with mental models of how to think about their "psychology", and how to get the best use them in practical applications. We cover all the major stages: 1. pretraining: data, tokenization, Transformer neural network I/O and internals, inference, GPT-2 training example, Llama 3.1 base inference examples 2. supervised finetuning: conversations data, "LLM Psychology": hallucinations, tool use, knowledge/working memory, knowledge of self, models need tokens to think, spelling, jagged intelligence 3. reinforcement learning: practice makes perfect, DeepSeek-R1, AlphaGo, RLHF. I designed this video for the "general audience" track of my videos, which I believe are accessible to most people, even without technical background. It should give you an intuitive understanding of the full training pipeline of LLMs like ChatGPT, with many examples along the way, and maybe some ways of thinking around current capabilities, where we are, and what's coming. (Also, I have one "Intro to LLMs" video already from ~year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a lot more comprehensive version of this topic. They can still be combined, as the talk goes a lot deeper into other topics, e.g. LLM OS and LLM Security) Hope it's fun & useful! https://t.co/75mXcUBI8L

karpathy's tweet photo. New 3h31m video on YouTube:
"Deep Dive into LLMs like ChatGPT"

This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It is covers the full training stack of how the models are developed, along with mental models of how to think about their "psychology", and how to get the best use them in practical applications.

We cover all the major stages:
1. pretraining: data, tokenization, Transformer neural network I/O and internals, inference, GPT-2 training example, Llama 3.1 base inference examples
2. supervised finetuning: conversations data, "LLM Psychology": hallucinations, tool use, knowledge/working memory, knowledge of self, models need tokens to think, spelling, jagged intelligence
3. reinforcement learning: practice makes perfect, DeepSeek-R1, AlphaGo, RLHF.

I designed this video for the "general audience" track of my videos, which I believe are accessible to most people, even without technical background. It should give you an intuitive understanding of the full training pipeline of LLMs like ChatGPT, with many examples along the way, and maybe some ways of thinking around current capabilities, where we are, and what's coming.

(Also, I have one "Intro to LLMs" video already from ~year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a lot more comprehensive version of this topic. They can still be combined, as the talk goes a lot deeper into other topics, e.g. LLM OS and LLM Security)

Hope it's fun & useful!
https://t.co/75mXcUBI8L

767

20K

3K

15K

2M

Arun Krishnamurthy @arun279

over 1 year ago

@robot the order 22509929 says it’s delivered but it’s not, i tried everything and emailed your support chat but i got no response.

1

0

35

Arun Krishnamurthy @arun279

almost 2 years ago

@MKBHD how many charging cycles would this battery be rated for if it's charged 320W every time?

0

1

0

12

Arun Krishnamurthy @arun279

almost 2 years ago

my favorite @LinusTech product!

0

3

0

61

arun279 retweeted

internet hall of fame

@InternetH0F

about 2 years ago

287

214K

8K

11K

11M

Arun Krishnamurthy @arun279

about 2 years ago

camouflage @PlayStation

0

1

0

27

arun279 retweeted

Linus Tech Tips

@LinusTech

about 2 years ago

179

72K

5K

1K

2M

arun279 retweeted

Andrej Karpathy

@karpathy

about 2 years ago

Can I just say I loooove Suno. Some of my favorites: Dog dog dog dog dog dog dog dog woof woof https://t.co/3yWAqFGDe3 Chemical elements https://t.co/p7EEc4iYgd train_gpt2.c header (who did this lol) https://t.co/6gz25sxiKA Suno tutorial (in Suno!): https://t.co/vN5lPa55Tg Many others. So good. Anyone else favorites?

176

2K

195

1K

433K

arun279 retweeted

Yann LeCun

@ylecun

about 2 years ago

How to be as "smart" as Auto-Regressive LLMs: - memorize lots of problem statements together with recipes on how to solve them. - to solve a new problem, retrieve the recipe whose problem statement superficially matches the new problem. - apply the recipe blindly and declare victory. - do not use basic logic. - do not use common sense to check your solution. - do not use a mental model of the situation as a sanity check. - do not simulate the scenario in your mind using your world model. - when someone tells you your solution is wrong, reply "I'm sorry, you are right" and apply another irrelevant recipe. Knowledge accumulation is not a substitute for actual understanding.

200

3K

516

1K

800K

arun279 retweeted

Andreia Ribeiro @andreiacribeir

about 2 years ago

Java developers, where are you?

130

2K

204

51

79K

arun279 retweeted

MatLab crashes

@memecrashes

about 2 years ago

234

5K

425

599

457K

Arun Krishnamurthy @arun279

about 2 years ago

new @PatrickRothfuss fan

0

3

0

84

arun279 retweeted

Sully

@SullyOmarr

over 2 years ago

Gemini 1.5 pro is STILL under hyped I uploaded an entire codebase directly from github, AND all of the issues (@vercel ai sdk,) Not only was it able to understand the entire codebase, it identified the most urgent issue, and IMPLEMENTED a fix. This changes everything

SullyOmarr's tweet photo. Gemini 1.5 pro is STILL under hyped

I uploaded an entire codebase directly from github, AND all of the issues (@vercel ai sdk,)

Not only was it able to understand the entire codebase, it identified the most urgent issue, and IMPLEMENTED a fix.

This changes everything

115

2K

327

1K

701K

Arun Krishnamurthy

@arun279

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users