Simon Holm @simonholm - Twitter Profile

@thsottiaux Windows app still feels like it needs some love. The core functionality is there, but there are enough rough edges and UI quirks that it doesn't feel as polished as the web experience yet.

0

15

simonholm retweeted

Turing Post

@TheTuringPost

16 days ago

Why KV cache is one of the main reasons LLMs are fast? KV cache is what connects attention mechanism with generation stage of autoregressive models. These models generate text token by token, but each new token still attends to all previous ones. → To optimize decode phase, models store previously computed key and value vectors in a KV cache. → During generation, they only compute new Q/K/V states for the latest token and attend over cached past representations. Without KV cache, the model would recompute keys and values for the entire sequence at every step (like token 501 recomputes tokens 1–500), that's very slow. ▪️ But the tradeoff of KV cache is memory, because it grows with sequence length, batch size, layers, and attention heads. That’s why so much research today targets KV efficiency and memory optimization. For example: - Upgrading attention mechanism, since it influences how KV cache is formed. Use more advanced attention like CompactAttention, MHA, MLA, etc. based on your needs. - Improve memory management. System needs to identify what to store long-term or keep local, when to summarize, and when to trim. You can learn more about KV cache + attention here: https://t.co/YlRyxCM9Tj And how they fit into the full LLM inference pipeline here: https://t.co/tKjX8Wvdkp

TheTuringPost's tweet photo. Why KV cache is one of the main reasons LLMs are fast?

KV cache is what connects attention mechanism with generation stage of autoregressive models.
These models generate text token by token, but each new token still attends to all previous ones.

→ To optimize decode phase, models store previously computed key and value vectors in a KV cache.
→ During generation, they only compute new Q/K/V states for the latest token and attend over cached past representations.

Without KV cache, the model would recompute keys and values for the entire sequence at every step (like token 501 recomputes tokens 1–500), that's very slow.

▪️ But the tradeoff of KV cache is memory, because it grows with sequence length, batch size, layers, and attention heads.

That’s why so much research today targets KV efficiency and memory optimization. For example:

- Upgrading attention mechanism, since it influences how KV cache is formed. Use more advanced attention like CompactAttention, MHA, MLA, etc. based on your needs.
- Improve memory management. System needs to identify what to store long-term or keep local, when to summarize, and when to trim.

You can learn more about KV cache + attention here: https://t.co/YlRyxCM9Tj
And how they fit into the full LLM inference pipeline here: https://t.co/tKjX8Wvdkp

11

676

130

538

29K

Simon Holm @simonholm

19 days ago

@skdh lucky you then🍀

0

3

simonholm retweeted

Andrej Karpathy

@karpathy

21 days ago

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

8K

150K

11K

14K

27M

simonholm retweeted

Haider.

@haider1

22 days ago

Creator of C++, Bjarne Stroustrup: AI-generated code isn't ready — it generates more bugs, more bloat, more security holes, and is nearly impossible to validate "senior developers are already retiring rather than deal with it" The problem is that even a small prompt change can shift the entire codebase in unpredictable ways

621

11K

2K

4K

2M

Simon Holm @simonholm

23 days ago

@JenMsft Beautiful photos, congratulations 🙂

0

1

0

55

simonholm retweeted

Microsoft Research

@MSFTResearch

26 days ago

New releases from Microsoft Research, live in 1 hour. Join for ai that runs your repo + verification-first research + more. 👉 https://t.co/OVxLvELUxr ⏰ 9 AM PT/12 PM ET 💬 Join live + ask questions in chat

0

57

20

10

6K

simonholm retweeted

Ethan Siegel @StartsWithABang

26 days ago

What physics gets wrong about the idea of “fundamental” "Fundamental" in physics usually just means the smallest indivisible quanta plus the laws that govern them. But we wouldn't get far without something more: boundary/initial conditions. https://t.co/Bir0Z1UGuB

11

31

11

12

4K

simonholm retweeted

Digital EU 🇪🇺 @DigitalEU

about 1 month ago

🇪🇺🤝🇯🇵Accelerating regulatory, research & industry cooperation. The EU & Japan held their fourth meeting of the Digital Partnership Council & discussed: 🔹data governance 🔹digital identity 🔹AI 🔹quantum 🔹platforms & more. Learn more: https://t.co/QficZqPpsG

DigitalEU's tweet photo. 🇪🇺🤝🇯🇵Accelerating regulatory, research & industry cooperation.

The EU & Japan held their fourth meeting of the Digital Partnership Council & discussed:

🔹data governance
🔹digital identity
🔹AI
🔹quantum
🔹platforms
& more.

Learn more: https://t.co/QficZqPpsG https://t.co/PYAQJSh5oe

15

129

43

12

13K

simonholm retweeted

Burke Holland

@burkeholland

about 2 months ago

It would be hilarious if we go through this whole AI thing just to find out that it’s ultimately cheaper to pay humans to do it

518

20K

1K

551

2M

Simon Holm @simonholm

2 months ago

@JenMsft Congratulations and celebrations🪊🎶🎂🎉

0

1

0

27

simonholm retweeted

Boris Cherny

@bcherny

2 months ago

I wanted to share a bunch of my favorite hidden and under-utilized features in Claude Code. I'll focus on the ones I use the most. Here goes.

549

23K

3K

52K

4M

Simon Holm @simonholm

3 months ago

@JenMsft No dishwasher—just vibes, water, and silent judgmen

0

1

0

16

Simon Holm @simonholm

3 months ago

@OpenAI Every UX improvement is welcome, but progress still feels slow—still in an early-browser, mosaic phase.

0

11

simonholm retweeted

European Parliament @Europarl_EN

3 months ago

Stronger together in uncertain times 🇪🇺🇨🇦 Canada is a strategic partner to the EU and at a time of growing global instability, this bond matters more than ever. Parliament wants the EU to take its cooperation with Canada to the next level. Read more: https://t.co/IVsuu3Tr1o

Europarl_EN's tweet photo. Stronger together in uncertain times 🇪🇺🇨🇦

Canada is a strategic partner to the EU and at a time of growing global instability, this bond matters more than ever.

Parliament wants the EU to take its cooperation with Canada to the next level.

Read more: https://t.co/IVsuu3Tr1o https://t.co/ZET6aZLltb

437

2K

562

107

82K

simonholm retweeted

Kaja Kallas @kajakallas

3 months ago

The EU and Iceland are close friends. Our security is shared, and so are the challenges we face. Today’s signature of an EU-Iceland Security and Defence Partnership takes our relationship to the next level. This will deepen our cooperation in areas that matter for the safety of our citizens, from maritime security to the protection of critical infrastructure. This is a win for the EU and for Iceland.

kajakallas's tweet photo. The EU and Iceland are close friends.
Our security is shared, and so are the challenges we face.

Today’s signature of an EU-Iceland Security and Defence Partnership takes our relationship to the next level.

This will deepen our cooperation in areas that matter for the safety of our citizens, from maritime security to the protection of critical infrastructure.

This is a win for the EU and for Iceland.

253

3K

595

72

68K

Simon Holm

@simonholm

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users