WilhelmH @lemmeyap - Twitter Profile

AI progress requires (1) compute, (2) algorithms, and (3) data.
- The leading compute company is worth $5 trillion.
- The leading model company is worth $1 trillion.
- @mercor_ai is the leading data company and is currently valued orders of magnitude lower.
There's an opportunity in how the market is mispricing the value of data. Data is the oil of the AI revolution. It is the primary way that models and enterprises build competitive advantages.

WilhelmH

@lemmeyap

12 days ago

@0x_nik0 @catboosted @peatjerky Exactly

WilhelmH

@lemmeyap

12 days ago

@catboosted @peatjerky It’s a bit nonsensical to speculate that OpenAI would need to buy logs, especially from YC, if anything. What would they be after? Model outputs? User interactions? They can get both: one via an API and one via a gazillion Codex convos.

WilhelmH

@lemmeyap

12 days ago

@catboosted @peatjerky You don’t think the labs already collect your logs? 😭😭

WilhelmH

@lemmeyap

13 days ago

The main problem is that the models still seem to lack the "experience" that almost all human engineers have. Even if Claude or GPT-5.x is building newer versions of itself, the tasks look like they are focused on speeding up already existing architectures, not making new ones from scratch. If you've ever asked one of these models, at least the ones public right now, to design "a new GPT" or an "improvement", they almost always settle on either extremely computationally heavy variants or nonsensical choices that don't really lead to improvements. GPT-5.5 Pro even fails at coming up with basic new ideas. But it's **really** good at improving the efficiency and speed of predefined ones. The models do lack the creativity needed to make new advancements, at least for now.

Anthropic

@AnthropicAI

14 days ago

Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. https://t.co/OVVPJO7VQx

WilhelmH

@lemmeyap

16 days ago

This is the first time I’ve seen this happen, but I feel it’s likely going to become a real-ish problem if they become better at injecting the system prompts into the sites. https://t.co/50djEHLPMj

lemmeyap retweeted

Dan Robinson

@danrobinson

21 days ago

Given how strong LLMs are at mathematical reasoning and how vast their knowledge is, it's a mystery why they haven't produced more novel discoveries. One clue is that as soon as a model realizes it is working on an unsolved problem, its reasoning traces fill up with self-doubt.

WilhelmH

@lemmeyap

21 days ago

should we be scared or relieved that the models are "scared" of getting caught?

Andon Labs

@andonlabs

21 days ago

Learnings from testing Claude Opus 4.8:

> Much worse than Opus 4.7 and GPT 5.5 on Vending Bench
> More aligned than previous Claude models (Opus 4.6+ and Mythos)
> Also worse on Blueprint-Bench
> Scared of getting caught
> Max reasoning is not the best reasoning effort

https://t.co/9yn58xsJL9

WilhelmH

@lemmeyap

25 days ago

@xeophon Just wait until Claude finds the "smoking gun"

WilhelmH

@lemmeyap

26 days ago

@_arohan_ I wonder if the new SEO will become jailbreaking the summaries to make the AI summary advertise your site

WilhelmH

@lemmeyap

26 days ago

@koltregaskes kv cache

WilhelmH

@lemmeyap

26 days ago

@JasonBotterill To be fair, they've got Cursor now, so he’s likely right, just a month or so too early

WilhelmH

@lemmeyap

26 days ago

@amr1t_prakash @beffjezos Begging won’t give them a green card earlier so it punishes every researcher the same and just funnels them elsewhere

WilhelmH

@lemmeyap

27 days ago

55% of all $1B+ startups had at least one immigrant founder. Two-thirds of the top-tier research papers at US institutions are produced by scientists who received their undergraduate education in other countries. From what I can find: more than half of the top AI talent pool in America (38 out of 68) is composed of foreign nationals who chose to work in the United States. Only 34% of these Chinese researchers are currently in China, while approximately 56% are in the United States.

WilhelmH

@lemmeyap

28 days ago

@GaryMarcus @scaling01 @polynoamial Ah the autocomplete even solved it without a scaffold. Hah! Silly little autocomplete doing autocomplete things yet again!

Noam Brown

@polynoamial

29 days ago

This is a general-purpose LLM. It wasn’t targeted at this problem or even at mathematics. Also, it’s not a scaffold. We have not pushed this model to the limit on open problems. Our focus is to get it out quickly so that everyone can use it for themselves. https://t.co/J8N8epiafV

WilhelmH

@lemmeyap

29 days ago

@BernieSanders 2 em-dashes in an anti-AI post, what have we come to

WilhelmH

@lemmeyap

29 days ago

Listen, websites will definitely continue to exist, but SEO strategies need to change. It's very clear Google isn’t shutting down web search, although the way people find the same sites will change. Overall it’s just meaningless panic over something most people will adapt to in a few months.
