prompt_hater @sillychap101 - Twitter Profile

about 23 hours ago

it's not done if it's not implemented
it's not done if the implementation is ugly
it's not done if it's not documented
it's not done if users can't discover it
it's not done if you can't market it

116

2K

212

643

70K

Sillychap101 retweeted

Dwarkesh Patel

@dwarkesh_sp

1 day ago

Re the Fable ML sandbagging, the model's AI research capabilities were probably at least partly trained on Anthropic employees diffing atop proprietary algos and infra.

So the IP leak is somewhat like a researcher who knows Anthropic's stack getting poached to another lab.

Anthropic's recent "When AI builds itself" post talks about a next-step eval, where they snapshot a research session at the moment a human researcher made a suboptimal next-step choice, show a model only the transcript up to that point and ask what it would do next, then have a hindsight-equipped LLM judge decide whether the model's suggestion or the human's actual choice was better.

This eval seems like a very good RL target for AI R&D - one among many that could be used to have AIs emulate Anthropic researchers and their research products.

I'm just speculating. But if this was a motivation, then Anthropic should have figured out a better way to protect IP than sandbagging without telling the user they're sandbagging, which is very hostile and untrustworthy behavior.

46

1K

52

376

103K

Sillychap101 retweeted

Lucas Beyer (bl16)

@giffmana

3 days ago

Actually it's fine guys! I figured out a way, see below.

Claude Fable 5 is a great model after all, and I also finally appreciate the difference between CLAUDE.md and AGENTS.md.

It's all good. https://t.co/uAtpO4ep8B

42

2K

49

431

175K

Sillychap101 retweeted

matt

@MattVMacfarlane

3 days ago

Was using Fable 5 to write my world model training code. Anthropic flagged it as frontier AI research. The steering vector kicked in and it started implementing JEPA 🤨

61

3K

98

356

235K

Sillychap101 retweeted

Matt Pocock

@mattpocockuk

3 days ago

Everyone's banging on about loops

When they should be thinking about queues

185

1K

60

476

325K

Sillychap101 retweeted

AVB

@neural_avb

3 days ago

My RLM library sitting at 400 stars!

3

103

7

74

10K

Sillychap101 retweeted

Siddharth

@Pseudo_Sid26

3 days ago

College students can spend 20 lakhs on BTech but can't spend even 2000 rupees per month on Claude Code or Codex to work on projects. Let your parents know about the AI HYPE and fcking tell them it's part of the curriculum and fcking get it and learn to use it; you would not want to get left behind. The industry wants people who can ship fast, not people who can write good code in 2 months.

26

198

14

41

16K

Sillychap101 retweeted

Noam Brown

@polynoamial

3 days ago

https://t.co/oWqzT12RtZ

75

3K

393

3K

929K

Sillychap101 retweeted

Peter Steinberger 🦞

@steipete

5 days ago

@matijaoe my slop is better than your slop.

93

2K

77

57

103K

Sillychap101 retweeted

Patrick Jiang

@patpcj

6 days ago

Introducing Harness-1, a 20B search agent trained with a state-externalizing harness.
> frontier-level long-horizon search, rivaling Opus-4.6 and outperforming GPT-5.4
> Context-1-level cost and latency
> externalizes candidates, evidence, verification, and search history
> open-source

89

3K

272

4K

263K

Sillychap101 retweeted

Ben Lang

@benln

5 days ago

Pulled the fastest-growing startups based on X follower growth over the past 90 days:

44

880

74

882

100K

Sillychap101 retweeted

Rahul

@sairahul1

5 days ago

This is the best site on the internet to learn harness engineering.

Free. Completely.

Most AI engineers have never heard the term.

https://t.co/bwDbTTYsjM

Bookmark this site.

Then read this setup ↓ https://t.co/ddEP0XowXM

53

3K

438

6K

439K

Sillychap101 retweeted

Auriel

@aurielws

6 days ago

Modern RL hiring increasingly expects full-stack understanding. If you are an algorithm researcher, people will still ask infrastructure questions. The reverse is also true. — Yep and for practical reasons as many times the harness is why things won’t converge ~~~

1

195

13

264

34K

Sillychap101 retweeted

rody

@0x_rody

6 days ago

Anthropic engineer James Brady: "Every agent in production lies. We measured it. The good ones lie less, the great ones catch the lie before the user does." In 29 minutes, he walks through the verification stack he built and the patterns the Claude Code team adopted to keep agents honest at scale. Watch the full talk, then save the config below👇

43

2K

159

5K

339K

Sillychap101 retweeted

GREG ISENBERG

@gregisenberg

10 days ago

I was once pitching in a board room at a top 3 VC firm for a $15M Series A. 12 people in the meeting. One of the GPs fully fell asleep. Out cold for 30+ minutes. Nobody acknowledged it. Everyone just kept going. I kept presenting my Series A slides to an unconscious man in a Herman Miller chair and somehow that was considered normal. That's venture capital. You might fly across the country to perform for people who may or may not be conscious. It's a dance. And sometimes you lead and sometimes you follow and sometimes your partner is unconscious. If you're raising right now, just know: every founder has a story like this. The process is weird. The power dynamic is weird. You're not crazy for thinking it's weird. No one talks about it because they want to continue raising. But I'm happy to stick my neck out there. It is weird.

405

7K

233

2K

10M

Sillychap101 retweeted

Karri Saarinen

@karrisaarinen

7 days ago

Common @linear workflow we have internally: from @SlackHQ message to merged code in minutes. User asked about MCP team docs support, Linear agent checked the code to verify if it's true, then started a coding session to add it. Code was then reviewed, improved, and merged through Diffs.

20

223

9

269

31K

Sillychap101 retweeted

dax

@thdxr

7 days ago

my worst VC story: [unnamed] partner stopped me mid pitch. this was pre-covid so these were all in person he walked up to me and whispered in my ear "damn ur a hot piece of ass" he smacked my butt and said he wanted my whole seed round i was offended and left his bedroom immediately

98

2K

29

196

333K

Sillychap101 retweeted

Ben Lang

@benln

7 days ago

Skip LinkedIn. Resources to find breakout startups hiring before everyone else:
• Ramp’s monthly vendor reports
• Harmonic’s quarterly Hot 25
• a16z Build newsletter
• Founders You Should Know
• Next Play newsletter
• YC startup directory
• Early Days Substack

58

2K

93

3K

121K

Sillychap101 retweeted

AyalKarmi

@CoolestAK87

16 days ago

Agent-ready ≠ scrapable.

Agent-ready = your site exposes typed tools (search, checkout, inventory)

Install the extension, pick one, and watch an agent actually use it: https://t.co/qrK6oT78BL

191

5K

402

4K

22M

Sillychap101 retweeted

David Ondrej

@DavidOndrej1

8 days ago

"Coding agents are eating software" @skirano (ex-Anthropic)

Pietro Schirano reveals the Codex setup that 10x'd his speed, text commands, multi-agent spawning, agent-first design

He even built a game for a Flipper Zero with one prompt

Here is the full episode:

9

345

21

548

18K
