Kevin Smith @_KevinSmith - Twitter Profile

Pinned Tweet

about 2 months ago

Two kinds of senior engineers right now: One is dabbling with AI and waiting to see how it all shakes out. The other has already pushed the agents to their breaking point, has hard-won opinions about where they fail, and is shipping in hours what used to take weeks. We’re hiring the second one. Fully remote. https://t.co/GGAKVHvXOk

1

2

0

270

_KevinSmith retweeted

Gergely Orosz

@GergelyOrosz

about 9 hours ago

What are "smart" model routers you know of? Services or vendors that take queries and route the most efficient model they deem, saving cost. I sense there is a massive demand for these, and will be even more...

32

123

5

53

20K

_KevinSmith retweeted

Vincenzo @HeHateV

1 day ago

The Internet is sleeping on the Japanese in Nashville but they’re having a time too

39

28K

2K

1K

801K

_KevinSmith retweeted

John Scott-Railton

@jsrailton

1 day ago

NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me https://t.co/f3Aj9TYxU4

jsrailton's tweet photo. NEW: malware developers added nuclear & biological weapons text to to their spyware.

Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner.

Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky.

When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit.

We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted.

In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation.

H/T to colleagues that shared this with me https://t.co/f3Aj9TYxU4

215

12K

2K

4K

1M

Who to follow

EE CONF

@EE_CONF

The ultimate networking and learning hub for all things ExpressionEngine → 100% community-organized → 100% volunteer-run → 100% awesome #eeconf #eecms

Bjørn Børresen

@bjornbjorn

Software developer. CMS connoisseur. #flutter + #dartlang. Also #laravel 🛠Started a company called @LastFriday_no which hired me (luckily).

Ben Croker

@ben_pylo

Founder of https://t.co/1Q0Q6BLsa5 and core maintainer of https://t.co/vfMTOsk3X6

_KevinSmith retweeted

Andrew Qu

@andrewqu

1 day ago

He sounds more like mark zuckerberg than mark zuckerberg sounds like mark zuckerberg

9

2K

42

193

201K

_KevinSmith retweeted

Yaser

@yaser_najjar_en

2 days ago

- Composer 2.5: for $1 it scored 65% - Fable: for $12 it scored 70% Why would I use it Fable for only 5% increase and paying 12x the price? Am I missing something? @jediahkatz

107

2K

38

234

328K

Kevin Smith

@_KevinSmith

2 days ago

👀

Tim Sneath @timsneath

2 days ago

One of my personal favorite features announced at WWDC will I suspect be a sleeper hit: container machines, allowing your Mac to run a lightweight, persistent Linux environment with your home directory and repos automatically mounted: https://t.co/dOBdfOOVxC

228

10K

814

6K

719K

0

51

Kevin Smith

@_KevinSmith

2 days ago

“On June 23, we’ll remove Fable 5 from those plans. Using it after that will require usage credits.” Ah, I see the era of subsidized plans is coming to an end.

Claude

@claudeai

2 days ago

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

5K

103K

14K

22K

53M

0

63

_KevinSmith retweeted

Cursor @cursor_ai

2 days ago

Claude Fable 5 is now available in Cursor. It sets a new state of the art on CursorBench at 72.9%, 8 points above the previous best.

cursor_ai's tweet photo. Claude Fable 5 is now available in Cursor.

It sets a new state of the art on CursorBench at 72.9%, 8 points above the previous best. https://t.co/L3Wm8mSYq9

251

6K

449

675

1M

_KevinSmith retweeted

Matt Pocock

@mattpocockuk

2 days ago

Everyone's banging on about loops When they should be thinking about queues

185

1K

60

477

323K

_KevinSmith retweeted

Kush

@kushbhuwalka

3 days ago

I’m pretty AI pilled. This loop stuff is slop. I respect @steipete for his innovation - but openclaw is a bloated unstable pile of garbage because of stuff like this. I’m all for loops of crons and webhooks where an AI agent wakes up and performs some task like cleanup, or updates the docs or triages errors. I think these are great for standard well defined tasks with a fairly deterministic route (a.k.a workflows). I think what these guys are talking about now is jumping the gun. The models need to be guided, and you want to atleast skim their output so you don’t end up with slop. Humans are far better planners and architects than models. You absolutely shouldn’t delegate away prompting and reviews in my opinion. this encourages the creation of crappy buggy unsafe software that actually hurts adoption.

72

649

37

101

39K

_KevinSmith retweeted

David K 🎹

@DavidKPiano

4 days ago

Too many developers don't understand what "compounding slop" is. A loop that prompts agents is a great way to automate slop creation. Constrain the state-action space so the loop can't drift, then automate inside it. Human-in-the-loop = feature, not bottleneck.

40

413

33

57

29K

_KevinSmith retweeted

Ed Andersen

@edandersen

3 days ago

Reminder that OpenClaw abusing flat rate subscriptions by commandeering oauth tokens is why the industry has moved to token based billing and ruined things for everyone without huge deep unlimited pockets

71

2K

56

145

193K

Kevin Smith

@_KevinSmith

4 days ago

I hope they never take Opus 4.6 away. They nailed it with that one. Fantastic coding model with Claude Code.

0

48

_KevinSmith retweeted

Blake Burge

@blakeaburge

4 days ago

A rule that will improve your life: Never assume bad intent, but don’t ignore repeated patterns. Everyone gets grace. Nobody gets unlimited passes.

74

14K

3K

2K

206K

_KevinSmith retweeted

jason

@jxnlco

5 days ago

I need Google Docs but just for markdown files. Multiplayer comments. Syncing resolving comments. Suggestion mode Edit mode Edit history Maybe some sense of multi edits. Easy cli access.

287

2K

27

1K

492K

_KevinSmith retweeted

Steve Bauman

@heystevebauman

7 days ago

Y’all must be merging absolutely monstrous shit code if you’re running agents “24/7” The models are more powerful than ever before but I’m still directing and touching their work constantly. I don’t think I’ve ever actually “one-shot” anything of substance

11

37

3

0

4K

_KevinSmith retweeted

Joe Consorti

@JoeConsorti

6 days ago

Bitcoin isn't crashing below $60k because Saylor sold 32 BTC. It's crashing because $19 trillion of new AI market cap got created in 12 months... 13x the size of Bitcoin. The most liquid risk asset on earth is being drained to fund the biggest IPO cycle since 2000.

315

5K

434

1K

835K

_KevinSmith retweeted

htmx.org / CEO of Bad Flags (same thing)

@htmx_org

6 days ago

i know people have fed https://t.co/F8etT392fz into LLMs and said "talk and think like this" and as a result have cut token consumption and gotten push back on overly complicated features

9

46

1

2

4K

_KevinSmith retweeted

Peter H. Diamandis, MD

@PeterDiamandis

8 days ago

Optimism is not the belief that everything will be fine, but the belief that problems are SOLVABLE, combined with the willingness to actually go solve them. That thesis has built every good thing we have.

87

1K

252

199

50K

_KevinSmith retweeted

jon allie

@jonallie

7 days ago

@ShrekOverflow My experience is that these things happen because of a failure to _actually_ prioritize, or because of repeatedly prioritizing short term work. If you truly set a priority, and stick with it, it won't be a mystery to anyone.

0

5

1

0

212

Kevin Smith

@_KevinSmith

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users