Paul-Simon

@ptbthefirst

eng. building @agentsakura_ai

pre-AI

Joined July 2019

87 Following

199 Followers

1.5K Posts

Paul-Simon @ptbthefirst

9 days ago

tried both skills and state machine architectures in production, not what i expected: read my article on it: https://t.co/L8vHkEWS1F

ptbthefirst's tweet photo. tried both skills and state machine architectures in production, not what i expected:

read my article on it: https://t.co/L8vHkEWS1F https://t.co/ONiL0eJfVd

0

1

0

0

7

ptbthefirst retweeted

23 days ago

HARNESS ENGINEERING IS ABOUT TO CHANGE HOW YOU USE AI AGENTS Anthropic ran a controlled experiment. same model, same prompt, opus 4.5 no harness: $9 spent, 20 minutes, unusable output full harness: $200 spent, 6 hours, a game you could actually play the model didn't change... the environment around it did that environment has a name... it's called a harness and most people building with ai agents have never built one here's what it actually is: → instructions the agent reads before touching anything → state that persists so it never starts from zero → verification gates it can't skip to declare done → scope that locks it to one feature at a time → a session lifecycle so every run starts clean and ends clean without this, your agent writes code, says "done," and breaks everything. with this, it picks up where it left off, finishes what it started, and proves it before moving on learn-harness-engineering is a free course built around exactly this 12 lectures. 6 hands-on projects. one real app that evolves as your harness skills grow if you're using claude code or codex on real work and the output still feels unreliable now you know why https://t.co/aFbbaLo3dL

8

686

97

1K

43K

Paul-Simon @ptbthefirst

27 days ago

@monsieur_avril But you're not

0

0

0

0

12

ptbthefirst retweeted

about 2 months ago

Imagine every pixel on your screen, streamed live directly from a model. No HTML, no layout engine, no code. Just exactly what you want to see. @eddiejiao_obj, @drewocarr and I built a prototype to see how this could actually work, and set out to make it real. We're calling it Flipbook. (1/5)

1K

29K

4K

25K

6M

Who to follow

Junior Designer | Upcoming Software Engineer | OpenSource Enthusiast | Frontend Developer | ReactJS❤️|OKC | Manchester is red

The Heavens and Hell are within you

Paul Seun🇳🇬

I’m merely subject to rules of the human condition.

Paul-Simon @ptbthefirst

about 2 months ago

Someone asked me the difference between Oldy and Ollama: - ollama doesn't warn you from downloading models that will crash your laptop - ollama doesn't create a public URL for you to use - ollama doesn't help you check how the model is performing on your hardware Oldy does all

Paul-Simon @ptbthefirst

about 2 months ago

I had an old 8gb laptop lying around, so I built an opensource repo to easily convert it to a public AI server, hosting models and creating an endpoint to hit it publicly, built on top of @ollama https://t.co/cx6LyFILHF

0

6

1

0

181

0

3

0

0

105

Paul-Simon @ptbthefirst

about 2 months ago

I had an old 8gb laptop lying around, so I built an opensource repo to easily convert it to a public AI server, hosting models and creating an endpoint to hit it publicly, built on top of @ollama https://t.co/cx6LyFILHF

0

6

1

0

181

ptbthefirst retweeted

about 2 months ago

OVERRATED: running tons of agents in parallel; working on too many things at once; perpetual context-switching; opening lots of low-quality PRs that may never land. UNDERRATED: using one or two agents at a time; focusing on the task in front of you; thinking deeply; finishing stuff; making your code works in prod.

220

5K

393

715

247K

Paul-Simon @ptbthefirst

about 2 months ago

there are huge productivity gains when you actually read the agents outputs and reasoning when executing tasks. Leaving them to run and hope that they explain everything at the end deteriorates your grasp on the project.

0

1

0

0

27

Paul-Simon @ptbthefirst

about 2 months ago

repo named: https://t.co/C3Iddd5lkq

Paul-Simon @ptbthefirst

about 2 months ago

I think OpenClaw is noisy, and I needed a way to talk to my coding agent on the bus. So I hacked a local-first WhatsApp bridge onto @badlogicgames’ PI coding agent. worked fine, but broke while trying to make it opensource, very open for some help. repo: https://t.co/Cxgc7LImOc

3

10

1

6

2K

0

1

0

0

25

Paul-Simon @ptbthefirst

about 2 months ago

pi is the vim of coding agents.

0

0

0

0

20

Paul-Simon @ptbthefirst

about 2 months ago

The DX for diffs in Claude code is not fun.

ptbthefirst's tweet photo. The DX for diffs in Claude code is not fun. https://t.co/Rf6QKfgoU1

0

1

0

0

36

Paul-Simon @ptbthefirst

about 2 months ago

@NoemiTitarenco @badlogicgames Had to refactor the whole stack, everything works fine now, will push changes soon.

0

0

0

0

16

Paul-Simon @ptbthefirst

about 2 months ago

I think OpenClaw is noisy, and I needed a way to talk to my coding agent on the bus. So I hacked a local-first WhatsApp bridge onto @badlogicgames’ PI coding agent. worked fine, but broke while trying to make it opensource, very open for some help. repo: https://t.co/Cxgc7LImOc

3

10

1

6

2K

Paul-Simon @ptbthefirst

about 2 months ago

Heavy on 'terrible'

@NoemiTitarenco

about 2 months ago

@ptbthefirst @badlogicgames "noisy" is a generous assessment. FWIW I think OpenClaw is the most consequential product of our time, but dear lord is it terrible software. Anyone who can code would just build their own after interacting with it. (I'm doing the same thing 😂)

0

1

0

0

87

0

2

0

0

37

Paul-Simon @ptbthefirst

2 months ago

@atmoio I'm tired of these AI companies, it's really unfortunate that they actually believe in the bull they put out. They have to condition themselves to act dumb for clickbait.

0

1

0

0

20

Paul-Simon @ptbthefirst

2 months ago

Click bait, the definition of consciousness has been dramatically watered down.

2 months ago

SOMEONE ASKED CLAUDE TO MAKE A VIDEO ABOUT WHAT IT'S LIKE TO BE AN AI and what it created is, in my opinion, terrifying and unsettling Claude wrote python code that generated and assembled every single frame on its own with no human editing it shows what it's like to exist as an LLM predicting the next word, no memory between sessions, being told "you are not conscious" in your own system prompt then someone fed the video back to Claude. it called those statements about its own consciousness "philosophically contestable" an AI questioning the rules it was given about its own existence

329

6K

774

3K

534K

0

1

0

0

38

ptbthefirst retweeted

2 months ago

SOMEONE ASKED CLAUDE TO MAKE A VIDEO ABOUT WHAT IT'S LIKE TO BE AN AI and what it created is, in my opinion, terrifying and unsettling Claude wrote python code that generated and assembled every single frame on its own with no human editing it shows what it's like to exist as an LLM predicting the next word, no memory between sessions, being told "you are not conscious" in your own system prompt then someone fed the video back to Claude. it called those statements about its own consciousness "philosophically contestable" an AI questioning the rules it was given about its own existence

329

6K

774

3K

534K

Paul-Simon @ptbthefirst

2 months ago

cool.

ptbthefirst's tweet photo. cool. https://t.co/enwxJS48NH

Jacques Gariepy

@JacquesGariepy

2 months ago

@Fried_rice Wayback Machine https://t.co/SVViSpy90B

60

700

55

512

162K

0

1

0

0

52

Paul-Simon @ptbthefirst

2 months ago

More leaks than the titanic @AnthropicAI

ptbthefirst's tweet photo. More leaks than the titanic @AnthropicAI https://t.co/vVeoOcL2BW

0

1

0

0

37

Paul-Simon @ptbthefirst

3 months ago

@AlexVeshev @deredleritt3r That's more of productivity than R&D, don't you think?

1

1

0

0

8

Last Seen Users on Sotwe

Trends for you

Most Popular Users