Dan Constantini

Verified account

@danoandco

Building (YC S25) Co-founded Chainlit

Joined June 2016

1.1K Following

385 Followers

356 Posts

Dan Constantini

about 8 hours ago

Claude Tag is great for delegation, but personal chat remains valuable for clarification

Sometimes you are brainstorming and this requires fast back-and-forth, correction, and reframing

So a useful pattern:
1. use https://t.co/HikqArMp0f to clarify the intent in a private chat
2. ask it to send the clarified task to the right Slack channel via @SlackHQ MCP, tagging @Claude

private clarification first => shared execution second

0

2

0

0

42

Dan Constantini

2 days ago

@levie so good!

0

0

0

0

139

Dan Constantini

24 days ago

@utpalnadiger @twill_ai thank you!

0

0

0

0

50

Dan Constantini

26 days ago

Introducing https://t.co/ymuN68eFXG Dev Station: every workspace now has a persistent, multi-repo dev station that tasks fork from.

@twill_ai station is now the source of truth for the project: repos connected, dependencies installed, env vars configured, dev/test servers running, and project context warm. Agents and humans can both configure it, so the workspace keeps getting closer to the way your team actually builds software.

Tasks are no longer independent cold starts. Each task forks from the station’s current working environment into its own isolated sandbox. Powered by @daytonaio.

That means:
• No cold-start, so tasks start/finish faster.
• Multi-repo work across frontend, API, and shared libraries.
• Less setup work for agents, which means fewer tokens spent re-setting the dev env.

This is the model we think coding agents need: a persistent workspace that stays warm, then forks cleanly for each job.

Try it at https://t.co/BjDTPcquKi
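
The fork model described in the post above can be illustrated with a small in-memory sketch. This is not Twill's or Daytona's implementation: `Station`, `Sandbox`, and `fork` are hypothetical names, and a deep copy stands in for whatever snapshotting the real sandboxes use. It only shows the idea that each task starts from the station's current state and stays isolated from it.

```python
import copy
from dataclasses import dataclass, field


@dataclass
class Station:
    """Hypothetical stand-in for a persistent, warm workspace."""
    repos: dict[str, str] = field(default_factory=dict)     # repo name -> checked-out ref
    env_vars: dict[str, str] = field(default_factory=dict)
    deps_installed: bool = False

    def fork(self, task_id: str) -> "Sandbox":
        # Deep-copy the current state so the task starts warm but isolated.
        return Sandbox(task_id=task_id, state=copy.deepcopy(self))


@dataclass
class Sandbox:
    """Hypothetical per-task copy of the station state."""
    task_id: str
    state: Station


station = Station(
    repos={"frontend": "main", "api": "main"},
    env_vars={"NODE_ENV": "development"},
    deps_installed=True,
)

task_a = station.fork("task-a")
task_b = station.fork("task-b")

# A change made inside one task stays inside that task's sandbox.
task_a.state.repos["api"] = "feature/login"

# Updating the station only affects tasks forked afterwards.
station.env_vars["FEATURE_FLAG"] = "on"
task_c = station.fork("task-c")
```

In this sketch `task_b` and the station still see `api` on `main`, and only `task_c` inherits the new environment variable, which mirrors the "forks from the station's current working environment" wording.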

3

28

5

24

7K

Dan Constantini

26 days ago

@ivanburazin Thank you @ivanburazin! Wouldn't be possible without Daytona

0

0

0

0

75

Dan Constantini

26 days ago

@benswerd @twill_ai thx, will check out the docs!

0

0

0

0

80

Dan Constantini

about 2 months ago

@eLkay0027 we don't think code is a moat, so agentbox is worth sharing with the community. kind of like vercel AI SDK

0

0

0

0

32

Dan Constantini

about 2 months ago

https://t.co/ymuN68fdNe update: new runtime, new features!

We rebuilt the runtime that drives every Twill task and open-sourced it as agentbox 👉 https://t.co/u3oTBHrr2W

New:
- Live preview pane: see your app run, open a terminal, watch entrypoint logs
- Reasoning level control (per task)
- Token-level streaming of agent responses
- Message editing (rewind a run from any prior message)

Faster:
- Sandbox setup ~5x faster
- Follow-up response start ~4x faster

More info here: https://t.co/ZFdWS5kvfX

1

7

3

2

375

Dan Constantini

about 2 months ago

@pmarca “Don’t be a Yes Man”

0

0

0

0

696

Dan Constantini

2 months ago

@_felx is there something @twentycrm can't do

0

1

0

0

74

Dan Constantini

2 months ago

@_felx @thomasdesfrancs amazing work @thomasdesfrancs !!

1

0

0

0

250

Dan Constantini

2 months ago

@_felx @thomasdesfrancs Mythos yessss!!!

1

1

0

0

174

Dan Constantini

3 months ago

@gabepereyra cool! what was the build vs buy rationale?

0

3

0

0

660

Dan Constantini

3 months ago

AI research gets smarter, so progress gets faster. This becomes an environments + success criteria game around verifiable objectives. Exponential slope incoming.

3 months ago

Quoted post by scaling01:

Mythos speeds up AI research by up to 400 times

A 300X speedup over the baseline requires 40 hours of work by a human expert

It also clears the >8h threshold of human equivalent work time on ALL tasks! https://t.co/vg0lHAvwAF

50

2K

153

412

86K

0

2

1

0

161

Dan Constantini

3 months ago

@alexalbert__ any clues on price or latency?

0

1

0

0

3K

Dan Constantini

3 months ago

@benhylak
1/ new paradigms when we get to 1k tks/s, then 10k tks/s
2/ objective-driven development, powered by long-running agents in a sandbox

0

0

0

0

832

Dan Constantini

3 months ago

New mode in Twill: Claude codes, Codex reviews. Until they converge.

Ralph loop is a new opt-in mode for complex tasks where you want more rigor than a single agent pass. You select it and set a budget when you create the task.

𝗛𝗼𝘄 𝗶𝘁 𝘄𝗼𝗿𝗸𝘀

1. You set a budget and describe the task
2. A criteria agent explores your repo and proposes acceptance criteria
3. You review, refine, and approve
4. Claude implements against the criteria
5. Codex verifies the result against the criteria. Pass or fail.
6. On fail, the feedback goes back in. Claude continues to work.
7. Loop runs until criteria pass or the budget runs out.

Two things make this different from a normal agent run.

𝗩𝗲𝗿𝗶𝗳𝗶𝗮𝗯𝗹𝗲 𝗰𝗿𝗶𝘁𝗲𝗿𝗶𝗮 𝗯𝗲𝗳𝗼𝗿𝗲 𝗰𝗼𝗱𝗲. Separating "what does done look like" from "write the code" forces the ambiguity to surface upfront, in a document you can read and edit, instead of mid-implementation when it's expensive.

𝗖𝗿𝗼𝘀𝘀-𝗺𝗼𝗱𝗲𝗹 𝘃𝗲𝗿𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻. The verifier gets the criteria and the repo state. It has no memory of the implementation decisions, no attachment to the approach. It reads the code cold, the same way a reviewer sees a PR for the first time. A model checking its own output tends to confirm what it intended, not what it produced. A different model doesn't have that bias.

The name comes from @GeoffreyHuntley's Ralph loop pattern, a bash one-liner that runs a coding agent in a tight loop with full context resets. Same iterative philosophy. Different mechanism: structured criteria up front, cross-model verification at each pass.
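
Steps 4 to 7 above can be sketched as a small control loop. This is a minimal illustration, not Twill's code: `ralph_loop`, `Verdict`, and the two toy callables are hypothetical, and the budget is counted in passes here as an assumption (the post does not say what unit the budget uses). The point it shows is that the verifier receives only the criteria and the repo state, while the implementer receives the criteria plus the last feedback.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Verdict:
    passed: bool
    feedback: str = ""


def ralph_loop(
    criteria: list[str],
    implement: Callable[[list[str], str], str],
    verify: Callable[[list[str], str], Verdict],
    budget: int,
) -> tuple[bool, int]:
    """Run implement/verify passes until the criteria pass or the budget is spent.

    Returns (passed, passes_used). The budget is a number of passes (assumption).
    """
    feedback = ""
    for used in range(1, budget + 1):
        repo_state = implement(criteria, feedback)  # coder sees criteria + last feedback
        verdict = verify(criteria, repo_state)      # verifier sees only criteria + repo state
        if verdict.passed:
            return True, used
        feedback = verdict.feedback                 # failure feedback goes back in
    return False, budget


# Toy stand-ins so the sketch runs without any model calls.
def fake_implement(criteria: list[str], feedback: str) -> str:
    return "code with tests" if "tests" in feedback else "code"


def fake_verify(criteria: list[str], repo_state: str) -> Verdict:
    if "tests" in repo_state:
        return Verdict(passed=True)
    return Verdict(passed=False, feedback="missing tests")


result = ralph_loop(["has tests"], fake_implement, fake_verify, budget=5)
```

With these toy callables the first pass fails, the feedback is fed back in, and the second pass succeeds. In a real setup `implement` and `verify` would wrap calls to two different models, which is what gives the verifier its "cold read" of the code.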

0

6

4

0

360

Dan Constantini

3 months ago

my most common instruction right now is forcing the main agent to use subagents

"use an async subagent expert in X to do Y" or "use a team of agents" for researching, planning and coding-reviewing

0

2

0

0

94

danoandco retweeted

Logan Kilpatrick

@OfficialLoganK

3 months ago

Introducing Gemma 4, our series of open weight (Apache 2.0 licensed) models, which are byte for byte the most capable open models in the world!

Gemma 4 is built to run on your hardware: phones, laptops, and desktops.

Frontier intelligence with a 26B MOE and a 31B Dense model! https://t.co/PVtYRnKQW0

286

6K

587

1K

526K

Dan Constantini

3 months ago

@tryhardfifi Mixture of experts!

0

0

0

0

17

Dan Constantini

3 months ago

Introducing Twill Clone 🤖

Clone any web app. Fully tested. Pixel-perfect output. Full source code.

Point Twill at any public URL and watch our agent reverse-engineer user stories, design system and inner logic, and ship a perfect rebuild.

Try it: https://t.co/uxVZQrF9iN

2

8

3

2

417
