LSBusakwe @LSBusakwe - Twitter Profile

LSBusakwe retweeted

Codez

@0xCodez

1 day ago

https://t.co/eJB3TgyJwV

16

717

121

2K

143K

LSBusakwe retweeted

Boris Cherny

@bcherny

about 22 hours ago

We talk a lot about how important it is to set up self-verification loops. Especially in the age of powerful models that can run for long periods of time, self-verification is a key ingredient that enables the model to run for much longer, delivering a result that is closer to what you intended, so you can do more without having to constantly check in on Claude as it works. @delba_oliveira gives a great breakdown of what that looks like and why it matters

58

1K

109

2K

171K

LSBusakwe retweeted

Nainsi Dwivedi

@NainsiDwiv50980

about 24 hours ago

The biggest unlock in Opus 4.8 isn't the model. It's Dynamic Workflows inside Claude Code. Here's what that means in plain English: Old way → you give Claude one task, it does one task. New way → Claude plans the entire project, breaks it into subtasks, spawns hundreds of parallel subagents, runs them all simultaneously, then synthesizes everything back into one output. Real example of what this handles: → Migrate 100,000+ lines of code in a single session → Refactor an entire codebase while keeping tests green → Build, test, and document a full feature in one run This is not "AI autocomplete." This is an AI engineering team working for you.

NainsiDwiv50980's tweet photo. The biggest unlock in Opus 4.8 isn't the model.

It's Dynamic Workflows inside Claude Code.

Here's what that means in plain English:

Old way → you give Claude one task, it does one task.

New way → Claude plans the entire project,
breaks it into subtasks,
spawns hundreds of parallel subagents,
runs them all simultaneously,
then synthesizes everything back into one output.

Real example of what this handles:
→ Migrate 100,000+ lines of code in a single session
→ Refactor an entire codebase while keeping tests green
→ Build, test, and document a full feature in one run

This is not "AI autocomplete."
This is an AI engineering team working for you.

8

32

6

27

5K

LSBusakwe retweeted

Addy Osmani

@addyosmani

2 days ago

https://t.co/hIe0UX7z6T

236

5K

738

13K

956K

Who to follow

LSBusakwe retweeted

2 days ago

Prompt engineering has been replaced by loop engineering. What is it? (Explained in 60 seconds) For the past 2 years we have been prompting agents with individual tasks. That is starting to change. So far, if you wanted an agent to build a dashboard for a client, you would give it a task, review the output, improve the prompt, and repeat the process until the work was done. Looping changes that. Instead of giving an agent individual tasks, you give it a goal and let it work through a recursive loop until that goal is met. For example: → Research → Draft → Evaluate → Test → Improve → Repeat The agent keeps cycling through the loop until it reaches the standard you defined. Within loop engineering there are two main approaches: 1. Open Looping You give the agent a goal and allow it significant freedom in how it achieves it. This is powerful, but also expensive and harder to control. 2. Closed Looping The human defines the architecture, constraints and evaluation criteria. The agent is then responsible for executing, improving and iterating within those boundaries until the goal is reached. The next evolution is orchestrated looping. Instead of a single agent running a loop, one agent breaks the goal into smaller tasks and assigns them to specialist agents. Each specialist runs its own loop and reports back. In other words: You move from one agent improving itself to an entire team of agents iterating together until the goal is achieved.

Maxsteinbrenner's tweet photo. Prompt engineering has been replaced by loop engineering.
What is it? (Explained in 60 seconds)

For the past 2 years we have been prompting agents with individual tasks. That is starting to change.

So far, if you wanted an agent to build a dashboard for a client, you would give it a task, review the output, improve the prompt, and repeat the process until the work was done.

Looping changes that.

Instead of giving an agent individual tasks, you give it a goal and let it work through a recursive loop until that goal is met.

For example:

→ Research
→ Draft
→ Evaluate
→ Test
→ Improve
→ Repeat
The agent keeps cycling through the loop until it reaches the standard you defined.
Within loop engineering there are two main approaches:
1. Open Looping
You give the agent a goal and allow it significant freedom in how it achieves it.
This is powerful, but also expensive and harder to control.

2. Closed Looping

The human defines the architecture, constraints and evaluation criteria.

The agent is then responsible for executing, improving and iterating within those boundaries until the goal is reached.

The next evolution is orchestrated looping.

Instead of a single agent running a loop, one agent breaks the goal into smaller tasks and assigns them to specialist agents.

Each specialist runs its own loop and reports back.

In other words:

You move from one agent improving itself to an entire team of agents iterating together until the goal is achieved.

15

590

93

713

53K

LSBusakwe retweeted

Khairallah AL-Awady

@eng_khairallah1

8 days ago

Anthropic engineer: "You're not supposed to prompt Claude. You're supposed to build a system that prompts itself." this is one of the best workflows I've seen in a long time in this video she breaks down exactly how most people are using Claude: - the 14% you lose to CLAUDE.md before typing a word - the automation workflows most users don't know exist - the daily task pipelines that run without touching the keyboard - the daily workflows Anthropic's own engineers automated first if you've been using Claude for more than a month and never left the chat window, you've been using one agent when you could be running a team of them instead of another show tonight, watch this make sure to bookmark it before it gets lost in your feed the guide is in the article below

97

4K

541

12K

905K

LSBusakwe retweeted

Mr.Un1k0d3r @MrUn1k0d3r

7 days ago

I decided to publish my internal Azure Entra ID tool. There are a lot of these already available, but I've added some interesting features that have made a difference for me over the years. You can capture token through the browser using playwright https://t.co/xiZaz0PKsC #Azure

0

290

81

216

14K

LSBusakwe retweeted

Codez

@0xCodez

10 days ago

https://t.co/zCuvtRzera

10

246

35

545

124K

LSBusakwe retweeted

Johan Theo @tec_johan5

10 days ago

How To Never Hit Claude’s Limits Follow me (@tec_johan5 )

38

249

93

157

7K

LSBusakwe retweeted

AilaunchX

@Ai_Tech_tool

10 days ago

ANDREJ KARPATHY COULD HAVE CHARGED $2,000 FOR THIS COURSE. He put it on YouTube. The full training stack. Tokenization. Neural network internals. Hallucinations. Tool use. Reinforcement learning. RLHF. DeepSeek. AlphaGo. 3 hours of the most comprehensive LLM education that exists anywhere at any price. Not how to use the tools. How the entire system was built from the ground up and why it behaves the way it does. The engineers who understand this build things the ones who only use the tools cannot even conceive of. The gap between those two groups is not 3 hours. It is everything those 3 hours quietly unlock for the rest of your career.

8

141

31

153

19K

LSBusakwe retweeted

Asteri

@Asteri_eth

11 days ago

Karpathy found a way to reduce token consumption by 90% The problem is that the LLM re-reads the same files over and over again, loses context between documents, and provides less accurate answers as a result The solution is called Wiki Layer the LLM cleans, structures, and links all your data once, after which it never works with raw files again Three folders `raw/` for originals, `wiki/` for a clean knowledge base in Markdown, and files with rules for the agent Result up to 90% token savings on repeat queries, automatic links between documents, and a visual knowledge graph in Obsidian Everything stays on your local machine nothing goes to the cloud

154

4K

422

9K

1M

LSBusakwe retweeted

Sulekha Tripathi

@sulekhat95

10 days ago

Anthropic engineer: "You can build 5 assistants in one afternoon. Each one handles a task you've been doing manually every single day." In 45 minutes he builds 5 focused agents from scratch on camera. Most people are still doing code review, testing, and documentation by hand every single day

4

45

8

66

5K

LSBusakwe retweeted

Ishika Rawat

@Ishh_021

10 days ago

Claude FULL COURSE 1 HOUR (Build & Automate Anything)

20

693

161

674

41K

LSBusakwe retweeted

Codez

@0xCodez

10 days ago

Anthropic Managed Agents engineer revealed how to build a production-ready agent team in one session. 26-minutes. free. by Managed Agents team. here's what they cover: • 4 blocks: Agent, Environment, Session, Events • outcomes - Claude iterates a rubric until it passes • self-hosted sandboxes on Cloudflare, Modal, Vercel • live observation - every tool call, every subagent most people are still rebuilding agent infrastructure from scratch - while the people who get this ship real agents in one afternoon watch the full workshop, then read the article below

15

189

34

243

21K

LSBusakwe retweeted

Nicolas Krassas

@Dinosn

11 days ago

Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks. https://t.co/QTnbJID5UK

8

1K

224

1K

81K

LSBusakwe retweeted

Movez

@0xMovez

10 days ago

Anthropic team member just revealed the 3 layers that turn Claude into a self-running agent team. 36 minutes. free. by Claude Agents engineer. here's what he covers: • verification - Claude checks its own work • multi-сlaude - many agents in parallel background • loops - keyboard out of the hot • path routines - prompts that run themselves  most people babysit one agent at a time - while the people who get this delegate entire workflows to running loops Watch master class, then read the article below ↓

24

269

31

323

44K

LSBusakwe retweeted

Khairallah AL-Awady

@eng_khairallah1

10 days ago

Anthropic engineer: "You're not supposed to prompt Claude. You're supposed to build a system that prompts itself." this is one of the best workflows I've seen in a long time in this video she breaks down exactly how most people are using Claude: - the 14% you lose to CLAUDE.md before typing a word - the plugins that 95% of users have never installed - the workflows that run without you typing a single prompt - why typing one prompt and closing the tab is leaving 90% on the table if you've been using Claude for months and still start every session from scratch, you have at least 28 untouched features. probably 30 instead of another show tonight, watch this make sure to bookmark it before it gets lost in your feed full guide in the article below

64

3K

389

8K

550K

LSBusakwe retweeted

Codez

@0xCodez

11 days ago

Anthropic senior engineer just revealed how to use Claude Code at 110% in a 47-minute session. 47-minutes. free. by Anthropic team. worth more than any $500 vibe-coding course here's what she covers: • 3 things Claude needs: access, knowledge, tooling • git worktrees + one Claude each = a team you manage • /loop that babysits PRs and fixes CI overnight • agents that message each other while you sleep most people use Claude on one repo in one window - while the people who get this run teams of agents in parallel watch the full talk, then read the article below

26

438

75

719

70K

LSBusakwe retweeted

Sumanth

@Sumanth_077

11 days ago

Microsoft just turned SKILL .md into a trainable object! SkillOpt is a text-space optimizer for agent skills. Instead of hand-writing or one-shot generating your SKILL .md, SkillOpt treats the skill document as the trainable external state of a frozen agent and optimizes it through a feedback loop. The core idea: a separate optimizer model analyzes agent rollout trajectories, proposes bounded add/delete/replace edits to the skill document, and accepts only edits that strictly improve performance on a held-out validation split. Rejected edits go into a buffer as negative feedback for future iterations. The deep learning analogy is intentional. Rollout batch is your training data. Edit budget is your learning rate. Validation gate is your validation set. Rejected-edit buffer is your negative feedback signal. The optimizer runs offline. The deployed artifact is just a static SKILL .md file. Results on GPT-5.5 across 6 benchmarks: +23.5 points average over no-skill baseline in direct chat, +24.8 inside Codex, +19.1 inside Claude Code. SpreadsheetBench jumped from 41.8 to 80.7. OfficeQA from 33.1 to 72.1. Best or tied-best on 52 of 52 evaluated cells. What's striking: these gains come from just 1-4 accepted edits. The final skill stays compact at 300-2000 tokens. One accepted edit gave OfficeQA a +39 point gain. Optimized skills also transfer. A SpreadsheetBench skill trained in Codex transferred to Claude Code with a +59.7 point gain. Skills trained on GPT-5.4 improved every smaller GPT variant tested. Key capabilities: • Text-space skill optimization with no model weight updates • Bounded add/delete/replace edits with validation gating • Rejected-edit buffer as negative feedback • Epoch-wise slow/meta update for longer-horizon learning • Works across Claude Code, Codex, and direct chat harnesses • Optimized skills transfer across models, harnesses, and benchmarks 100% Open Source I've shared the link to the paper and repo in the comments!

Sumanth_077's tweet photo. Microsoft just turned SKILL .md into a trainable object!

SkillOpt is a text-space optimizer for agent skills. Instead of hand-writing or one-shot generating your SKILL .md, SkillOpt treats the skill document as the trainable external state of a frozen agent and optimizes it through a feedback loop.

The core idea: a separate optimizer model analyzes agent rollout trajectories, proposes bounded add/delete/replace edits to the skill document, and accepts only edits that strictly improve performance on a held-out validation split. Rejected edits go into a buffer as negative feedback for future iterations.

The deep learning analogy is intentional. Rollout batch is your training data. Edit budget is your learning rate. Validation gate is your validation set. Rejected-edit buffer is your negative feedback signal. The optimizer runs offline. The deployed artifact is just a static SKILL .md file.

Results on GPT-5.5 across 6 benchmarks: +23.5 points average over no-skill baseline in direct chat, +24.8 inside Codex, +19.1 inside Claude Code. SpreadsheetBench jumped from 41.8 to 80.7. OfficeQA from 33.1 to 72.1. Best or tied-best on 52 of 52 evaluated cells.

What's striking: these gains come from just 1-4 accepted edits. The final skill stays compact at 300-2000 tokens. One accepted edit gave OfficeQA a +39 point gain.

Optimized skills also transfer. A SpreadsheetBench skill trained in Codex transferred to Claude Code with a +59.7 point gain. Skills trained on GPT-5.4 improved every smaller GPT variant tested.

Key capabilities:

• Text-space skill optimization with no model weight updates
• Bounded add/delete/replace edits with validation gating
• Rejected-edit buffer as negative feedback
• Epoch-wise slow/meta update for longer-horizon learning
• Works across Claude Code, Codex, and direct chat harnesses
• Optimized skills transfer across models, harnesses, and benchmarks

100% Open Source

I've shared the link to the paper and repo in the comments!

10

148

40

181

18K

LSBusakwe retweeted

Arun Sekhar @rcarunmsft

12 days ago

Customers kept asking how to run Claude on Microsoft Foundry inside their own Azure tenant, with the IaC their team already uses. Packaged the answer into a starter kit. One azd up command and you are done. https://t.co/owepu6GCkA #Anthropic #Azure

rcarunmsft's tweet photo. Customers kept asking how to run Claude on Microsoft Foundry inside their own Azure tenant, with the IaC their team already uses.

Packaged the answer into a starter kit. One azd up command and you are done.

https://t.co/owepu6GCkA

#Anthropic #Azure https://t.co/dZacKNQnrD

3

93

12

107

6K

LSBusakwe

@LSBusakwe

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users