Harsh Verma

12 days ago

Human-in-the-loop undetectable browser-use!

kindaharsh retweeted

12 days ago

what if your AI agent could call a human as a tool? wired browser-handoff into a @browser_use agent doing a shopping checkout. when it hits the login wall or card form, it raises its hand. discord ping, I take over, agent resumes.

kindaharsh retweeted

Burhan

@BurhannKhatri

11 days ago

No barriers with the Agent handoff by @kindaharsh, amazing work

12 days ago

open source, drops into any browser-use agent as a custom action. repo: https://t.co/rzSVxPFPYO · full run (1 min): https://t.co/HhZ6t95dHO

12 days ago

the agent decides when to ask — not the library. it has a `request_human_help(reason, done_when)` tool. when it hits something it shouldn't do (credentials, card), it raises its hand and waits. I typed nothing outside the login + card form. agent did the rest.
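
A minimal sketch of what a tool with this shape could look like. This is not browser-handoff's actual implementation: here `done_when` is assumed to be a callable that returns `True` once the human has finished, and `notify`, `poll_interval`, `timeout` and `HandoffResult` are hypothetical names added for illustration.

```python
import time
from dataclasses import dataclass
from typing import Callable


@dataclass
class HandoffResult:
    """Hypothetical return value describing how the handoff ended."""
    reason: str
    completed: bool
    waited_seconds: float


def request_human_help(
    reason: str,
    done_when: Callable[[], bool],          # assumption: a polling predicate
    notify: Callable[[str], None] = print,  # assumption: e.g. a Discord webhook sender
    poll_interval: float = 0.5,
    timeout: float = 300.0,
) -> HandoffResult:
    """Notify a human, then block until done_when() is true or the timeout hits."""
    notify(f"Agent needs help: {reason}")
    start = time.monotonic()
    while time.monotonic() - start < timeout:
        if done_when():
            # The human finished; the agent can pick up where it left off.
            return HandoffResult(reason, True, time.monotonic() - start)
        time.sleep(poll_interval)
    # Nobody completed the step in time; the caller decides what to do next.
    return HandoffResult(reason, False, time.monotonic() - start)
```

The agent would call something like this only when it reaches a step it should not perform itself, such as typing credentials or card details, and would continue once the call returns with `completed=True`.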

kindaharsh retweeted

Neal Chopra

@nealchopra

24 days ago

A lot of people have been asking about our harness / approach - some thoughts:

1/ it’s fully open source on GitHub!

2/ it is quite simple - and we think this is where harness engineering is heading. you no longer need elaborate scaffolding to force the model to reason in a prescribed way

3/ we initially included a verifier to check the executor’s work. it ended up being *more* accurate than the benchmark’s grader, but we omitted it (you can't score above the ceiling set by the grader). we have a lot more to say on this.

4/ we were most excited by the performance uplift in Sonnet (the lighter model). it reflects a shift toward picking the model at the intelligence/cost Pareto max for a task, not just the largest one. Sonnet achieved near parity with Opus in performance while costing less than half as much.

about 1 month ago

@kylejeong @jamesmurdza @daytonaio @browserbase instead of a manual hand-back button, it auto-detects completion via URL, element, content, or even LLM-based conditions - the agent resumes itself 👀
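
A rough sketch of how those four kinds of completion check could be modeled as composable predicates. Everything here is hypothetical: `PageSnapshot`, the condition builders and the `judge` callback are illustration-only names, not browser-handoff's API, and a real version would read the URL, DOM and text from a live Playwright page.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class PageSnapshot:
    """Hypothetical stand-in for the state read from a live browser page."""
    url: str
    text: str = ""
    selectors: set[str] = field(default_factory=set)


Condition = Callable[[PageSnapshot], bool]


def url_contains(fragment: str) -> Condition:
    # URL condition: e.g. the browser left /login and landed on /account.
    return lambda page: fragment in page.url


def element_present(selector: str) -> Condition:
    # Element condition: e.g. a logout button is now on the page.
    return lambda page: selector in page.selectors


def content_contains(phrase: str) -> Condition:
    # Content condition: case-insensitive match on visible text.
    return lambda page: phrase.lower() in page.text.lower()


def llm_says(question: str, judge: Callable[[str, PageSnapshot], bool]) -> Condition:
    # LLM condition: `judge` is an assumed callback that asks a model a yes/no question.
    return lambda page: judge(question, page)


def any_of(*conditions: Condition) -> Condition:
    # Completion is detected as soon as any one condition holds.
    return lambda page: any(check(page) for check in conditions)


login_done = any_of(url_contains("/account"), element_present("#logout"))
still_on_login = PageSnapshot(url="https://shop.example/login")
logged_in = PageSnapshot(url="https://shop.example/account")
print(login_done(still_on_login), login_done(logged_in))
```

Polling a predicate like `login_done` against fresh snapshots is what would let the agent resume on its own, with no manual hand-back button.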

about 1 month ago

I made browser-handoff, a tool that lets humans temporarily take over AI browser agents when human input is needed. Built it while helping @jamesmurdza ship https://t.co/IFndgSwctr: sandboxed agents needed a way to log into Claude. Demo runs in @daytonaio 👇

about 1 month ago

@kylejeong @jamesmurdza @daytonaio @browserbase tried it - too laggy to complete a login, inputs barely worked 😅. Director is a product built around browserbase’s stream, while browser-handoff is a library you attach to any Playwright session

about 1 month ago

@kylejeong @jamesmurdza @daytonaio @browserbase Cool! Though from what I can tell, Live View is the stream URL — browser-handoff handles the layer above: trigger detection, pause, notify, wait for completion, then resume. Couldn’t find that @browserbase does that natively — happy to be corrected if I’m missing something 👀
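
The layer described here (trigger detection, pause, notify, wait for completion, resume) can be sketched as a small control loop. This is only an illustration under assumed names: `run_with_handoff`, `Phase` and the three callbacks are hypothetical and say nothing about how browser-handoff or Browserbase implement it.

```python
from enum import Enum
from typing import Callable, Iterable


class Phase(Enum):
    RUNNING = "running"
    PAUSED = "paused"
    NOTIFIED = "notified"
    RESUMED = "resumed"


def run_with_handoff(
    steps: Iterable[str],
    needs_human: Callable[[str], bool],      # trigger detection
    notify: Callable[[str], None],           # e.g. a chat ping with the stream URL
    wait_until_done: Callable[[str], None],  # blocks until completion is detected
) -> list[tuple[str, Phase]]:
    """Run agent steps, handing control to a human whenever a trigger fires."""
    log: list[tuple[str, Phase]] = []
    for step in steps:
        if not needs_human(step):
            log.append((step, Phase.RUNNING))  # the agent handles this step itself
            continue
        log.append((step, Phase.PAUSED))       # the agent stops acting
        notify(f"take over at: {step}")
        log.append((step, Phase.NOTIFIED))
        wait_until_done(step)                  # the human works in the live browser
        log.append((step, Phase.RESUMED))      # the agent continues with the next step
    return log
```

A stream URL on its own only covers the part where the human works in the browser; the loop above is the part that decides when to pause and when to continue.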

about 1 month ago

Repo: https://t.co/rzSVxPFPYO

kindaharsh retweeted

about 2 months ago

People keep telling me "Git was not designed for agents". Git is SO good for agents: Commit, rebase, merge, repeat.

about 2 months ago

@vague_anshul hiiiii

kindaharsh retweeted

Brooklyn

@vague_anshul

about 2 months ago

X is cool, but it’s 100x better when your timeline is full of people who code and build things. if you’re into tech, AI, startups, product, design, development or programming, say hi 👋

kindaharsh retweeted

about 2 months ago

Claude Code is free right now!

kindaharsh retweeted

about 2 months ago

Thanks to: @pluuto19 @kindaharsh @burhannkhatri for their great open source contributions!

kindaharsh retweeted