Vamsi Aribandi

@varibandi

training and shipping browser/computer use agents @yutori_ai

san francisco

Joined November 2019

284 Following

340 Followers

67 Posts

Pinned Tweet

Vamsi Aribandi @varibandi

7 months ago

co-driving RL for Navigator - our SOTA browser use agent - has been a professional highlight for me, have learned so much working on it with @CysuXiao. read the blog post (linked below) for more!

7 months ago

Introducing Yutori Navigator 31 years ago, the modern web era began with Netscape Navigator. Today, we’re introducing Yutori Navigator — a web agent that autonomously navigates websites on its own cloud browser to complete tasks for you. Navigator achieves pareto-domination over Gemini 2.5, Claude 4.5, and OpenAI Operator • 10%-20% accuracy gains across benchmarks • 2-3x faster • Uniformly preferred in head-to-head human-evals Simply put, the best web agent in the world

DhruvBatra_'s tweet photo. Introducing Yutori Navigator

31 years ago, the modern web era began with Netscape Navigator.

Today, we’re introducing Yutori Navigator — a web agent that autonomously navigates websites on its own cloud browser to complete tasks for you.

Navigator achieves pareto-domination over Gemini 2.5, Claude 4.5, and OpenAI Operator

• 10%-20% accuracy gains across benchmarks
• 2-3x faster
• Uniformly preferred in head-to-head human-evals

Simply put, the best web agent in the world

32

264

46

95

93K

1

17

4

1

3K

varibandi retweeted

28 days ago

We gave some of our partners early access to n1.5 — the most capable computer use model for the web. It is in production at FAANG scale as we speak, replacing a computer use model from a frontier lab. If your product can benefit from web automation — extracting structured data from dynamic webpages, filling forms, completing workflows on the web, testing vibe coded web apps — you should try out @yutori_ai's Navigator n1.5! Save your GPT / Claude / Gemini capacity for something else :)

4

19

6

3

4K

Vamsi Aribandi @varibandi

29 days ago

I genuinely think this should be the first model anyone tries for their browser agent

29 days ago

𝐈𝐧𝐭𝐫𝐨𝐝𝐮𝐜𝐢𝐧𝐠 𝐍𝐚𝐯𝐢𝐠𝐚𝐭𝐨𝐫 𝐧𝟏.𝟓 The most capable computer-use model for the web. Pareto-domination: accuracy, latency, cost • SoTA across all benchmarks • +5-10% over GPT 5.5, Opus 4.7, n1 • +25% over Gemini • 2x faster, significantly cheaper Expanded action space • UI actions (like n1) + JavaScript generation & execution

DhruvBatra_'s tweet photo. 𝐈𝐧𝐭𝐫𝐨𝐝𝐮𝐜𝐢𝐧𝐠 𝐍𝐚𝐯𝐢𝐠𝐚𝐭𝐨𝐫 𝐧𝟏.𝟓

The most capable computer-use model for the web.

Pareto-domination: accuracy, latency, cost

• SoTA across all benchmarks
• +5-10% over GPT 5.5, Opus 4.7, n1
• +25% over Gemini
• 2x faster, significantly cheaper

Expanded action space
• UI actions (like n1)
+ JavaScript generation & execution

15

192

23

96

151K

1

6

2

0

762

Vamsi Aribandi @varibandi

about 2 months ago

@Shahules786 @xdotli @adityak6798 @DavidAVenuto @GottliebEli @jackminong thanks for having me!

0

1

0

0

194

Who to follow

PhD student @berkeley_ai. AI persuasion, safety, sign language. Prev @carnegiemellon @polytechnique, intern @msftresearch @deepmind. 🇫🇷🇯🇵

Kartik Sreenivasan

Research scientist at MosaicML/Databricks. PhD from UW-Madison. Interested in LLMs, optimization, and the meaning of life.

@royschwartzNLP

Senior Lecturer at @CseHuji. #NLPROC

varibandi retweeted

2 months ago

I don’t like Pulse in ChatGPT Pro. Its inaccurate, serves a lot of older information and is mostly slop. Which is weird, given how good ChatGPT is w/ search. Scouts by Yutori (CUA startup) does those things better, imo.

9

68

6

9

8K

varibandi retweeted

3 months ago

Two updates from Yutori: 1. We benchmarked GPT 5.4 on browser-use tasks • Matches/slightly-outperforms Opus 4.6 (+0.3%) • Big jump over previous OpenAI CUAs 2. Latest version of n1 • Outperforms GPT 5.4 and Opus 4.6 (+3%) • 2.5x faster, 4-5x cheaper.

DhruvBatra_'s tweet photo. Two updates from Yutori:

1. We benchmarked GPT 5.4 on browser-use tasks
• Matches/slightly-outperforms Opus 4.6 (+0.3%)
• Big jump over previous OpenAI CUAs

2. Latest version of n1
• Outperforms GPT 5.4 and Opus 4.6 (+3%)
• 2.5x faster, 4-5x cheaper. https://t.co/HjIYUA5sPo

3

43

5

9

8K

Vamsi Aribandi @varibandi

6 months ago

@_xjdr great post! how much of this would apply to post training? I assume flops section still applies, but router balancing/stability may already be alleviated so not as many tricks would be needed? and the data section isn’t specific to MoEs?

0

0

0

0

74

Vamsi Aribandi @varibandi

6 months ago

@_arohan_ just confirming: this guy was at cal academy of sciences right?

1

0

0

0

76

Vamsi Aribandi @varibandi

7 months ago

https://t.co/ui0vWEti0G

0

3

0

0

162

Vamsi Aribandi @varibandi

7 months ago

co-driving RL for Navigator - our SOTA browser use agent - has been a professional highlight for me, have learned so much working on it with @CysuXiao. read the blog post (linked below) for more!

7 months ago

Introducing Yutori Navigator 31 years ago, the modern web era began with Netscape Navigator. Today, we’re introducing Yutori Navigator — a web agent that autonomously navigates websites on its own cloud browser to complete tasks for you. Navigator achieves pareto-domination over Gemini 2.5, Claude 4.5, and OpenAI Operator • 10%-20% accuracy gains across benchmarks • 2-3x faster • Uniformly preferred in head-to-head human-evals Simply put, the best web agent in the world

DhruvBatra_'s tweet photo. Introducing Yutori Navigator

31 years ago, the modern web era began with Netscape Navigator.

Today, we’re introducing Yutori Navigator — a web agent that autonomously navigates websites on its own cloud browser to complete tasks for you.

Navigator achieves pareto-domination over Gemini 2.5, Claude 4.5, and OpenAI Operator

• 10%-20% accuracy gains across benchmarks
• 2-3x faster
• Uniformly preferred in head-to-head human-evals

Simply put, the best web agent in the world

32

264

46

95

93K

1

17

4

1

3K

Vamsi Aribandi @varibandi

11 months ago

@alexgraveley congrats, looks cool!

0

1

0

0

129

Vamsi Aribandi @varibandi

11 months ago

@finbarrtimbers @natolambert another place this comes up is when it *always* uses .get on dicts instead of raw indexing. don’t think it learned that sometimes raising an error is good because it means something is wrong lol

1

10

0

1

439

Vamsi Aribandi @varibandi

11 months ago

if you want to figure this out for web agents, consider applying to @yutori_ai :)

11 months ago

there’s a palpable tension in the air as hundreds of AI researchers (including me!) quietly work nights and weekends trying to figure out the “right way” to scale RL math & code are not the universe we will not rest until post-training is as clean and elegant as pre-training

35

807

33

203

65K

0

9

3

3

4K

varibandi retweeted

12 months ago

We're excited to launch Scouts — always-on AI agents that monitor the web for anything you care about.

138

3K

167

3K

529K

Vamsi Aribandi @varibandi

about 1 year ago

@ParssaKyanzadeh looks really cool, can’t wait to try it!

0

1

0

0

68

Vamsi Aribandi @varibandi

about 1 year ago

@YiTayML agreed! (except blame being on you :)) we were definitely too grounded in the pre train-finetune and transfer learning paradigm, which was apparent when flan (and instructgpt) came out. though personally it was a great learning experience for me so early in my career :)

1

5

1

1

2K

Vamsi Aribandi @varibandi

about 1 year ago

@jxmnop bishop’s deep learning is recent and is very well written, imo should be the default textbook https://t.co/nl7cbXBCpg

0

2

0

4

291

Vamsi Aribandi @varibandi

over 1 year ago

me in 2018 after training a cnn to classify dogs and cats with 60% accuracy

over 1 year ago

honestly ai is so easy and neural networks are so simple. this was always going to happen to the first intelligent species to come to our planet. we’re about to learn something important about how universes tend to go I think, because I don’t believe we’re in a niche one

41

1K

46

371

119K

1

3

0

1

1K

Vamsi Aribandi @varibandi

almost 2 years ago

@teortaxesTex I hate it too but most of it (besides fraud) is harmless cringe. imo the grifter attitude comes from intense career ambition to make money (good), and shady shortcut-taking mentality being rewarded. Ive seen both in India, which people should remember is still a very poor country

1

1

0

0

834

Vamsi Aribandi @varibandi

almost 2 years ago

varibandi's tweet photo. https://t.co/BQkFLwGJym

almost 2 years ago

What a crazy week for AI: - OpenAI launches SearchGPT - Meta releases Llama 3.1 - Mistral AI releases Mistral Large 2 - DeepMind AI gets silver medal at Int. Math Olympiad - Elon announces push to Grok 2&3 Competition is intensifying. The months ahead will be super exciting 🤯

444

8K

945

813

542K

0

1

0

0

703

Vamsi Aribandi @varibandi

almost 2 years ago

@junichikawaAI ご利用いただきありがとうございます。楽しんでいただけると幸いです !

1

2

0

0

74

Last Seen Users on Sotwe

Trends for you

Most Popular Users