Ishit Vachhrajani

@IshitV

CTO | now @awscloud | transformation, leadership, change, tech, cricket, food, travel, media, not in that order (cricket is #1) | my own opinions

United States

Joined June 2009

213 Following

294 Followers

182 Posts

IshitV retweeted

Brian Armstrong

@brian_armstrong

5 days ago

How to keep AI spend flat while token usage grows exponentially: Not with friction and spend alerts. With better defaults, routing, and caching.

Better Defaults (not Usage Caps) – Engineers can choose any model they want, but defaults matter. We’re experimenting with defaulting to open weight models like GLM 5.2 and Kimi 2.7 through our LLM gateway, while still encouraging engineers to choose the right model for the task. 91% of our employees were never hitting their usage caps, so instead of lowering caps and driving up alerts, we're moving to cheaper defaults. Note that code reviews use a diversity of models, so they can check each other's work.

Better Routing – In our custom harnesses, we preprocess prompts and route to the best model for the job, considering cache hits and model pricing. For instance, you may want a frontier model for planning, but not for execution where they can be overkill. Ultimately, humans shouldn't be choosing models - AI can automate this task.

Better Caching – Cache misses are the easiest way to drive your cost up. All of our requests are cache aware, so we’re reusing a warm cache wherever possible. For example, our cache hit rate went from 5% → 60% in LibreChat once properly implemented.

Keep Context Lean – Start fresh sessions when switching tasks. Scope file context narrowly. Disconnect unused tools. Don't just compact. The goal isn't fewer tokens used, it's fewer tokens wasted.

Better Visibility – Our engineers can use as many tokens as they want, from whatever model they want, but we’ve made usage visible – and the more you spend on AI, the more impact we expect.

The goal isn't to suppress usage. It's to build the infrastructure that makes exponential growth sustainable.

Putting this into practice has cut our AI spend nearly in half, while our token usage continues to grow.
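The routing idea in the post above (pick a model per task, weighing cache hits and model pricing) can be sketched roughly as follows. This is a minimal illustration, not the harness the post describes: the model names, the prices, the `frontier` flag, and the `route` and `estimated_cost` helpers are all hypothetical, and the sketch assumes cached input tokens are billed at a lower rate than uncached ones.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str            # hypothetical label, not a real model ID
    input_price: float   # assumed USD per 1M uncached input tokens
    cached_price: float  # assumed USD per 1M cache-hit input tokens
    frontier: bool       # True if we treat it as suitable for planning


# Illustrative catalog; names and prices are invented for this sketch.
MODELS = [
    Model("frontier-model", input_price=10.0, cached_price=0.5, frontier=True),
    Model("open-weight-model", input_price=1.0, cached_price=0.1, frontier=False),
]


def estimated_cost(model: Model, prompt_tokens: int, cached_tokens: int) -> float:
    """Estimate input cost in USD, billing cache hits at the cheaper rate."""
    cached = min(cached_tokens, prompt_tokens)
    uncached = prompt_tokens - cached
    return (uncached * model.input_price + cached * model.cached_price) / 1_000_000


def route(task: str, prompt_tokens: int, warm_cache: dict[str, int],
          models: list[Model] = MODELS) -> Model:
    """Pick the cheapest eligible model.

    Planning tasks are restricted to frontier models; any other task may use
    any model. warm_cache maps a model name to the number of prompt tokens
    already cached for it (0 if absent).
    """
    candidates = [m for m in models if m.frontier] if task == "planning" else list(models)
    if not candidates:
        raise ValueError("no model is eligible for this task")
    return min(
        candidates,
        key=lambda m: estimated_cost(m, prompt_tokens, warm_cache.get(m.name, 0)),
    )


# Cold cache: the cheaper default wins for execution work.
print(route("execution", 100_000, {}).name)  # open-weight-model
# Warm cache on the frontier model: reusing it is now the cheaper option.
print(route("execution", 100_000, {"frontier-model": 100_000}).name)  # frontier-model
```

With these invented prices, a fully warm cache makes the frontier model cheaper than a cold call to the open-weight one, which is one way cache awareness can change a routing decision.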

466

6K

726

6K

4M

IshitV retweeted

12 days ago

What we call talent is often just the combination of:

- A deep need to win and high agency
- The ability to learn fast from mistakes
- A beginner’s mind that never disappears

The common thread: an unusually high rate of learning.

76

3K

267

859

144K

Ishit Vachhrajani @IshitV

12 days ago

🍎🏀

12 days ago

Good lord this edit. Insane.

331

80K

11K

22K

4M

0

0

0

0

80

Ishit Vachhrajani @IshitV

17 days ago

😂

Daniel Vassallo

17 days ago

A good general rule is when your instinct says something is dumb, and that something was planned for years by people with way more information than you, the dumb one might be you.

7

53

3

1

16K

0

0

0

0

27

Ishit Vachhrajani @IshitV

23 days ago

❤️ 🍎

Mayor Zohran Kwame Mamdani

23 days ago

Welcome to the greatest city on Earth. Here are some tips and tricks for how to get around and make the most of your time here. Learn more at https://t.co/mki8gOyIED.

700

49K

4K

5K

5M

0

0

0

0

27

Ishit Vachhrajani @IshitV

about 1 month ago

The companies letting their teams burn tokens and experiment right now aren’t being reckless. They’re building instincts their competitors won’t be able to buy later. May your tokens be with you!

0

1

0

0

17

Ishit Vachhrajani @IshitV

about 1 month ago

Most CFOs want to minimize AI token spend. That’s the wrong instinct. Tokens aren’t a cost. They’re how your team learns to think differently.

1

0

0

1

36

Ishit Vachhrajani @IshitV

about 1 month ago

There’s a chasm between people who’ve seen AI demos and people who’ve felt what’s possible firsthand. It’s the difference between reading about the internet in 1995 and getting your first dial-up connection. You can’t unfeel that moment.

1

0

0

0

18

Ishit Vachhrajani @IshitV

3 months ago

Let’s go! 🌕🇺🇸

3 months ago

Liftoff. The Artemis II mission launched from @NASAKennedy at 6:35pm ET (2235 UTC), propelling four astronauts on a journey around the Moon. Artemis II will pave the way for future Moon landings, as well as the next giant leap — astronauts on Mars.

4K

177K

55K

11K

14M

0

0

0

0

27

Ishit Vachhrajani @IshitV

4 months ago

Remains one of the greatest heists in Test cricket history.

4 months ago

25 years ago at Eden Gardens Rahul and I shared a partnership that will forever remain special. In a moment when the game looked beyond us we chose belief, patience and resilience. That stand was not just about runs but was about trust, teamwork and fighting for every session. Grateful to have shared that journey with Rahul and to be part of a Test that reminded us all that in cricket comebacks are always possible👍 @BCCI #PowerOfPartnership #Believe #Resilience

642

26K

4K

311

643K

0

1

0

0

29

Ishit Vachhrajani @IshitV

4 months ago

🏏🏆

4 months ago

𝐂.𝐇.𝐀.𝐌.𝐏.𝐈.𝐎.𝐍.𝐒 🇮🇳 #TeamIndia clinch a record 3️⃣rd ICC Men's #T20WorldCup title 🏆 Take. A. Bow 🫡 #MenInBlue | #Final | #INDvNZ

2K

69K

17K

672

1M

0

0

0

0

50

Ishit Vachhrajani @IshitV

5 months ago

🫳🎤

0

0

0

0

99

IshitV retweeted

6 months ago

🗞️ @sdxcentral shares AI inferencing trends in 2026 featuring @IshitV and the AWS vision to build Amazon Bedrock as the world's biggest inference engine. Read: https://t.co/8U4VY2xhv4

2

3

3

0

178

Ishit Vachhrajani @IshitV

7 months ago

It's too cold to walk to the train from the parking lot 😜

7 months ago

NJTRANSIT's tweet photo. https://t.co/t6eo26KoHv

174

10K

544

491

2M

0

0

0

0

75

Ishit Vachhrajani @IshitV

7 months ago

#52 👑

7 months ago

A leap of joy ❤️💯 A thoroughly entertaining innings from Virat Kohli 🍿 Updates ▶️ https://t.co/MdXtGgRkPo #TeamIndia | #INDvSA | @IDFCFIRSTBank | @imVkohli

400

42K

6K

1K

1M

0

0

0

0

108

Ishit Vachhrajani @IshitV

8 months ago

What a team! #Champions

8 months ago

The moment all of India has been waiting for as ICC Chairman @JayShah hands India captain Harmanpreet Kaur the trophy 🏆 #CWC25

201

29K

4K

816

593K

0

0

0

0

67

IshitV retweeted

8 months ago

𝐂𝐇𝐀𝐌𝐏𝐈𝐎𝐍𝐒 𝐎𝐅 𝐓𝐇𝐄 𝐖𝐎𝐑𝐋𝐃 🇮🇳🏆 India clinch their maiden Women’s @cricketworldcup title at #CWC25 🤩

1K

90K

19K

720

2M

Ishit Vachhrajani @IshitV

10 months ago

@collinsadam Hi Adam - Can I please get the hi-res version of this photo for a fan print? I'd appreciate it if you could DM me, and I'm happy to purchase it if you have it for sale somewhere. Thank you!

0

0

0

0

16

Ishit Vachhrajani @IshitV

11 months ago

@collinsadam Thank you. Done.

1

0

0

0

23
