Ziv Navoth

@ziv

Opinions are mine.

40.789839,-73.975321

Joined March 2007

582 Following

676 Followers

1.4K Posts

Ziv Navoth @ziv

over 1 year ago

@moritzkremb builder

0

0

0

0

7

Ziv Navoth @ziv

over 1 year ago

@DevanGockenbach 20K

0

0

0

0

6

Ziv Navoth @ziv

over 1 year ago

@nabil_haouam scale

0

0

0

0

5

Ziv Navoth @ziv

over 1 year ago

A game that builds itself while you're playing it? https://t.co/K9fJWkbTw2

0

3

0

0

190

Who to follow

anna frenkel ��️

born in moldova, raised in brooklyn | mama, marketer, formerly @airbnb and @foursquare | happiest when surrounded by trees 🌳

Jonathan Grossman

Senior Lecturer, Department of Political Studies at @Bar_ilan. I teach IR and diaspora/migration politics. Opinions are my own. I post mostly in Hebrew. Peace🏃

Sign Man. Bridge enthusiast. Co-author of Tales of the San Francisco Cacophony Society. Baycoastal - San Francisco - Oakland

ziv retweeted

over 1 year ago

Caching will make your LLM application cheaper and faster to run. But caching is hard. As the famous saying goes, "There are 2 hard problems in computer science: cache invalidation, naming things, and off-by-1 errors." Here is how caching works at a very high level: 1. A new request comes in with a prompt. 2. The application checks whether an identical or similar prompt already exists in the cache. 3. If found, the application returns the cached response. 4. If not found, the application generates a new response for the prompt and caches it. If you implement this right, you'll get two main benefits: 1. Your application will be much faster. Returning responses from the cache have much lower latency than generating the response with an LLM. 2. Your application will be much cheaper. You will be saving a ton of money in tokens. However, implementing a robust caching system is a ton of work. Here is an idea: If you are using OpenAI’s models, Llama 3, Mixtral, or Gemma, take a look at CogCache. They are sponsoring this post: https://t.co/Gq0EaUl5qc CogCache is an out-of-the-box caching solution with intelligent caching: It will automatically cache and serve responses for semantically similar queries. Some of the metrics: • You'll get up to 100x faster response times. • You'll save up to 50% in costs. • They integrate with Groq for super fast response times. • Lowest token price in the market thanks to their partnership with Microsoft. They have a pay-as-you-go model, which is great for all sorts of businesses. And if you're an Azure customer, you can use your annual Azure commitment to cover your inference costs. The attached image shows a Python example. Your code doesn't change at all, and you use the same OpenAI's Completion API, but now with cache enabled. That's pretty sweet!

svpino's tweet photo. Caching will make your LLM application cheaper and faster to run.

But caching is hard. As the famous saying goes, "There are 2 hard problems in computer science: cache invalidation, naming things, and off-by-1 errors."

Here is how caching works at a very high level:

1. A new request comes in with a prompt.

2. The application checks whether an identical or similar prompt already exists in the cache.

3. If found, the application returns the cached response.

4. If not found, the application generates a new response for the prompt and caches it.

If you implement this right, you'll get two main benefits:

1. Your application will be much faster. Returning responses from the cache have much lower latency than generating the response with an LLM.

2. Your application will be much cheaper. You will be saving a ton of money in tokens.

However, implementing a robust caching system is a ton of work.

Here is an idea:

If you are using OpenAI’s models, Llama 3, Mixtral, or Gemma, take a look at CogCache. They are sponsoring this post:

https://t.co/Gq0EaUl5qc

CogCache is an out-of-the-box caching solution with intelligent caching: It will automatically cache and serve responses for semantically similar queries.

Some of the metrics:

• You'll get up to 100x faster response times.
• You'll save up to 50% in costs.
• They integrate with Groq for super fast response times.
• Lowest token price in the market thanks to their partnership with Microsoft.

They have a pay-as-you-go model, which is great for all sorts of businesses. And if you're an Azure customer, you can use your annual Azure commitment to cover your inference costs.

The attached image shows a Python example. Your code doesn't change at all, and you use the same OpenAI's Completion API, but now with cache enabled.

That's pretty sweet!

16

689

88

601

67K

Ziv Navoth @ziv

almost 2 years ago

@adamsilverman @crewAIInc @AgentOpsAI Count me in

0

0

0

0

9

Ziv Navoth @ziv

almost 2 years ago

@dvassallo No need to extract or aggregate. Simply create a project on Claude, upload all the PDFs to it and start querying it. Then ask it to create an artifact to chart how specific variables have changes over time.

0

0

0

0

14

Ziv Navoth @ziv

almost 2 years ago

@itsedaxe Count me in

0

0

0

0

6

Ziv Navoth @ziv

about 2 years ago

@Bharambe2Kiran template

0

0

0

0

9

Ziv Navoth @ziv

about 2 years ago

@girdley @pozzoron @girdley we've been to CDMX (and SMDA, Oaxaca, Puebla) many times without knowing a word in Spanish. Hitlist is here: https://t.co/wFqy9PEEme

0

1

0

0

55

ziv retweeted

@EllaTravelsLove

about 2 years ago

This is a photo of the city of Tel Aviv in 1944. Tel Aviv was founded in 1909 by 66 Jewish families. The neighbor city of Jaffa had a Jewish community and Jewish history. Palestine was the name of the region, but it was never a country. Khalissee and many others are trying to hint that Jews didn't live here before the current state of Israel existed and to rewrite history, when it's well documented. #ThePalestinianLie

EllaTravelsLove's tweet photo. This is a photo of the city of Tel Aviv in 1944.

Tel Aviv was founded in 1909 by 66 Jewish families.

The neighbor city of Jaffa had a Jewish community and Jewish history.

Palestine was the name of the region, but it was never a country.

Khalissee and many others are trying to hint that Jews didn't live here before the current state of Israel existed and to rewrite history, when it's well documented.

#ThePalestinianLie

173

2K

395

21

46K

ziv retweeted

about 2 years ago

Letter from 94 year old Holocaust survivor David Schaecter to Jon Glazer. @TheAcademy

johnondrasik's tweet photo. Letter from 94 year old Holocaust survivor David Schaecter to Jon Glazer. @TheAcademy https://t.co/XvE6pNJFyF

166

6K

2K

449

494K

ziv retweeted

Visegrád 24 @visegrad24

over 2 years ago

A message from women across the world to the women of Israel on International Women’s Day 2024.

363

8K

2K

115

206K

ziv retweeted

StopAntisemitism

@StopAntisemites

over 2 years ago

Happy International Women’s Day🎗️

StopAntisemites's tweet photo. Happy International Women’s Day🎗️ https://t.co/tReFQqa6se

153

9K

2K

59

141K

ziv retweeted

@EllaTravelsLove

over 2 years ago

There's no International Women’s Day without the freedom of ALL women. Share and call for the immediate release of all hostages unconditionally. #bringthemhomenow #bringthemhome

503

6K

2K

103

411K

ziv retweeted

@EllaTravelsLove

over 2 years ago

Send it to all your woke friends. #educateyourself

326

6K

2K

159

324K

ziv retweeted

@EllaTravelsLove

over 2 years ago · Israel

We want Naama home.

14

535

113

0

13K

ziv retweeted

miha schwartzenberg

over 2 years ago

Millions of people watched this tonight, at @SuperBowl ! @StandUp2JewHate teamed up with Dr. Clarence Jones, activist and author of the best speech ever wrote, “I have a dream”. Robert Kraft graciously spent 7 mil for…30 sec of pure emotion ! #StanduptoJewishhate #SuperBowl

4

2K

602

93

66K

ziv retweeted

over 2 years ago

This Super Bowl ad that will be aired today, sponsored by FCAS, is a poignant reminder for every American that while it may not be popular, standing up against anti-Jewish hate is the only team we must be on.

342

10K

2K

313

649K

ziv retweeted

@EllaTravelsLove

over 2 years ago

How can one deal with these conflicting narratives? Maybe you should look a little bit closer. #TheRealImage #ThePalestinianLie

318

2K

916

57

43K

Last Seen Users on Sotwe

Trends for you

Most Popular Users

Olivia

Online

✨

⭐

💫