Top Tweets for #TextGrad

about 1 year ago

Here's the non-paywall version of our #TextGrad Nature paper https://t.co/5YlLzhaG36! 📜

about 1 year ago

⚡️Really thrilled that #textgrad is published in @nature today!⚡️ We present a general method for genAI to self-improve via our new *calculus of text*. We show how this optimizes agents🤖, molecules🧬, code🖥️, treatments💊, non-differentiable systems🤯 + more!

about 1 year ago

I had a lot of fun discussing #textgrad on the @Nature podcast! It starts at around 12 minutes here https://t.co/e2rkyJWocZ

about 1 year ago

🚀 I’m thrilled to announce that #textgrad has been published in @Nature today! It’s been an incredible journey working with the TextGrad team, and I am grateful for the wonderful collaboration within the Zou Group, @james_y_zou. 🙌 #Nature #AI #LLMs #AgenticAI

Pan Lu

@lupantech

about 1 year ago

🚀 Thrilled to share that #textgrad is published in @Nature today! 🎉 It’s been an incredible journey working with the amazing TextGrad team and the Zou Group @james_y_zou. 🙌

✨ What is TextGrad?
A groundbreaking framework that automates optimization of LLMs and compound systems using insights from "textual gradients."

🔍 Check it out for more details!
📄 Paper: https://t.co/B4RCEDL7WP
💻 Code: https://t.co/R6TPUDY2wL
📚 Docs: https://t.co/uLqIRNc5Mp
🎥 Video by Discover AI: https://t.co/EfwGUvRjcZ

#Nature #AI #LLMs #AgenticAI

about 1 year ago

💡The key idea of #textgrad is to optimize by backpropagating textual gradients produced by #LLM. Paper: https://t.co/CjBpSxcnpn Code: https://t.co/nCGqp15kJJ Amazing job by @mertyuksekgonul leading this project w/ fantastic collaborators @federicobianchy Joseph Boen @ShengLiu_ @lupantech @guestrin @ZhiHuangPhD 👏

Amrit Singh Bedi @amritsinghbedi3

over 1 year ago

🚀 The Future is Multi-LLM-based AI Systems🚀

In the upcoming multi-LLM systems, there’s a BIG question on the horizon:

𝗛𝗼𝘄 𝗱𝗼 𝘄𝗲 𝗘𝗩𝗔𝗟𝗨𝗔𝗧𝗘 𝘁𝗵𝗲𝘀𝗲 𝗔𝗜 𝘀𝘆𝘀𝘁𝗲𝗺𝘀 𝗱𝘂𝗿𝗶𝗻𝗴 𝗶𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲?

(as in #TextGrad @mertyuksekgonul @james_y_zou) https://t.co/oUxVKul0PR

Thomas Ahle

@thomasahle

almost 2 years ago

A while ago I wrote a thread about #TextGrad, which is an alternative prompt optimization method, based on "natural language gradients". Cool!

Since we are still waiting for @karpathy's video reimplementing this from scratch... I thought I had to make my own...

So here it is, in just 300 lines of code, a lot of which is (ironically) prompts: https://t.co/SeL9Ssekhv

This screenshot is real code using tiny-text-grad!

- The equality_loss function asks an LLM judge whether the answer is correct, and to provide feedback otherwise.
- The loss.backwards() call distributes the feedback through the call graph, using more LLM calls!
- And optimizer.step() updates all the parameters (prompts) using a "gradient step" in the direction of the feedback received.

The MultihopModel is based on #dspy's Simplified Baleen: https://t.co/0tXiSCYbeu and allows the LLM to perform multiple Wikipedia calls as it builds up context.

This creates an interesting call graph, which is visualized here:

The loss node is furthest to the right, and all the nodes that it depends on are to the left.

In the text-backprop step each node accumulates feedback in a list, rather than with a sum as in "real" backprop.

That's all!

Except... Does it work?

Well... It's definitely better than bad prompts and no tuning. But sometimes the optimization "explodes" just like normal gradient descent.

I'm not 100% convinced this is the way to go, over few-shot optimization or just giving the LLM the complete call graph directly, and asking it to optimize it.
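
The three steps described in the thread (an LLM-judged loss, a backward pass over the call graph, and an optimizer step that rewrites prompts) can be sketched roughly as follows. This is an illustrative toy, not the code of tiny-text-grad or of the TextGrad library: `Node`, `topo_order`, `backward`, `step`, and the prompt wording are all hypothetical names chosen here, and `fake_llm` is a deterministic stand-in for a real model call so the example runs offline.

```python
from dataclasses import dataclass, field
from typing import Callable, List

LLM = Callable[[str], str]  # assumption: any function mapping a prompt to a reply


@dataclass(eq=False)
class Node:
    """One value in the call graph: a prompt, an intermediate output, or the loss."""
    value: str
    parents: List["Node"] = field(default_factory=list)
    feedback: List[str] = field(default_factory=list)


def topo_order(root: Node) -> List[Node]:
    """Depth-first post-order: every node appears after all of its parents."""
    seen, order = set(), []

    def visit(node: Node) -> None:
        if id(node) in seen:
            return
        seen.add(id(node))
        for parent in node.parents:
            visit(parent)
        order.append(node)

    visit(root)
    return order


def backward(loss: Node, llm: LLM) -> None:
    """Push textual feedback from the loss back to every node it depends on."""
    loss.feedback.append(loss.value)  # the judge's verdict seeds the pass
    for node in reversed(topo_order(loss)):  # children before parents
        for parent in node.parents:
            note = llm(
                f"FEEDBACK\nInput: {parent.value}\nOutput: {node.value}\n"
                f"Criticism of the output: {node.feedback}\n"
                "How should the input change?"
            )
            parent.feedback.append(note)  # appended to a list, never summed


def step(params: List[Node], llm: LLM) -> None:
    """Rewrite each parameter using the feedback it collected, then clear it."""
    for param in params:
        if param.feedback:
            param.value = llm(f"REWRITE\nText: {param.value}\nFeedback: {param.feedback}")
            param.feedback.clear()


def fake_llm(prompt: str) -> str:
    """Deterministic stand-in so the sketch runs without a model."""
    if prompt.startswith("REWRITE"):
        return "Answer directly and name your source."
    return "Be more direct."


# Two-hop graph: the same system prompt feeds both hops.
system_prompt = Node("Answer the question.")
question = Node("Who wrote Hamlet?")
hop1 = Node("Searching for Hamlet...", parents=[system_prompt, question])
hop2 = Node("Probably Shakespeare, I think.", parents=[system_prompt, hop1])
loss = Node("The answer hedges; it should be stated plainly.", parents=[hop2])

backward(loss, fake_llm)
notes_on_prompt = len(system_prompt.feedback)  # one note per use of the prompt
step([system_prompt], fake_llm)
```

Because the system prompt is used by both hops, it collects two separate feedback notes before the optimizer step, which mirrors the "list rather than sum" accumulation mentioned in the thread.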

almost 2 years ago

🔥Very cool application of #textgrad to reduce hallucination of visual-language #AI! Significantly increases reliability of GPT-4v/o.

almost 2 years ago

⚡️#TextGrad reduces hallucination in multimodal LLMs!

MMVP 🏆 (multiple choice questions) - TextGrad optimized prompts increase the accuracy of GPT-4v from 71% -> 76%!

HQH - Relation📍(open-ended generation) - TextGrad boosts the accuracy of GPT-4o from 77.2% to 82.5%! https://t.co/BRgI31N8qM
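
The accuracy figures in this tweet are absolute changes. For comparison with results that are stated as error-rate reductions, the same numbers can be converted with a few lines of arithmetic. The helper name `gains` is hypothetical, and the only inputs are the before and after accuracies quoted in the tweet.

```python
from typing import Tuple


def gains(before: float, after: float) -> Tuple[float, float]:
    """Return (absolute gain in points, relative error-rate reduction in %)."""
    absolute = after - before
    error_before = 100.0 - before  # error rate is the complement of accuracy
    relative = 100.0 * absolute / error_before
    return round(absolute, 1), round(relative, 1)


mmvp = gains(71.0, 76.0)  # GPT-4v on MMVP
hqh = gains(77.2, 82.5)   # GPT-4o on HQH (Relation)
print(mmvp, hqh)
```

Under this conversion, the MMVP change is 5 points (about a 17% relative drop in errors) and the HQH change is 5.3 points (about a 23% relative drop).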

Pan Lu

@lupantech

almost 2 years ago

🚀 #TextGrad is advancing multimodal reasoning and reducing hallucinations! Join us in contributing to TextGrad, an innovative framework that automatically optimizes foundation models via natural language gradients! Check it out here: https://t.co/WpwqCKvpn0! 🌟

almost 2 years ago

⚡️This is the most fun project!

We built PyTorch-for-text! 🔥
#TextGrad: automated "differentiation" via text to optimize AI systems by backpropagating LLM text feedback.

TextGrad + GPT4o:
💻LeetCodeHard best score
❓GPQA sota
🧬Designs new molecules
🩺Improves treatments 🧵

Urmish Thakker @UrmishThakker

almost 2 years ago

It was a lot of fun talking about #TextGrad and mixture-of-agents at @agihouse_org!

almost 2 years ago

Great presentation by @james_y_zou on mixture of agents and TextGrad at the hackathon organized by @SambaNovaAI, @togethercompute, and @NumbersStnAI at @agihouse_org. Multiple agents collaborating to achieve a task is becoming an increasingly important part of developing useful enterprise products! By combining multiple agents, they were able to beat #GPT4 on the AlpacaEval leaderboard.

These systems require switching between multiple agents and running them at extremely fast speed. This is where SambaNova’s SN40L chip really shines. 100s of models on a single node each at 1000s tokens per second. Try our apis at -

1.https://t.co/RPrWNbxA4j
2.https://t.co/u6bOwITBG2

almost 2 years ago

🔥🔥Wow, a group of Cambridge students used #TextGrad to win the LLMxLaw 1st Prize and AWS Challenge. They used #TextGrad to improve Claude 3 by 10% on legal questions. Very cool! https://t.co/bOgJNYISSY

almost 2 years ago

🔥#TextGrad is now multi-modal! TextGrad boosts GPT-4o's visual reasoning ability: 📊MathVista score 63.8➡️66.1 w/ TextGrad 🧬Reduces ScienceQA error rate by 20%. Best reported 0-shot score Tutorial: https://t.co/9NGJJtsQf8 Great work @lupantech @mertyuksekgonul + team! Works w/ any VLM. Check out @lupantech's 🧵for more examples! https://t.co/2lQb4Fch6J

Pan Lu

@lupantech

almost 2 years ago

#TextGrad now features multimodal reasoning!

🔬 ScienceQA (multimodal scientific reasoning)
- Error rate drops by 20%, achieving the highest zero-shot performance we know of.

📊 MathVista (multimodal math reasoning)
- Boosting the score from 63.8% to 66.1% on GPT-4o!

Explore more:
💻 Code: https://t.co/WpwqCKvpn0
📄 Doc: https://t.co/Wwb5XTNTkF
🌐 Project: https://t.co/3n8OfPYM91 🧵
