Lee Butterman

almost 3 years ago

@simonw I’ve got text embeddings of Wikipedia for semantic search in the browser at https://t.co/MycZ6RSjIi

6

198

12

99

101K

3 months ago

@dbreunig @isaacbmiller1 Sounds cool! I’ve found DSPy and GEPA super productive and very excited about RLMs https://t.co/BoWcFiz3ao

7 months ago

https://t.co/6WF3RfTi0d

0

16

2

19

5K

2

10

3

7

4K

leebutterman retweeted

Omar Khattab

@lateinteraction

4 months ago

exciting talks on dspy in production from dropbox and shopify teams at the next DSPy SF meetup!

2

50

5

10

5K

I Saw Fate Whiz Back By Me banner/avatar A550 digital ambient unedited trinkets & glass

5 months ago

@jeffreyhuber Upper right is Deep Research

1

0

107

Who to follow

Mina Doroudi

@mdoroudi

Product at Microsoft Designer. GenAI, Children's Book Author, Entrepreneur. Photographer. Rock Climber. Yogi. Gardener. Reader. Views are my own.

leebutterman retweeted

Farouk

@FaroukAdeleke3

5 months ago

Open sourcing Microcode! Microcode is a context-efficient, general purpose terminal agent fully powered by a packaged `dspy.RLM()` program. Set your own OpenRouter API key via and choice of models. Supports MCP servers too with @MaximeRivest mcp2py. Because the CI/CD of the RLM engine is exposed through @modaicdev, per-user or per-codebase prompt optimization is plug and play. If you set the --verbose flag, it pretty prints the RLM's trajectories as reasoning or code within it's REPL.

7

138

16

191

15K

5 months ago

@nbaschez We’ve also found that giving Claude access to start and stop an app with multiple daemons and access to read logs is crucial for longer self directed exploration. Any reason you like pm2 over docker?

0

102

5 months ago

@lateinteraction 💯. Python is often my object code, and my human intention from reading and writing in the multi turn code agent setup is the only important part and is often lost in PR comments. I know I especially don’t record praise of what the agent got right that I was worried about

0

43

leebutterman retweeted

Misha

@drethelin

6 months ago

I’ve heard of boba tea, but there’s a new place in the neighborhood that sells something they call Kiki tea, so I got some to try

77

22K

2K

980

525K

leebutterman retweeted

Salah Alzu'bi

@salahalzubi401

7 months ago

Also a big fan of GEPA! If you’re optimization runs are taking a long time you should take a look at our k-way proposal function called GEPA+. It reduces metric runs by 30-40% while improving relative accuracy by 2-5% depending on tasks! https://t.co/603AmYLSOD

1

11

1

8

814

leebutterman retweeted

Vaibhav (VB) Srivastav

@reach_vb

7 months ago

This! Sometime last year I made the switch, the default is greyscale and it automatically turns to colour when I’m looking at photos and back when I close it! 100% would recommend

reach_vb's tweet photo. This! Sometime last year I made the switch, the default is greyscale and it automatically turns to colour when I’m looking at photos and back when I close it!

100% would recommend https://t.co/i8o7B8HBgr

72

15K

387

12K

4M

leebutterman retweeted

Christopher Potts

@ChrisGPotts

7 months ago

A clip from my practice run of this talk, providing more context for this slide:

5

66

16

69

26K

@chrisbolas @noah_vandal @DSPyOSS

7 months ago

0

2

0

28

7 months ago

@chrisbolas @noah_vandal @DSPyOSS As I’d explained a bit farther down, you want a dense model if you only have 8GB before you hit slow disk (micro sd), and gpt-oss is a MoE, most of your experts aren’t going to mix in, so a dense model like Qwen3 4B Thinking 2507 utilizes memory more

leebutterman's tweet photo. @chrisbolas @noah_vandal @DSPyOSS As I’d explained a bit farther down, you want a dense model if you only have 8GB before you hit slow disk (micro sd), and gpt-oss is a MoE, most of your experts aren’t going to mix in, so a dense model like Qwen3 4B Thinking 2507 utilizes memory more https://t.co/0AnSKYzudd

2

1

0

49

7 months ago

@JoshPurtell I have found that GEPA is extremely sample efficient. I got from 7% to 50% accurate on chat to sql for qwen3 0.6B on a MacBook in a day. And 7% to 28% in one pass through my <300 item dataset with a Qwen3 4B reflection LLM https://t.co/dwt6WBKWhf

7 months ago

DSPy on a Pi: Cheap Prompt Optimization with GEPA and Qwen3 “It took me about sixteen hours on a Raspberry Pi to boost performance of chat-to-SQL using Qwen3 0.6B from 7.3% to 28.5%. Using gpt-oss:20b, to boost performance from ~60% to ~85% took 5 days.”

DSPyOSS's tweet photo. DSPy on a Pi: Cheap Prompt Optimization with GEPA and Qwen3

“It took me about sixteen hours on a Raspberry Pi to boost performance of chat-to-SQL using Qwen3 0.6B from 7.3% to 28.5%. Using gpt-oss:20b, to boost performance from ~60% to ~85% took 5 days.” https://t.co/tB5Dmuk4vp

6

171

22

134

12K

0

9

2

8

1K

leebutterman retweeted

7 months ago

https://t.co/6WF3RfTi0d

0

16

2

19

5K

7 months ago

@noah_vandal @DSPyOSS With under 1GB to spare! But gpt-oss:20b (in Ollama) has weights of ~12GB and takes another few gigs of inference memory so I mostly had my four cores going full time on the 16GB Pi 5. Qwen 4B optimizing Qwen 4B would be even smaller. On a MacBook it is within an afternoon :)

1

2

0

81

7 months ago

@karanjagtiani04 @DSPyOSS No manual tuning! That’s the whole point :) I just made a pretty basic prompt with some bland facts about the data (multiple rows per paper, one per author, read only, call out content policy violations) and GEPA did the rest, and DSPy was super easy to use to separate concerns

1

6

2

0

1K

leebutterman retweeted

7 months ago

DSPy on a Pi: Cheap Prompt Optimization with GEPA and Qwen3 “It took me about sixteen hours on a Raspberry Pi to boost performance of chat-to-SQL using Qwen3 0.6B from 7.3% to 28.5%. Using gpt-oss:20b, to boost performance from ~60% to ~85% took 5 days.”

6

171

22

134

12K

8 months ago

@lmeyerov Those piles of boring features that are getting automated are all closer to each other than making the initial proofs of concept so it’s easy to imagine that coding agents will get better at them, and that iterative polish is an iceberg of work, esp re compliance

1

0

56

9 months ago

@charles_irl These flow charts https://t.co/gM1RRTQqi4

0

1

0

64