Kevin Madura @kmad - Twitter Profile

Pinned Tweet

Kevin Madura

@kmad

5 months ago

My AIE talk on @DSPyOSS is up - check it out! Thanks to @aiDotEngineer for a high-signal, well-run event

AI Engineer

@aiDotEngineer

5 months ago

🆕 DSPy is (really) All You Need https://t.co/0y4nMyr1cE @kmad returns to AIE for a special workshop sure to please the @DSPyOSS fans - a comprehensive overview of DSPy! (our second after @lateinteraction's talk with us at AIEWF)

6

209

41

264

86K

5

44

7

21

9K

Kevin Madura

@kmad

26 minutes ago

Epic line up right here

sarah guo

@saranormous

38 minutes ago

1/ 🔥 @NoPriorsPod x @LatentSpacePod chat with @SatyaNadella at @Microsoft Build. He has the sharpest mental models of any public company CEO I've interviewed. $MSFT is at its heart still a tools company! Big focus on agentic coding, harness & AI evals. Takeaways:👇

saranormous's tweet photo. 1/ 🔥 @NoPriorsPod x @LatentSpacePod chat with @SatyaNadella at @Microsoft Build. He has the sharpest mental models of any public company CEO I've interviewed.

$MSFT is at its heart still a tools company! Big focus on agentic coding, harness & AI evals. Takeaways:👇 https://t.co/iAAjHdCVnK

5

40

3

12

5K

0

40

kmad retweeted

Shubham Saboo

@Saboo_Shubham_

about 11 hours ago

https://t.co/w7v8Vq1JNN

20

425

59

1K

198K

Kevin Madura

@kmad

about 6 hours ago

@antoine_chaffin I will have to check that out then!

0

1

0

16

Who to follow

about 15 hours ago

*If* this is confirmed this is fascinating - a team converted Google’s quantum algo ZKP into a benchmark and had agents hillclimb against it, eventually exceeding their results!

0

2

0

165

kmad retweeted

Harold Benoit

@harold_matmul

about 23 hours ago

@lateinteraction it was my idea :) Using GEPA is a very natural workflow for creating LLM programs. The iteration speed is very quick, and it easily allows researchers to bias the optimization with some priors (usually derived from just looking at the data). Thanks a lot for the great tool!

0

79

8

33

8K

Kevin Madura

@kmad

about 16 hours ago

GEPA and @DSPyOSS seen in the Microsoft MAI Pre-training write up!

Lakshya A Agrawal

@LakshyAAAgrawal

1 day ago

Excited to see the use of GEPA-optimized LLM judges for data filtering in MAI-Thinking-1 model's pre-training pipeline!

LakshyAAAgrawal's tweet photo. Excited to see the use of GEPA-optimized LLM judges for data filtering in MAI-Thinking-1 model's pre-training pipeline! https://t.co/wAtVx3KEUE

3

147

19

64

46K

0

25

4

3K

Kevin Madura

@kmad

1 day ago

@JayD0ubleu @_philschmid It’s demonstrably not a gimmick, you can try it yourself!

0

1

0

31

kmad retweeted

will brown

@willccbb

1 day ago

god i'm so excited to have noah on the team. been trying to get him here for almost a year. his record of innovation at the frontier of algorithms + infra for self-improving ai is honestly insane, and i think his recent work is my favorite yet. idk how he's so chill about it.

10

416

7

106

77K

Kevin Madura

@kmad

1 day ago

@dbreunig @lennypruss @trq212 @CAISconf Whoa that’s an elegant way of putting it. Have been thinking about exactly this topic (how many tasks today are under-specified to be useful)

1

4

0

271

kmad retweeted

swyx

@swyx

1 day ago

12.30pm today on the @Microsoft Build stream @NoPriorsPod x @latentspacepod x @satyanadella Join us! :)

25

114

7

18

297K

Kevin Madura

@kmad

1 day ago

@litcapital * that may or may not have valid shares

0

2

1

0

450

Kevin Madura

@kmad

1 day ago

@bclavie @aiDotEngineer See you there! The RLM discourse on here has been spicy so be sure to check out my talk

0

2

0

68

Kevin Madura

@kmad

1 day ago

@ThePeshwa @lateinteraction @PrimeIntellect Both, actually. I would switch to the other when I ran out of credits. Opus was nice for the big picture, gpt-5.5 for the execution and diagnostics

0

1

0

74

Kevin Madura

@kmad

2 days ago

So /goal is awesome Over the past few weeks I used @PrimeIntellect to train a 149M late interaction model based on GTE-ModernColBERT-v1 using PyLate, focused on clause extraction from legal contracts. On the MLEB benchmark it does well for its size: it's the best accuracy-per-parameter open model on the task, 3rd of 17 open-source models, ahead of Google's EmbeddingGemma (308M, 0.829) and the same-size legal peer Free Law ModernBERT (0.764), behind only Qwen3-Embedding-4B/8B (which are 27–53× larger). The agents love the prime cli. I only used the UI for paying my bill.

kmad's tweet photo. So /goal is awesome

Over the past few weeks I used @PrimeIntellect to train a 149M late interaction model based on GTE-ModernColBERT-v1 using PyLate, focused on clause extraction from legal contracts.

On the MLEB benchmark it does well for its size: it's the best accuracy-per-parameter open model on the task, 3rd of 17 open-source models, ahead of Google's EmbeddingGemma (308M, 0.829) and the same-size legal peer Free Law ModernBERT (0.764), behind only Qwen3-Embedding-4B/8B (which are 27–53× larger).

The agents love the prime cli. I only used the UI for paying my bill.

8

134

12

117

10K

Kevin Madura

@kmad

1 day ago

@antoine_chaffin None taken 😆 It was many /goals over time but… yes this was codex/cc doing the experimentation. I was just the human asking dumb questions.

1

0

173

Kevin Madura

@kmad

2 days ago

And yes this is another @lateinteraction inspired project

0

4

0

391

Kevin Madura

@kmad

2 days ago

HF is here: https://t.co/jzXBbZSffo MLEB benchmark: https://t.co/jVmbGkqe1U Please, poke holes, this is a learning experiment.

1

8

0

4

542

kmad retweeted

Gabriel Lespérance

@GabLesperance

2 days ago

RLMs are so resilient. Multiple times I've run into bugs in our setups. What's interesting is that those bugs only became apparent after careful trace reviews, because the RLM actually found a way forward despite some broken state. Truly mind-boggling.

7

110

7

43

12K

Kevin Madura

@kmad

2 days ago

@bradenjhancock @CAISconf 100% this was such a refreshing experience - the content, people, events - stellar work by the team

0

2

0

71

Kevin Madura

@kmad

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users