Jason Rute @JasonRute - Twitter Profile

Pinned Tweet

3 months ago

Announcing our fully open source code agent to support development in @leanprover. This has been a labor of love by our team at @MistralAI and we look forward to seeing what the #LeanProver community does with it!

JasonRute's tweet photo. Announcing our fully open source code agent to support development in @leanprover. This has been a labor of love by our team at @MistralAI and we look forward to seeing what the #LeanProver community does with it! https://t.co/Zt1texfIar

1

162

28

55

10K

Jason Rute @JasonRute

9 days ago

@prz_chojecki They said they won’t announce it until July. You could reach out to them with your findings (there is a thread on the Lean Zulip). Unfortunately, it will be an uphill battle to convince them to read the model output (especially if you don’t understand it) for obvious reasons.

1

0

46

Jason Rute @JasonRute

12 days ago

@gro_tsen Nevermind. I see you say for arbitrarily large n and talk just the sup of delta. My bad.

1

0

19

Jason Rute @JasonRute

12 days ago

@gro_tsen I think you also need a big-O or little-O fudge term. For example Sawin’s result is n^1.014114/C for some constant C and Erdos’s original was n^{1 + o(1)}. (Or maybe this is implied without saying in your post.)

2

0

35

Who to follow

Jesse Michael Han

@jessemhan

@mathematics_inc // prev. research @OpenAI 'turboautist' 'only guy i know who talks like an anime character'

Lawrence Paulson

@LawrPaulson

Computer scientist with a background in mathematics and logic. Academic researching formal verification technologies and applications. Also in the other place.

Kaiyu Yang

@KaiyuYang4

Chief Scientist, Verifiable AI Lab of @miromind_ai. Previously: Research Scientist @FAIR, Postdoc @Caltech, PhD @PrincetonCS, Undergrad @Tsinghua_Uni.

Jason Rute @JasonRute

18 days ago

@aaswaminathan01 @mathandcobb While I think your take has some truth (we will soon be able to autoformalize a nontrivial amount of math papers into say Lean), I think it is missing a large degree of technical, practical, and sociological nuance.

0

1

0

58

Jason Rute @JasonRute

19 days ago

@Kseniase_ @CarinaLHong @ylecun @logic_int @evelovesolive I agree with @CarinaLHong. Everything I’ve seen from @evelovesolive says that Aleph prover is not an EBM but an LLM wrapper. @evelovesolive has even said it in X replies when asked, but I would have to search to find them.

2

5

0

1

227

Jason Rute @JasonRute

20 days ago

@danrobinson I think it might be unethical to raise Erdős from the dead, although the idea has been considered for other purposes: https://t.co/5lNH6bQBOa

0

3

0

1

213

Jason Rute @JasonRute

20 days ago

@LatinumAI Did you rewrite the interpreter too or just the compiler? I guess now @leanprover needs an external compiler bench (alongside their external kernel bench).

1

0

113

Jason Rute @JasonRute

21 days ago

@lpachter Do you have good examples?

1

0

149

Jason Rute @JasonRute

21 days ago

@ChrSzegedy @prz_chojecki That paper was very influential to my view on this field. Especially the autoformalization/proving flywheel. It feels close.

0

31

Jason Rute @JasonRute

21 days ago

@littmath Can you explain “verification is the bottleneck”?

2

3

0

671

Jason Rute @JasonRute

22 days ago

@giffmana Math is usually fairly robust to errors, much more than code. There are lots of articles about why. This particular benchmark however is designed adversarially to be very fiddly, calculation based, and non-intuitive (else the model will guess the solution).

0

1

0

144

Jason Rute @JasonRute

23 days ago

@EpochAIResearch One third of solved problems? Or unsolved problems? Or both?

1

14

0

5K

Jason Rute @JasonRute

23 days ago

@ElliotGlazer Is there something special about aleph_17?

1

5

0

315

Jason Rute @JasonRute

25 days ago

@VictorTaelin Have you ever considered writing an external checker for Lean? (I assume this particular is different from Lean.)

0

101

Jason Rute @JasonRute

25 days ago

@lacker @julianboolean_ I think we have plenty of more difficult problems already? But if it is a new conjecture, it would be interesting (at least the first time) exactly because the AIs are trying to convince us it is important.

0

2

0

31

Jason Rute @JasonRute

25 days ago

@lacker @julianboolean_ There are a number of good videos aimed at more general audiences explaining advanced math concepts including fields medal winning papers. In this hypothetical scenario where AI is good at everything, they would also be good at making this kind of content.

3

2

0

75

Jason Rute @JasonRute

25 days ago

@littmath Did it work out that it is likely open, or find out from say your paper?

1

2

0

458

Jason Rute @JasonRute

26 days ago

@j_dekoninck @xeophon I know one benchmark where over half the problems are impossible to get correct.

1

0

40

Jason Rute @JasonRute

26 days ago

@Anthony_Bonato AI can do the verification as well, especially with formal verification (but humans can still verify the verifiers).

0

4

0

207

Jason Rute

@JasonRute

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users