Senthil Kumar @senthilkumarn_ - Twitter Profile

11 days ago

AI now can write long proofs, but autoformalization a research paper by Lean is still hard. 🚀 Check our LeanMarathon on 4 Erdős problems, fully autonomously! https://t.co/7zBO50oHnB Led by my student @yuanhezhang6 and collaboration with Yuekai, @btreetaiji and @jasondeanlee

Fanghui_SgrA's tweet photo. AI now can write long proofs, but autoformalization a research paper by Lean is still hard.

🚀 Check our LeanMarathon on 4 Erdős problems, fully autonomously! https://t.co/7zBO50oHnB

Led by my student @yuanhezhang6 and collaboration with Yuekai, @btreetaiji and @jasondeanlee https://t.co/KB4HqD3VTX

4

95

17

63

35K

SenthilKumarN_ retweeted

Pushmeet Kohli

@pushmeet

about 1 month ago

The future of Math is mathematicians and AI agents working together. Very pleased to introduce @GoogleDeepMind's AI co-mathematician: a multi-agent system designed to actively collaborate with human experts on open-ended research mathematics. Mathematicians testing the agent across areas as diverse as group theory, Hamiltonian systems, and algebraic combinatorics have reported impressive results. In autonomous mode evaluation on the rigorous FrontierMath Tier 4 problems, AI co-mathematician scored an unprecedented 48% — a new high score among all AI systems evaluated.

pushmeet's tweet photo. The future of Math is mathematicians and AI agents working together.

Very pleased to introduce @GoogleDeepMind's AI co-mathematician: a multi-agent system designed to actively collaborate with human experts on open-ended research mathematics.

Mathematicians testing the agent across areas as diverse as group theory, Hamiltonian systems, and algebraic combinatorics have reported impressive results.

In autonomous mode evaluation on the rigorous FrontierMath Tier 4 problems, AI co-mathematician scored an unprecedented 48% — a new high score among all AI systems evaluated.

172

3K

369

803

315K

SenthilKumarN_ retweeted

Yifan Zhang

@yifanzhang_

2 months ago

Introducing Math Code, our most capable frontier math prover. https://t.co/gZZJqxXMns

13

590

80

484

93K

SenthilKumarN_ retweeted

Ziran Yang @__zrrr__

3 months ago

Introducing Goedel-Code-Prover 🌲 LLMs write code, but can they prove it correct? Not just pass tests, but construct machine-checkable proofs that a program works for ALL possible inputs. We built a system that does exactly this. Given aprogram and its specification in Lean 4, Goedel-Code-Prover automatically synthesizes formal proofs ofcorrectness. Our 8B model achieves 62% overall success rate across three benchmarks (Verina, Clever &AlgoVeri), a 2.6x improvement over the strongest baseline, surpassing both frontier LLMs (GPT/Gemini/Claude)and open-source theorem provers up to 84x larger (DeepSeek-Prover/Goedel-Prover/Kimina-Prover/BFS-Prover).

__zrrr__'s tweet photo. Introducing Goedel-Code-Prover 🌲

LLMs write code, but can they prove it correct? Not just pass tests, but construct machine-checkable proofs that a program works for ALL possible inputs.

We built a system that does exactly this. Given aprogram and its specification in Lean 4, Goedel-Code-Prover automatically synthesizes formal proofs ofcorrectness.

Our 8B model achieves 62% overall success rate across three benchmarks (Verina, Clever &AlgoVeri), a 2.6x improvement over the strongest baseline, surpassing both frontier LLMs (GPT/Gemini/Claude)and open-source theorem provers up to 84x larger (DeepSeek-Prover/Goedel-Prover/Kimina-Prover/BFS-Prover).

21

557

76

397

71K

Who to follow

Arpan

@ArpanTripathi20

Subnet dev @vidaio_ | ex-AI & Subnet dev @nineteen_ai SN19 | MSc AI & ML @unibirmingham | CS alumni @PalakkadIIT

Junyoung Seo

@jyseo_cv

Ph.D. Student @KAIST_AI, working on visual generative models. RS Intern @NVIDIAAI Ex-Intern @Meta, @SonyAI_global. Collaborated with NAVER AI

Ramzi ⵣ

@_rram12

I teach machines for a living... the incarnation of Dionysus on earth Kaggle competitions expert

SenthilKumarN_ retweeted

Axiom

@axiommathai

3 months ago

We open-sourced Axplorer. Axplorer builds on PatternBoost; it discovers outlier math constructions to attack open problems. On Turán 4-Cycles, No 5 Points on Sphere, and Isosceles-Free Sets, Axplorer matched SOTA w/ a fraction of compute cost and time. It's now in your hands.

14

326

58

187

94K

SenthilKumarN_ retweeted

Matěj Kripner

@MatejKripner

3 months ago

I'm releasing OpenProver v1.0.0! It's 1) an open-source automated theorem prover inspired by DeepMind's Aletheia (@tonylfeng @gjb_ai @lmthang), and 2) a "Claude Code for mathematicians", allowing interactive proof search in English and formalization in Lean.

17

498

82

459

42K

SenthilKumarN_ retweeted

Math, Inc.

@mathematics_inc

3 months ago

Today, at the @DARPA expMath kickoff, we launched 𝗢𝗽𝗲𝗻𝗚𝗮𝘂𝘀𝘀, an open source and state of the art autoformalization agent harness for developers and practitioners to accelerate progress at the frontier. It is stronger, faster, and more cost-efficient than off-the-shelf alternatives. On FormalQualBench, running with a 4-hour timeout, it beats @HarmonicMath's Aristotle agent with no time limit. Users of OpenGauss can interact with it as much or as little as they want, can easily manage many subagents working in parallel, and can extend / modify / introspect OpenGauss because it is permissively open-source. OpenGauss was developed in close collaboration with maintainers of leading open-source AI tooling for Lean. Read the report and try it out:

mathematics_inc's tweet photo. Today, at the @DARPA expMath kickoff, we launched 𝗢𝗽𝗲𝗻𝗚𝗮𝘂𝘀𝘀, an open source and state of the art autoformalization agent harness for developers and practitioners to accelerate progress at the frontier.

It is stronger, faster, and more cost-efficient than off-the-shelf alternatives. On FormalQualBench, running with a 4-hour timeout, it beats @HarmonicMath's Aristotle agent with no time limit.

Users of OpenGauss can interact with it as much or as little as they want, can easily manage many subagents working in parallel, and can extend / modify / introspect OpenGauss because it is permissively open-source. OpenGauss was developed in close collaboration with maintainers of leading open-source AI tooling for Lean.

Read the report and try it out:

53

2K

389

2K

323K

SenthilKumarN_ retweeted

Leonardo de Moura @Leonard41111588

3 months ago

Prover correctness is becoming a central question as AI enters mathematics and software verification. New essay on why Lean's architecture is designed to survive AI pressure. https://t.co/CXaDTSEWum

6

241

45

110

17K

SenthilKumarN_ retweeted

Sebastian Raschka

@rasbt

3 months ago

I (finally) put together a new LLM Architecture Gallery that collects the architecture figures all in one place! https://t.co/NO7z6XSRHS

rasbt's tweet photo. I (finally) put together a new LLM Architecture Gallery that collects the architecture figures all in one place!
https://t.co/NO7z6XSRHS https://t.co/X41FrK4i94

202

8K

1K

8K

734K

SenthilKumarN_ retweeted

Pushmeet Kohli

@pushmeet

3 months ago

Happy to share new progress in AI for Maths @GoogleDeepMind . In extremal combinatorics, AlphaEvolve has helped establish new lower bounds for FIVE classical Ramsey numbers - a problem so challenging that even Erdős commented on its difficulty. Historically, computationally deriving these bounds required bespoke, human-designed search algorithms. For many of these bounds, the best previous results are at least a decade old. AlphaEvolve changes this by acting as a single meta-algorithm that automatically discovers the search procedures needed to find these new bounds. 📷

59

3K

321

715

468K

SenthilKumarN_ retweeted

Ken Ono

@KenOno691

3 months ago

Great read by Allyn Jackson on how AI is reshaping mathematics. Also thanks for the nod Allyn. Highly recommend checking this one out: https://t.co/O5maECW3fz

KenOno691's tweet photo. Great read by Allyn Jackson on how AI is reshaping mathematics. Also thanks for the nod Allyn. Highly recommend checking this one out: https://t.co/O5maECW3fz https://t.co/blKqeDj2Wt

0

66

15

50

7K

SenthilKumarN_ retweeted

Axiom

@axiommathai

3 months ago

1/ RELEASING AXLE: the Axiom Lean Engine ⚙️ We are serving our core Infrastructure for formal proving at scale. These are the same Lean metaprogramming tools that are behind AxiomProver, powering it to win Putnam and crack open research conjectures. Available to anyone today!

axiommathai's tweet photo. 1/ RELEASING AXLE: the Axiom Lean Engine ⚙️

We are serving our core Infrastructure for formal proving at scale.

These are the same Lean metaprogramming tools that are behind AxiomProver, powering it to win Putnam and crack open research conjectures.

Available to anyone today! https://t.co/Ak9WnKE7zZ

11

423

64

214

114K

SenthilKumarN_ retweeted

Leonardo de Moura @Leonard41111588

3 months ago

AI is writing a growing share of the world's software. No one is formally verifying any of it. New essay: "When AI Writes the World's Software, Who Verifies It?" https://t.co/8zjS9FkdA8

41

2K

246

2K

423K

SenthilKumarN_ retweeted

Prof. Anima Anandkumar

@AnimaAnandkumar

4 months ago

We’re excited to release TorchLean which is the first fully verified neural network framework in Lean. The Lean community has largely focused on pure mathematics. TorchLean expands this frontier toward verified neural network software and scientific computing. With the recent release of CSlib, we see this as another step toward a fully verified ML stack. We support features: 1. Executable IEEE-754 floating-point semantics (and extensible alternative FP models) verified tensor abstractions with precise shape/indexing semantics 2. Formally verified autograd system for differentiation of NN programs Proof-checked certification / verification algorithms like CROWN (robustness, bounds, etc.) 3. PyTorch-inspired modeling API with eager-style development + export/lowering to a shared IR for execution and verification Project page: https://t.co/YHpqhRbMQe Paper: [2602.22631] TorchLean: Formalizing Neural Networks in Lean Work done @Robertljg, Jennifer Cruden, Xiangru Zhong, @huan_zhang12 and @AnimaAnandkumar. #MachineLearning #ScientificComputing #Lean

AnimaAnandkumar's tweet photo. We’re excited to release TorchLean which is the first fully verified neural network framework in Lean. The Lean community has largely focused on pure mathematics. TorchLean expands this frontier toward verified neural network software and scientific computing. With the recent release of CSlib, we see this as another step toward a fully verified ML stack.

We support features:
1. Executable IEEE-754 floating-point semantics (and extensible alternative FP models) verified tensor abstractions with precise shape/indexing semantics
2. Formally verified autograd system for differentiation of NN programs Proof-checked certification / verification algorithms like CROWN (robustness, bounds, etc.)
3. PyTorch-inspired modeling API with eager-style development + export/lowering to a shared IR for execution and verification

Project page: https://t.co/YHpqhRbMQe
Paper: [2602.22631] TorchLean: Formalizing Neural Networks in Lean
Work done @Robertljg, Jennifer Cruden, Xiangru Zhong, @huan_zhang12 and @AnimaAnandkumar.

#MachineLearning #ScientificComputing #Lean

26

2K

246

946

149K

SenthilKumarN_ retweeted

Thang Luong

@lmthang

4 months ago

Thrilled to share: #Aletheia, our math research agent, just solved 6/10 notoriously hard FirstProof problems autonomously, the best result in the inaugural challenge! To me, this is even bigger than our historic IMO-gold achievement last year; these problems challenge even top mathematicians. We share our results transparently, see paper and full thoughts in the thread. 👇

lmthang's tweet photo. Thrilled to share: #Aletheia, our math research agent, just solved 6/10 notoriously hard FirstProof problems autonomously, the best result in the inaugural challenge! To me, this is even bigger than our historic IMO-gold achievement last year; these problems challenge even top mathematicians. We share our results transparently, see paper and full thoughts in the thread. 👇

30

924

153

288

162K

SenthilKumarN_ retweeted

alphaXiv

@askalphaxiv

4 months ago

“Learning Without Training” The current problem is that most learning on manifolds pipelines still rely on a brittle two-step recipe: first estimate the manifold, then learn a predictor. So errors and hyperparameters can easily stack up. This paper introduces a paradigm for machine learning that constructs models directly from data using mathematically derived kernels and functional analysis instead of iterative optimization. This means you can often skip training and manifold learning entirely. You can just take your examples, and predict new ones by doing a smart weighted average of nearby points using a carefully designed kernel, kinda like local smoothing with math guarantees. This creates a blueprint for fast and stable learning without backprop, and more like a plug-and-play geometry + linear algebra than train a huge model and pray it converges.

askalphaxiv's tweet photo. “Learning Without Training”

The current problem is that most learning on manifolds pipelines still rely on a brittle two-step recipe: first estimate the manifold, then learn a predictor. So errors and hyperparameters can easily stack up.

This paper introduces a paradigm for machine learning that constructs models directly from data using mathematically derived kernels and functional analysis instead of iterative optimization.

This means you can often skip training and manifold learning entirely. You can just take your examples, and predict new ones by doing a smart weighted average of nearby points using a carefully designed kernel, kinda like local smoothing with math guarantees.

This creates a blueprint for fast and stable learning without backprop, and more like a plug-and-play geometry + linear algebra than train a huge model and pray it converges.

18

743

118

709

49K

SenthilKumarN_ retweeted

Yi Tay

@YiTayML

4 months ago

Introducing Aletheia, a math research agent powered by an advanced version of Gemini Deep Think that produces publishable math research (two papers, one completely automatic and another with human-AI collaboration) and solved multiple open Erdős problems. 😀🔥 Paper link below! 👇

YiTayML's tweet photo. Introducing Aletheia, a math research agent powered by an advanced version of Gemini Deep Think that produces publishable math research (two papers, one completely automatic and another with human-AI collaboration) and solved multiple open Erdős problems. 😀🔥

Paper link below! 👇

29

906

113

446

92K

SenthilKumarN_ retweeted

alphaXiv

@askalphaxiv

4 months ago

"First Proof" A team of researchers proposes a way to test if AI can actually do NEW math by releasing 10 freshly-solved and never public research questions, with answers temporarily encrypted. This let's the community able to measure the genuine performance of LLMs on proof-generation, before their solutions drop. Questions include: - stochastic analysis - p-adic representation theory - algebraic combinatorics - spectral graph theory - equivariant algebraic topology - lattices in Lie groups/topology - symplectic geometry - tensor algebraic relations - numerical linear algebra

askalphaxiv's tweet photo. "First Proof"

A team of researchers proposes a way to test if AI can actually do NEW math by releasing 10 freshly-solved and never public research questions, with answers temporarily encrypted.

This let's the community able to measure the genuine performance of LLMs on proof-generation, before their solutions drop.

Questions include:
- stochastic analysis
- p-adic representation theory
- algebraic combinatorics
- spectral graph theory
- equivariant algebraic topology
- lattices in Lie groups/topology
- symplectic geometry
- tensor algebraic relations
- numerical linear algebra

42

839

148

433

76K

SenthilKumarN_ retweeted

Dawei Zhu

@dwzhu128

4 months ago

[1/n] Super excited to introduce PaperBanana 🍌! (PKU x Google Cloud AI) As AI researchers, we often spend way too much time crafting diagrams and plots instead of focusing on the ideas 🤯. To rescue us from this burden, we built an Agentic Framework to auto-generate NeurIPS-quality paper illustrations! 📄 Paper: https://t.co/2NbQeEhzMv 🌐 Page: https://t.co/05dKkjVs7f Key Features: 🌟 Human-like Workflow: Retrieve 🔍 -> Plan 📝 -> Style 🎨 -> Render 🖼️ -> Critique 🔄. This ensures both academic fidelity and aesthetics. 🌟 Versatile: Supports both illustrative diagrams and statistical plots. 🌟 Polishing: Also effective for polishing existing human-drawn diagrams. Here are some example diagrams and plots generated by our PaperBanana:

dwzhu128's tweet photo. [1/n]

Super excited to introduce PaperBanana 🍌! (PKU x Google Cloud AI)

As AI researchers, we often spend way too much time crafting diagrams and plots instead of focusing on the ideas 🤯. To rescue us from this burden, we built an Agentic Framework to auto-generate NeurIPS-quality paper illustrations!

📄 Paper: https://t.co/2NbQeEhzMv
🌐 Page: https://t.co/05dKkjVs7f

Key Features:
🌟 Human-like Workflow: Retrieve 🔍 -> Plan 📝 -> Style 🎨 -> Render 🖼️ -> Critique 🔄. This ensures both academic fidelity and aesthetics.
🌟 Versatile: Supports both illustrative diagrams and statistical plots.
🌟 Polishing: Also effective for polishing existing human-drawn diagrams.

Here are some example diagrams and plots generated by our PaperBanana:

66

2K

409

2K

262K

SenthilKumarN_ retweeted

Quoc Le

@quocleix

4 months ago

Excited to share our latest work: "Semi-Autonomous Mathematics Discovery with Gemini." We used Gemini to systematically evaluate 700 "open" conjectures in the Erdős Problems database. The result? We addressed 13 problems marked as open—finding 5 novel autonomous solutions and identifying 8 existing solutions missed by previous literature. Read the full case study here: https://t.co/y4WhkP4ETO

quocleix's tweet photo. Excited to share our latest work: "Semi-Autonomous Mathematics Discovery with Gemini." We used Gemini to systematically evaluate 700 "open" conjectures in the Erdős Problems database.

The result? We addressed 13 problems marked as open—finding 5 novel autonomous solutions and identifying 8 existing solutions missed by previous literature.

Read the full case study here: https://t.co/y4WhkP4ETO

45

1K

203

482

247K

Senthil Kumar

@SenthilKumarN_

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users