Semon Rezchikov @eigenstate - Twitter Profile

Pinned Tweet

4 months ago

For the record I think that progress in AI-for-math will completely change the way math research is done over the next few years. I am not at all a skeptic. I just want a) honesty b) for people to understand what math research *really is*.

3

64

4

9

9K

Semon Rezchikov @eigenstate

7 days ago

@SebastienBubeck @AlexKontorovich The important question scientifically is whether once we have this unit-distance-solving ChatGPT whether it also solves some huge fraction of all open problems mathematicians care about!

0

492

eigenstate retweeted

American Bird Conservancy @ABCbirds

4 months ago

🥇 We aren't saying Alysa Liu "Stole the Look" from the White-crowned Sparrow, but we aren't not saying it either. Congrats on an epic win! Learn how to help birds on and off the ice on our blog: https://t.co/Mf2aH0C7jO

ABCbirds's tweet photo. 🥇 We aren't saying Alysa Liu "Stole the Look" from the White-crowned Sparrow, but we aren't not saying it either. Congrats on an epic win!

Learn how to help birds on and off the ice on our blog:
https://t.co/Mf2aH0C7jO https://t.co/ClpCwMe5Nk

44

12K

825

461

590K

Semon Rezchikov @eigenstate

4 months ago

@littmath @jasondeanlee Then again, the unofficial (but widely distributed) pronouncements of folks at various labs were also too early. (I have the god-given right to complain about *everyone*!)

1

7

0

324

Who to follow

guille

@angeris

chief (mad) scientist @baincapcrypto alt: @tarunchitra

Vidit Nanda

@viditnanda

prof @oxunimaths & fellow @pembrokeoxford.

Rahul G. Krishnan

@rahulgk

Assistant Professor @UofTCompSci @LMP_UofT & @VectorInst Prev @MSRNE @MITEECS Teaching neural networks causality, physics, medicine and biology.

Semon Rezchikov @eigenstate

4 months ago

Key properties: a) output only depends on x + maybe rng b) no further inputs x’ are provided to f by A after x is revealed by B.

1

0

255

Semon Rezchikov @eigenstate

4 months ago

What is the status of verified secret computation? Specifically, suppose company A wants to claim that they have a fixed secret function f (some massive tool-using LLM ensemble) which at t_0 will be evaluated on value x submitted by unrelated org B. How A prove this to B? (1/2)

2

1

0

1

336

Semon Rezchikov @eigenstate

4 months ago

Given that there is no formal evaluation of the 1st proof challenge for round 1, it does not seem fair to interpret initial results in any way whatesoever. AI hype notwithstanding. Let's do it properly for round 2.

Harvard Department of Mathematics @HarvardMath

4 months ago

"The verdict, it seems, is in: artificial intelligence is not about to replace mathematicians. That is the immediate takeaway from the “First Proof” challenge—perhaps the most robust test yet of the ability of LLMs to perform mathematical research." https://t.co/fiq6HXyYj1

20

208

54

102

108K

1

22

0

2

2K

Semon Rezchikov @eigenstate

4 months ago

@dieworkwear You should do a Cedric villani critique!!!

0

1

0

529

Semon Rezchikov @eigenstate

4 months ago

@ben_golub As I understand it, the organizers are not treating the "first release" as a formal benchmark that they will be writing official evaluations for -- this will happen with subsequent releases. You should probably wait for the First Proof org to formally comment.

0

6

0

2K

Semon Rezchikov @eigenstate

4 months ago

@MysteryHacker1 I'm not at all a relevant expert, and obviously I can't make any sense of your links. I'm sure that you can find relevant experts and explain your dramatically improved argument in a conventional manner and they would be appreciative!

0

1

0

25

Semon Rezchikov @eigenstate

4 months ago

Actually benchmark idea : is it easy to redo the computer part of the 4 color theorem argument with AI assistance now? :D and write up a nice streamlined summary of the strategy that undergrads can understand?

Bojan Tunguz

@tunguz

4 months ago

I’ll take some time to think through the implications of this work, but my first impression is that it’s a Quantum Field Theory equivalent of the four color theorem - computer assisted proof, rather than computer generated. And definitely not profound new physics from scratch.

15

125

3

20

23K

5

9

2

5

3K

eigenstate retweeted

Semon Rezchikov @eigenstate

4 months ago

@boazbaraktcs I totally agree that users will be constantly using models to prove research level lemmas by Feb 2027. (Would be idiotic if not.) Your earlier claim reads like “in 6 months math centaurs are ~ over”, very different!

1

10

1

1K

Semon Rezchikov @eigenstate

4 months ago

This did not happen (The OAI work is super interesting, I don’t know why I have to caveat this)

Jason Lee

@jasondeanlee

4 months ago

Must be at 8/10 or 9/10 for the first proof. Some prodding from the boss to deliver on the last mile

1

31

1

11

16K

0

9

0

3K

Semon Rezchikov @eigenstate

4 months ago

@boazbaraktcs I totally agree that users will be constantly using models to prove research level lemmas by Feb 2027. (Would be idiotic if not.) Your earlier claim reads like “in 6 months math centaurs are ~ over”, very different!

1

10

1

1K

Semon Rezchikov @eigenstate

4 months ago

@AcerFur Including the chat instances is important! I really do think with a mathy human thinking about them while talking to a model quite a lot of them are solvable straightforwardly — these are lemmas, after all.

0

57

Semon Rezchikov @eigenstate

4 months ago

@deevrod Also to be fair, having the AI solve new problems is obviously super cool. Why not do it!

1

0

155

Semon Rezchikov @eigenstate

4 months ago

For the record I think that progress in AI-for-math will completely change the way math research is done over the next few years. I am not at all a skeptic. I just want a) honesty b) for people to understand what math research *really is*.

3

64

4

9

9K

Semon Rezchikov @eigenstate

4 months ago

@deevrod Because of strong financial and reputational incentives!

1

0

350

Semon Rezchikov @eigenstate

4 months ago

At this point I use the models on many days; they were net negative back in August and are solidly net positive now.

0

14

1

0

1K

Semon Rezchikov @eigenstate

4 months ago

Cheating here means claiming 2 when really something like 1 happened. It's not that they're not both meaningful, but they show pretty different behaviors and that's important. Oneshotting these problems totally autonomously (use many agents sure) is meaningfully different.

0

7

0

449

Semon Rezchikov @eigenstate

4 months ago

Re: #1stproof I just want to point out I think that 1) a math human with moderately relevant background can solve many of these problems by interactively talking to a model and then giving it hints, 2) this is extremely different from 1-shot performance, 3) it's tempting to cheat

1

8

1

0

1K

Semon Rezchikov

@eigenstate

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users