Yang Liu

@yangpliu

CMU CSD. Prev: IAS / Stanford / MIT. Interested in algorithms, probability, combinatorics, etc.

Pittsburgh, PA

Joined August 2019

88 Following

985 Followers

40 Posts

Yang Liu

@yangpliu

9 days ago

@sushnt @rpeng233 Very nice! This feels like the "right proof" for the problem, and I fully expect that many more results of a similar style will be found by current and future models.

Yang Liu

@yangpliu

25 days ago

@david_yang__ haha, I remember that. Awesome that you're at Anthropic now!

Yang Liu

@yangpliu

25 days ago

@Tulsipuramkr1k 100% true

Yang Liu

@yangpliu

25 days ago

guess I’m ahead of the curve as usual

Franklyn Wang

@frank_liquid

26 days ago

the world is not ready

Yang Liu

@yangpliu

4 months ago

@sushnt Yes, good point. I realized later that each subset being light is not equivalent to the sum being light (even if they're vertex disjoint).

Yang Liu

@yangpliu

4 months ago

1/ Technical thread on #1stProof Problem 6: finding “spectrally light” vertex subsets in a graph, and how its solution fits into the landscape of spectral sparsification + restricted invertibility. Original thread: https://t.co/c9Z9RH2Ont

Yang Liu

@yangpliu

4 months ago

My thoughts on #1stProof Problem 6 (closely related to areas I've worked in): OpenAI’s solution is essentially correct, and the difficulty feels consistent with AI capabilities over the past several months. More detail in the thread.

Yang Liu

@yangpliu

4 months ago

I don't personally want to rate the proof vs. human researchers. More concretely, I feel that current models so far are quite strong at proving self-contained statements whose solutions we believe are based on ideas in the literature (or are sufficiently short). To me, this framework captures the advances we've seen on both the IMO/competition side and the math research side.

Yang Liu

@yangpliu

4 months ago

Just for intuition: the official proof informally says that as long as a set does not have too much "mass" on its neighboring edges, then it's possible to add a new vertex to it. So if I have many sets, then by an averaging argument, one of them must have less mass than average, so we can safely add a vertex to it.
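The averaging step can be written in one line. This is a sketch in assumed notation: r is the number of sets and m_i ≥ 0 stands for whatever notion of "mass" the proof assigns to the i-th set (the thread does not define it precisely).

```latex
% The minimum of r nonnegative quantities is at most their average,
% so at least one set is no heavier than average.
\[
  \min_{1 \le i \le r} m_i \;\le\; \frac{1}{r} \sum_{i=1}^{r} m_i .
\]
```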

Yang Liu

@yangpliu

4 months ago

@TFWNicholson Yes, it should be the pseudoinverse. If the graph is connected (for these problems, this can be assumed without loss of generality), the only vector in the nullspace is (1, 1, ..., 1) so it's easy to get a handle on the pseudoinverse.
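One standard way to make this concrete: for a connected graph on n vertices, the pseudoinverse satisfies L^+ = (L + J/n)^{-1} - J/n, where J is the all-ones matrix. Below is a minimal sketch in exact rational arithmetic. The helper names (`laplacian`, `inverse`, `matmul`, `laplacian_pinv`) and the path-graph example are illustrative assumptions, not anything taken from the thread.

```python
from fractions import Fraction


def laplacian(n, edges):
    """Graph Laplacian L = D - A of an unweighted graph on vertices 0..n-1."""
    L = [[Fraction(0)] * n for _ in range(n)]
    for u, v in edges:
        L[u][u] += 1
        L[v][v] += 1
        L[u][v] -= 1
        L[v][u] -= 1
    return L


def inverse(M):
    """Exact Gauss-Jordan inverse of a nonsingular square matrix."""
    n = len(M)
    # Augment M with the identity matrix on the right.
    A = [list(row) + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(M)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if A[r][col] != 0)
        A[col], A[pivot] = A[pivot], A[col]
        p = A[col][col]
        A[col] = [x / p for x in A[col]]
        for r in range(n):
            if r != col and A[r][col] != 0:
                f = A[r][col]
                A[r] = [x - f * y for x, y in zip(A[r], A[col])]
    return [row[n:] for row in A]


def matmul(A, B):
    """Plain matrix product of two lists-of-lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]


def laplacian_pinv(L):
    """Pseudoinverse of a connected graph's Laplacian: (L + J/n)^{-1} - J/n."""
    n = len(L)
    shift = Fraction(1, n)
    shifted = [[L[i][j] + shift for j in range(n)] for i in range(n)]
    inv = inverse(shifted)
    return [[inv[i][j] - shift for j in range(n)] for i in range(n)]


# Path graph 0 - 1 - 2 - 3: connected, so the nullspace is spanned by (1, 1, 1, 1).
L = laplacian(4, [(0, 1), (1, 2), (2, 3)])
P = laplacian_pinv(L)

# Sanity checks: P behaves as a pseudoinverse and annihilates the all-ones vector.
assert matmul(matmul(L, P), L) == L
assert all(sum(row) == 0 for row in P)
```

Adding J/n only changes the action on the all-ones direction, which is why the shifted matrix is invertible exactly when the graph is connected.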

Yang Liu

@yangpliu

4 months ago

Technical thread here: https://t.co/ozJN1psG7b

Yang Liu

@yangpliu

4 months ago

Still, I’m genuinely impressed by how far AI-for-math has come in the past few years, and I’m excited to see what’s next. If people want, I’m happy to write up more about the problem, the solution, and how it fits in the context of prior results.

Yang Liu

@yangpliu

4 months ago

End/ Overall Comparison: In my opinion, the problem and solution cleanly fall within the scope of previous methods, but one does have to nontrivially adapt the method to handle the subtlety described above. In terms of comparing the solutions, both give equally clean fixes to the issue.

Worth noting that the OpenAI solution proves something slightly stronger than is asked by the problem statement: it proves that a constant fraction of the vertices can be partitioned into O(1/ε) groups, so that the induced Laplacian on every group is spectrally at most ε*L, while the problem only asks for a single group of size ≥ ε*n.

Finally, I want to mention an open problem in this direction. You can ask whether there are other natural objectives that can be sparsified down to size O(n/ε^2). Concretely, consider the function f(x) = |Ax|_1 where A is an m x n matrix. Is there a matrix A' which is n' x n, whose rows are reweightings of rows of A, such that 0.5 * |Ax|_1 ≤ |A' x|_1 ≤ 2 |Ax|_1 for all x, and n' ≤ O(n)? The best known bound on n' is O(n log n) by random sampling + chaining techniques.
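For reference, the two statements in this tweet can be written out in symbols. The notation is assumed here rather than taken from the thread: S_1, ..., S_r are disjoint vertex sets, G[S_i] is the induced subgraph on S_i, c > 0 is an unspecified absolute constant, a_i is the i-th row of A, and w_i ≥ 0 is the weight given to that row in A'.

```latex
% Stronger statement attributed to the OpenAI solution:
\[
  r = O(1/\varepsilon), \qquad
  \Bigl|\bigcup_{i=1}^{r} S_i\Bigr| \ge c\,n, \qquad
  L_{G[S_i]} \preceq \varepsilon L_G \quad \text{for every } i .
\]
% Open problem: \ell_1 sparsification using O(n) reweighted rows of A.
\[
  \tfrac12 \|Ax\|_1 \;\le\; \sum_{i=1}^{m} w_i\,\bigl|\langle a_i, x\rangle\bigr| \;\le\; 2\,\|Ax\|_1
  \quad \text{for all } x \in \mathbb{R}^n,
  \qquad \bigl|\{\, i : w_i > 0 \,\}\bigr| = n' = O(n).
\]
```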

Yang Liu

@yangpliu

4 months ago

10/ OpenAI solution: it maintains r ≈ 1/ε different sets S_1, …, S_r, whose total size eventually is a constant fraction of n. The solution proves that you can always find one of the sets to add a new vertex to without increasing the potential: this makes sense, because on average some set should be “light”. In the screenshots below, M_t is the sum of Laplacians of each color class, and as you can see, the solution maintains a barrier function over M_t, and does not need to maintain any property of the leverage scores of S.
