Minghao Yan @Minghao__Yan - Twitter Profile

12 days ago

🚀🚀🚀

Omenn-Darling Bioengineering Institute @omenndarlingbio

12 days ago

Scientific American has named ODBI’s Kaiyi Jiang to its inaugural list of Young American Scientists. Congrats, @idmjky! Jiang will join ODBI next month. Read more on the ODBI site: https://t.co/nziKegQQqm

omenndarlingbio's tweet photo. Scientific American has named ODBI’s Kaiyi Jiang to its inaugural list of Young American Scientists. Congrats, @idmjky!

Jiang will join ODBI next month. Read more on the ODBI site: https://t.co/nziKegQQqm https://t.co/EthZCPEnQV

0

8

1

0

1K

0

1

0

381

Minghao Yan @Minghao__Yan

16 days ago

@awsTO USMNT plays like how Canada sees itself in the mirror 🤣

0

1

0

52

Minghao__Yan retweeted

Hongyi Wang

@HongyiWang10

26 days ago

1/8) Excited to share PR²: Predictive Routing Replay for MoE-Based LLM RL! 🎉 This is also the first paper from our research group at @RutgersCS @RutgersU. While MoE LLMs scale remarkably well, RL training exposes a hidden source of instability: routing. Paper: https://t.co/uekygKHSON A special shout-out to my talented PhD student, @DaizeDongCS, for leading this project and driving many of its key ideas and experiments.

HongyiWang10's tweet photo. 1/8) Excited to share PR²: Predictive Routing Replay for MoE-Based LLM RL! 🎉

This is also the first paper from our research group at @RutgersCS @RutgersU.

While MoE LLMs scale remarkably well, RL training exposes a hidden source of instability: routing.

Paper: https://t.co/uekygKHSON

A special shout-out to my talented PhD student, @DaizeDongCS, for leading this project and driving many of its key ideas and experiments.

3

31

12

13

4K

Minghao Yan @Minghao__Yan

about 2 months ago

@ChengleiSi Hard agree! If NanoGPT speedrun and Karpathy’s auto research has taught us anything, it’s that we still need major breakthrough(s) for agents to discover paradigm shift ideas.

0

2

0

199

Who to follow

Dacheng Li

@DachengLi177

大风起兮云飞扬 | PhD @BerkeleySky, @berkeley_ai @lmsysorg | Prev: @Nvidia @SCSatCMU

Boxin Wang

@wbx_life

Sr. Research Scientist @NVIDIA | UIUC Ph.D @IllinoisCS | LLM Post-training | Ex-intern at MSR @MSFTResearch, Google Research @googleai

Yuke Wang

@YukeWang1

Assistant Professor at Rice CS | CS Ph.D. at UCSB | Deep Learning System | ex- Amazon, Microsoft Research, NVIDIA Research | NVIDIA Graduate Fellowship’22.

Minghao Yan @Minghao__Yan

4 months ago

Spot on! One of my biggest takeaways after working on AI scientists for a while.

Andrew Gordon Wilson

@andrewgwils

4 months ago

Being good at next word prediction is the opposite of what we want for creativity, for scientific breakthroughs.

18

141

13

30

30K

0

19

2

12

6K

Minghao Yan @Minghao__Yan

4 months ago

@karpathy We worked on building an evolutionary agent for the NanoGPT benchmark back in October and shared our findings in the paper: https://t.co/C8CxWBIGKH Similarly, we also observed that the agent is really good at tuning hyperparameters, designing context / lr / decay schedules!

0

1

0

145

Minghao Yan @Minghao__Yan

4 months ago

@_ScottCondron @karpathy Shameless self-plug here: we’ve worked on the very task of self-evolution on the NanoGPT benchmark in our paper! https://t.co/y0jMrWNyni

Minghao Yan @Minghao__Yan

5 months ago

We even deployed PACEvolve on the Modded NanoGPT challenge. Despite the benchmark being heavily optimized by the community, PACEvolve discovered further gains in data loading, network initialization, and tuned better hyperparameters.

1

0

1

1K

0

3

0

221

Minghao Yan @Minghao__Yan

4 months ago

Code drop here:

Minghao Yan @Minghao__Yan

4 months ago

Our code is finally out at https://t.co/2VxiNAXw4f. Run it with your favorite task and see if you can push the scientific frontier!

0

4

0

462

0

231

Minghao Yan @Minghao__Yan

5 months ago

🚀 Thrilled to introduce PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution. We show how to push LLM self-evolution beyond short, unstable improvements and into consistent, long-horizon gains. 🧵👇

Minghao__Yan's tweet photo. 🚀 Thrilled to introduce PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution.
We show how to push LLM self-evolution beyond short, unstable improvements and into consistent, long-horizon gains. 🧵👇 https://t.co/MBBE8j1LnR

1

27

8

10

3K

Minghao Yan @Minghao__Yan

5 months ago

This work was done during my internship at Google and would not have been possible without my mentors and collaborators across Google and DeepMind. Kudos to everyone involved! Paper: https://t.co/yYL82IjYFx Code drop coming soon, stay tuned!

1

3

0

246

Minghao Yan @Minghao__Yan

4 months ago

@karpathy Yes, we’ve been working on building a RSI agent on NanoGPT speedup since last Oct! Check it out in our paper: https://t.co/y0jMrWNyni

Minghao Yan @Minghao__Yan

5 months ago

We even deployed PACEvolve on the Modded NanoGPT challenge. Despite the benchmark being heavily optimized by the community, PACEvolve discovered further gains in data loading, network initialization, and tuned better hyperparameters.

1

0

1

1K

0

2

0

1

637

Minghao Yan @Minghao__Yan

4 months ago

@eliebakouch agree! we tried it in our paper and it has been an awesome experience learning about both the capabilities and the limitations of current frontier LLMs.

Minghao Yan @Minghao__Yan

5 months ago

We even deployed PACEvolve on the Modded NanoGPT challenge. Despite the benchmark being heavily optimized by the community, PACEvolve discovered further gains in data loading, network initialization, and tuned better hyperparameters.

1

0

1

1K

0

1

147

Minghao Yan @Minghao__Yan

4 months ago

Our code is finally out at https://t.co/2VxiNAXw4f. Run it with your favorite task and see if you can push the scientific frontier!

Minghao Yan @Minghao__Yan

5 months ago

🚀 Thrilled to introduce PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution. We show how to push LLM self-evolution beyond short, unstable improvements and into consistent, long-horizon gains. 🧵👇

1

27

8

10

3K

0

4

0

462

Minghao__Yan retweeted

Henry Shevlin

@dioscuri

6 months ago

All I want for Christmas is a new Matt Lakeman blogpost

0

8

1

2K

Minghao Yan

@Minghao__Yan

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users