Libin Zhu

@BusyZhu

Postdoc at UW

Seattle, WA

Joined August 2018

108 Following

24 Followers

2 Posts

BusyZhu retweeted

Neil Mallinar @nmallinar

almost 2 years ago

Grokking modular arithmetic is widely studied for the seemingly unique emergent abilities of neural networks. Instead, we find that iteratively solving a kernel machine and estimating the Average Gradient Outer Product (AGOP) recovers this phenomenon identically: https://t.co/Jib6GjN3TI

18K

BusyZhu retweeted

Stat.ML Papers @StatMLPapers

over 6 years ago

Toward a theory of optimization for over-parameterized systems of non-linear equations: the lessons of deep learning. (arXiv:2003.00307v1 [cs.LG]) https://t.co/Xq8xXulZSg
