Weinan Sun

@sunw37

NeuroAI @AsteraInstitute Prev. Asst Prof @Cornell

Joined February 2016

784 Following

795 Followers

895 Posts

Pinned Tweet

Weinan Sun @sunw37

over 1 year ago

1/12 How do animals build an internal map of the world? In our new paper, we tracked thousands of neurons in mouse CA1 over days/weeks as they learned a VR navigation task. @nspruston @HHMIJanelia, w/ co-1st author @JohanWinn Video summary: https://t.co/mMvsT8TVc9 Paper: https://t.co/35k5tWdvOe

216

116

48K

Weinan Sun @sunw37

2 months ago

@sofroniewn @AnthropicAI super cool!

116

sunw37 retweeted

Doris Tsao

@doristsao

2 months ago

We show face patches implement the following code through recurrent dynamics: Detect face If (face found) Discriminate face else Continue to detect face IMHO, our paper conclusively resolves a debate that has raged since I was a graduate student, about whether face patches are specialized for processing faces or not. It turns out domain-general folks were right early on, domain-specific folks were right later in the response. So proud of @Yuelin_Shi and the entire team!

148

22K

Weinan Sun @sunw37

3 months ago

@karpathy I think you will enjoy reading about the hippocampal-neocortical system in neuroscience, a lot of continual learning algorithm hints can be found there. Agree more unconventional research is needed.

Andrej Karpathy

@karpathy

3 months ago

There was a nice time where researchers talked about various ideas quite openly on twitter. (before they disappeared into the gold mines :)). My guess is that you can get quite far even in the current paradigm by introducing a number of memory ops as "tools" and throwing them into the mix in RL. E.g. current compaction and memory implementations are crappy, first, early examples that were somewhat bolted on, but both can be fairly easily generalized and made part of the optimization as just another tool during RL. That said neither of these is fully satisfying because clearly people are capable of some weight-based updates (my personal suspicion - mostly during sleep). So there should be even more room for more exotic approaches for long-term memory that do change the weights, but exactly - the details are not obvious. This is a lot more exciting, but also more into the realm of research outside of the established prod stack.

273

292

611K

636

Who to follow

Andrew Saxe

@SaxeLab

Prof at @GatsbyUCL and @SWC_Neuro, trying to figure out how we learn. Bluesky: @SaxeLab Mastodon: @[email protected]

Jonathan Pillow

@jpillowtime

Computational neuroscientist @ Princeton. Into brains, math, machine learning. Brain is a computer 4eva.

Kenneth D Harris

@kennethd_harris

Neuroscientist at University College London

Weinan Sun @sunw37

3 months ago

Great opportunity in an amazing team. Highly recommended! @E11BIO @JohanWinn

Johan Winnubst

@JohanWinn

3 months ago

🚀 Looking to hire a light-sheet microscopy scientist to help us design, build, and optimize the next-gen platform that will power mammalian connectomics at scale. If you’re excited to join a fast-paced team doing high-impact science, take a look: https://t.co/wRN2INtnwd🔬⚙️

661

369

Weinan Sun @sunw37

4 months ago

Everything is open-source and designed as building blocks, not just for our lab. We're relicensing under Apache 2.0 in the coming weeks to make adoption even easier. We built this for 2-photon imaging + VR behavior in mice, but the patterns should transfer to any hardware-intensive lab. If your science hits a wall at the hardware layer, this is for you. Paper: https://t.co/t4ken72VD5 Code: https://t.co/4SXZ8Vlr1p

102

Weinan Sun @sunw37

4 months ago

Closed-loop AI scientists will need to talk to lab hardware. Towards that goal, we released Ataraxis - an open-source framework that gives AI coding assistants direct access to physical instruments. Built by our talented Ph.D. student @InkarosEng @CornellNBB @Cornell. Paper: https://t.co/t4ken72VD5 Code: https://t.co/4SXZ8Vlr1p Demos videos: 🎥 Pre-session validation: https://t.co/SnbG8VH5wk 🎥 Hardware troubleshooting: https://t.co/1cAc97Fyq0 🎥 AI-guided hardware integration: https://t.co/ip6tAtvOle

198

Weinan Sun @sunw37

4 months ago

Key design choice: AI helps at configuration time only. During actual experiments, everything runs deterministically with no AI in the loop. Network goes down? API rate limited? Doesn't matter - your experiment keeps running.

112

sunw37 retweeted

Victor M

@victormustar

about 3 years ago

I just made my own Star Wars clip using AI (text-to-video)!

263

198

453

sunw37 retweeted

Toon Van de Maele @toonvdm

4 months ago

(1/n) How do we generalize knowledge across similar experiences? In our new preprint, we introduce S-HAI: a hierarchical active inference model that captures "schemas" used by humans and animals to generalize task abstractions. https://t.co/hwv6y4GBbA 🧵

120

sunw37 retweeted

Alice Ting

@aliceyting

5 months ago

Janelia is hiring Group Leaders! Work alongside superstars like Luke Lavis, @JiefuBiol, @mengwang939, @JLS_Lab, @ERSchreiter. If you have ideas that are perhaps too ambitious or crazy for a traditional academic setting, that is exactly who they are looking for. Tool-builders at all career stages with original, transformative ideas for experimental approaches in imaging, molecular engineering, protein chemistry, mass spectrometry, and methods that don’t yet exist: Apply by Feb. 3, 2026: https://t.co/WcEOWPoxw5

226

32K

sunw37 retweeted

Niels Rogge @NielsRogge

5 months ago

"Before Transformers, RNNs were the thing. These were a big breakthrough. Suddenly, everyone started to work on improving RNNs. But the results were always these slight modifications on the same architecture, like putting the gate in a different spot, with improvements to 1.26, 1.25 bits per character on language modeling." "After the Transformer, when we applied very deep decoder-only Transformers to the same task, we immediately got 1.1 bits per character. So all that research on RNNs suddenly seemed a waste of time". "We're currently in the same situation where a lot of papers are taking the same architecture (Transformer) and making these endless tweaks, in a local minimum, and we might be wasting time in exactly the same way." - Llion Jones, co-author of the Transformer on @MLStreetTalk

NielsRogge's tweet photo. "Before Transformers, RNNs were the thing. These were a big breakthrough. Suddenly, everyone started to work on improving RNNs. But the results were always these slight modifications on the same architecture, like putting the gate in a different spot, with improvements to 1.26, 1.25 bits per character on language modeling."

"After the Transformer, when we applied very deep decoder-only Transformers to the same task, we immediately got 1.1 bits per character. So all that research on RNNs suddenly seemed a waste of time".

"We're currently in the same situation where a lot of papers are taking the same architecture (Transformer) and making these endless tweaks, in a local minimum, and we might be wasting time in exactly the same way."

- Llion Jones, co-author of the Transformer on @MLStreetTalk

177

650

190K

sunw37 retweeted

Dileep George

@dileeplearning

6 months ago

If no one builds it, you're never born. ..not building AGI is a risky thing. Wilbur Wright, inventor of airplanes, died in 1912 at the age of 45 from typhoid fever because antibiotics didn’t exist then. Just imagine how our world would be without the medical and technological advancements of the past century! Actually, you wouldn’t have to imagine, because you wouldn’t exist! Inventing technology and advancing science is how we overcame our challenges and managed to support 8 billion human souls on this planet, escaping the Malthusian trap of famines, diseases, and conflicts. Automation of knowledge acquisition and thought is the next step, and the best tool humanity can build. The risk of not building AGI is that we won’t be prepared for the challenges the world throws at us, some of which would be challenges that our own existence creates. A(G)I safety is important, and here are my thoughts about it. 1. Scaling up current techniques is not going to lead to AGI. It will lead to powerful AI systems, but these will be supported by a lot of engineered scaffolding. In these cases, making AI work usefully is almost exactly the same as making AI safe. Since scaling has already proven to be useful, we are naturally on the path to exploiting it to the maximum, and we should. 2. We will eventually figure out how to build and scale AI that uses principles of human intelligence. These systems will learn causal structure and reliable world models that can be used for counterfactual thinking. This will lead to much more capable AI systems and AGI. But for these kinds of systems, increasing capability can also come with increasing controllability. Where powerful AI and AGI is going to help us Earthquakes, wildfires, hurricanes, floods: Despite all the technological advances, we are still at the mercy of nature when it comes to these disasters. Where are the army of robots helping to dig out people from collapsed buildings? Where are the ones to manage fires and help people? Having AGI means we will have robots that will help us in these cases to save lives and to recover faster. Health: Antibiotic resistance, pandemics, …, we don’t know what challenges we will face in the future and it would be great to have powerful tools. In general, having much better understanding of how our bodies and minds work, and curing of diseases. Flora and Fauna: Instead of conforming to the requirements of dumb machines, we might finally be able to do more organic multi-crop agriculture, reduce the amount of pesticides we use, and abolish factory farming. Intelligent machines will free us from the economic necessity of these. Climate change, Energy, Materials, Education, Transportation, Space …. examples like these abound in each of those areas. Things that we accomplished crudely with dumb machinery will be done with more finesse with intelligent machines, and that will be important for humanity to thrive at scale. Balancing the risks… Of course the title is a play on the Yudkowsky and Soares book “If anyone builds it, everyone dies”. While I disagree with many things that the book asserts, their work has brought attention to the important problem of AI safety. Smart people working on AI safety is a good thing. It is important to continue that work, even if the specific x-risk scenarios in the book can be taken apart. In the midst of all the talk about the risks of AGI it is important to realize that not building AGI has risks as well.

dileeplearning's tweet photo. If no one builds it, you're never born.
..not building AGI is a risky thing.

Wilbur Wright, inventor of airplanes, died in 1912 at the age of 45 from typhoid fever because antibiotics didn’t exist then. Just imagine how our world would be without the medical and technological advancements of the past century! Actually, you wouldn’t have to imagine, because you wouldn’t exist!

Inventing technology and advancing science is how we overcame our challenges and managed to support 8 billion human souls on this planet, escaping the Malthusian trap of famines, diseases, and conflicts.

Automation of knowledge acquisition and thought is the next step, and the best tool humanity can build. The risk of not building AGI is that we won’t be prepared for the challenges the world throws at us, some of which would be challenges that our own existence creates.

A(G)I safety is important, and here are my thoughts about it.

1. Scaling up current techniques is not going to lead to AGI. It will lead to powerful AI systems, but these will be supported by a lot of engineered scaffolding. In these cases, making AI work usefully is almost exactly the same as making AI safe. Since scaling has already proven to be useful, we are naturally on the path to exploiting it to the maximum, and we should.

2. We will eventually figure out how to build and scale AI that uses principles of human intelligence. These systems will learn causal structure and reliable world models that can be used for counterfactual thinking. This will lead to much more capable AI systems and AGI. But for these kinds of systems, increasing capability can also come with increasing controllability.

Where powerful AI and AGI is going to help us

Earthquakes, wildfires, hurricanes, floods: Despite all the technological advances, we are still at the mercy of nature when it comes to these disasters. Where are the army of robots helping to dig out people from collapsed buildings? Where are the ones to manage fires and help people? Having AGI means we will have robots that will help us in these cases to save lives and to recover faster.

Health: Antibiotic resistance, pandemics, …, we don’t know what challenges we will face in the future and it would be great to have powerful tools. In general, having much better understanding of how our bodies and minds work, and curing of diseases.

Flora and Fauna: Instead of conforming to the requirements of dumb machines, we might finally be able to do more organic multi-crop agriculture, reduce the amount of pesticides we use, and abolish factory farming. Intelligent machines will free us from the economic necessity of these.

Climate change, Energy, Materials, Education, Transportation, Space …. examples like these abound in each of those areas. Things that we accomplished crudely with dumb machinery will be done with more finesse with intelligent machines, and that will be important for humanity to thrive at scale.

Balancing the risks…

Of course the title is a play on the Yudkowsky and Soares book “If anyone builds it, everyone dies”. While I disagree with many things that the book asserts, their work has brought attention to the important problem of AI safety. Smart people working on AI safety is a good thing. It is important to continue that work, even if the specific x-risk scenarios in the book can be taken apart. In the midst of all the talk about the risks of AGI it is important to realize that not building AGI has risks as well.

Weinan Sun @sunw37

6 months ago

You should interview @dileeplearning. Can’t think of anyone else who can connect between neuroscience and AI as well as he can. From co-founding Numenta and Vicarious to getting acquired by Alphabet, he had an interesting journey building neuroscience inspired AI. Our recent Nature paper was heavily influenced by his work.

179

sunw37 retweeted

Machine Learning Street Talk

@MLStreetTalk

6 months ago

Wouldn't it be great if neural networks knew a spiral... as a spiral? 🍥

111

11K

Weinan Sun @sunw37

7 months ago

hi! based on my testing on various file formats with different sizes, Kosmos could analyze all kinds of files (see below). Claude 4.5 could deal with some of them, but often limited by upload size limit, and for some files, the reasoning pushes to max token limit. This reading ability could be a major unlock for science as some of these data were buried for more than 10 yrs with software license expired. Analysis is not perfect yet but feels this is a major step to the right direction.

sunw37's tweet photo. hi! based on my testing on various file formats with different sizes, Kosmos could analyze all kinds of files (see below). Claude 4.5 could deal with some of them, but often limited by upload size limit, and for some files, the reasoning pushes to max token limit. This reading ability could be a major unlock for science as some of these data were buried for more than 10 yrs with software license expired. Analysis is not perfect yet but feels this is a major step to the right direction.

Weinan Sun @sunw37

7 months ago

@SGRodriques wow, congrats Sam and your team on this milestone! super impressive. just dumped an old whole-cell recording file into it, and got this beautiful summary: