Mark J. Webb

@markjwebb

Climate scientist working on cloud feedbacks, climate sensitivity, hydrological sensitivity and machine learning.

Exeter, U.K.

Joined March 2009

2.2K Following

755 Followers

562 Posts

markjwebb retweeted

Google DeepMind @GoogleDeepMind

about 1 year ago

We also applied AlphaEvolve to over 50 open problems in analysis ✍️, geometry 📐, combinatorics ➕ and number theory 🔂, including the kissing number problem. 🔵 In 75% of cases, it rediscovered the best solution known so far. 🔵 In 20% of cases, it improved upon the previously best known solutions, thus yielding new discoveries.

740

106

145

96K

markjwebb retweeted

Google DeepMind @GoogleDeepMind

about 1 year ago

Human generated data has fueled incredible AI progress, but what comes next? 📈 On the latest episode of our podcast, @FryRsquared and David Silver, VP of Reinforcement Learning, talk about how we could move from the era of relying on human data to one where AI could learn for itself. Watch now → 00:00 Introduction 01:50 Era of experience 03:45 AlphaZero 10:19 Move 37 15:20 Reinforcement learning and human feedback 24:30 AlphaProof 29:50 Math Olympiads 35:00 Experience based methods 42:56 Hannah's reflections 44:00 Fan Hui joins

265

825

226K

markjwebb retweeted

Richard Sutton

@RichardSSutton

over 1 year ago

The original RL algorithms, inspired by natural learning, were online and incremental—they were streaming in the sense that they learned from each increment of experience as it happened, then discarded it, never to be processed again. The streaming algorithms were simple and elegant, but the first big successes of RL in deep learning were not with streaming algorithms. Instead, methods such as DQN chopped the stream of experience into individual transitions, then stored and sampled them in arbitrary batches. Subsequent work followed, extended, and refined the batch approach into asynchronous and offline RL, while the streaming approach languished, unable to produce good results in popular deep learning domains. Until now. Now researchers at the University of Alberta have shown that streaming RL algorithms can work just as well as DQN on Atari and Mujoco tasks (https://t.co/S4D6lSvdxz). How did they do it? Mostly just by getting signal normalization and step-size bounding right for the streaming case—otherwise they use standard streaming algorithms like TD(lambda) and Q(lambda). To me it looks like they were simply the first researchers knowledgeable of streaming RL algorithms to seriously address deep RL without being over-influenced by batch-oriented software and batch-oriented supervised-learning ways of thinking.

225

884

129K

Mark J. Webb @markjwebb

over 1 year ago

@DrMichaelByrne @univofstandrews @EarthSciStA @UStAResearch Congratulations Mike! Fantastic.

Who to follow

Nick Lutsko

@Nick_Lutsko

Associate professor of climate at SIO/UCSD. More at https://t.co/KeZsiQ8Imn and https://t.co/F45Kkpzokc

Kyle Armour

@karmour_uw

Climate dynamics, atmospheric science, and oceanography.

Tim Osborn

@TimOsbornClim

Climate variability and change | Professor of Climate Science & Director of Climatic Research Unit | @ueaenv | UEA | Views expressed here are my own, not UEA's

Mark J. Webb @markjwebb

over 1 year ago

@PauloCeppi @LeverhulmeTrust @ImperialSci @ImperialPhysics @Grantham_IC Fantastic news. Congratulations Paulo!

markjwebb retweeted

Henry Addison @hja22

over 1 year ago

Excited to announce a pre-print of my first paper as part of my PhD at @BristolUni! We’ve created a generative ML emulator of a UK convection-permitting climate model (CPM). It’s able to produce simulations of high-res precipitation at far lower computational cost.

hja22's tweet photo. Excited to announce a pre-print of my first paper as part of my PhD at @BristolUni! We’ve created a generative ML emulator of a UK convection-permitting climate model (CPM). It’s able to produce simulations of high-res precipitation at far lower computational cost. https://t.co/UlMegy5ve5

894

markjwebb retweeted

Andrej Karpathy

@karpathy

almost 2 years ago

Programming is changing so fast... I'm trying VS Code Cursor + Sonnet 3.5 instead of GitHub Copilot again and I think it's now a net win. Just empirically, over the last few days most of my "programming" is now writing English (prompting and then reviewing and editing the generated diffs), and doing a bit of "half-coding" where you write the first chunk of the code you'd like, maybe comment it a bit so the LLM knows what the plan is, and then tab tab tab through completions. Sometimes you get a 100-line diff to your code that nails it, which could have taken 10+ minutes before. I still don't think I got sufficiently used to all the features. It's a bit like learning to code all over again but I basically can't imagine going back to "unassisted" coding at this point, which was the only possibility just ~3 years ago.

514

18K

11K

markjwebb retweeted

Tim Andrews @Tim_AndrewsUK

almost 2 years ago

Feedbacks, Pattern Effects, and Efficacies in a Large Ensemble of HadGEM3-GC3.1-LL Historical Simulations | Harry Mutton et al. #JGR https://t.co/Hvt2761EyU We have a new paper in #JGR that utilizes a large ensemble (50 member) of HadGEM3-GC3.1-LL historical simulations 🧵[1/n]

Tim_AndrewsUK's tweet photo. Feedbacks, Pattern Effects, and Efficacies in a Large Ensemble of HadGEM3-GC3.1-LL Historical Simulations | Harry Mutton et al. #JGR https://t.co/Hvt2761EyU

We have a new paper in #JGR that utilizes a large ensemble (50 member) of HadGEM3-GC3.1-LL historical simulations

🧵[1/n] https://t.co/uw4g3GXNpx

markjwebb retweeted

Nature Geoscience @NatureGeosci

almost 2 years ago

Article: Inhibition of atmospheric convection by dry air intensifies moist heatwaves, and this process may further increase moist heatwaves under climate warming @Duan_Suqin https://t.co/YQirsuBbBC

markjwebb retweeted

Mike Pritchard

@SciPritchard

almost 2 years ago

🎉New preprint on hybrid physics-ML climate modeling is out! 📣 “Sampling Hybrid Climate Simulation at Scale to Reliably Improve Machine Learning Parameterization” https://t.co/Fy3PYqWVZI Led by Jerry Lin @jlin404 @uciess @UCIPhysSci @LeapStc 🧵

markjwebb retweeted

Xiaolong Wang

@xiaolonw

almost 2 years ago

Cannot believe this finally happened! Over the last 1.5 years, we have been developing a new LLM architecture, with linear complexity and expressive hidden states, for long-context modeling. The following plots show our model trained from Books scale better (from 125M to 1.3B) than Mamba and Transformer, and our 1.3B model works better and better with longer context. The idea, Test-Time Training (TTT), is something we have been working on for over 5 years. I still remember when I started as a postdoc, discussing with Alyosha what to work on, he asked me to talk to Yu Sun on TTT. And that is when everything begins. Although this TTT is quite a different thing now. It is a network layer, replacing the hidden state of an RNN with a machine learning model. Instead of using a feature vector to represent a memory, our TTT layer maintains a small neural network to compress the input tokens. As each token comes in sequentially, the compression is done via gradient descent on this token to update the small neural network. This is currently applied to language modeling, but imagine applying it to videos. In the future, when modeling long videos, instead sampling 1 FPS, we can sample the frames densely, and these dense frames would be a burden for Transformer but a blessing for TTT layer. As they are essentially just augmentations in time to train a better network inside TTT. Please check: https://t.co/XNK1mlxpUQ

xiaolonw's tweet photo. Cannot believe this finally happened! Over the last 1.5 years, we have been developing a new LLM architecture, with linear complexity and expressive hidden states, for long-context modeling. The following plots show our model trained from Books scale better (from 125M to 1.3B) than Mamba and Transformer, and our 1.3B model works better and better with longer context.

The idea, Test-Time Training (TTT), is something we have been working on for over 5 years. I still remember when I started as a postdoc, discussing with Alyosha what to work on, he asked me to talk to Yu Sun on TTT. And that is when everything begins.

Although this TTT is quite a different thing now. It is a network layer, replacing the hidden state of an RNN with a machine learning model. Instead of using a feature vector to represent a memory, our TTT layer maintains a small neural network to compress the input tokens. As each token comes in sequentially, the compression is done via gradient descent on this token to update the small neural network.

This is currently applied to language modeling, but imagine applying it to videos. In the future, when modeling long videos, instead sampling 1 FPS, we can sample the frames densely, and these dense frames would be a burden for Transformer but a blessing for TTT layer. As they are essentially just augmentations in time to train a better network inside TTT.

Please check: https://t.co/XNK1mlxpUQ

262

262K

markjwebb retweeted

irina sandu @irinasandu_ec

almost 2 years ago

Really cool results - i never cease to be amazing how #ML models break new ground every day, accelerating the weather & climate revolution at an unprecedented pace :) Stay tuned for more interesting work in this space @ECMWF with our partners, also in the framework of #DestinE

Mark J. Webb @markjwebb

almost 2 years ago

Convective organisation

103

markjwebb retweeted

Robin Hogan @RobinJHogan

almost 2 years ago

First light from #EarthCARE!!! This is an incredible moment - the first ever global measurements of cloud, snow and rain fall speeds. The reflectivity is beautifully detailed and the fall speeds are clearer than we could possibly have dreamed of! (1/4)

markjwebb retweeted

Dr Doug McNeall @dougmcneall

almost 2 years ago

Hey everyone, I finally got a new role at the @metoffice. I'm scoping what a data-driven climate model might look like. So, if you're into ML and climate, foundation models or something more lightweight, get in touch. It would be great to chat.

markjwebb retweeted

Stephan Hoyer

@shoyer

almost 2 years ago

A big update to ARCO-ERA5 landed this week -- we now have a copy of ERA5 on native vertical levels with regular 0.25° horizontal resolution. This ~6 PB dataset is freely available as part of Google's public datasets program: https://t.co/7MK5ntSHjB

shoyer's tweet photo. A big update to ARCO-ERA5 landed this week -- we now have a copy of ERA5 on native vertical levels with regular 0.25° horizontal resolution.

This ~6 PB dataset is freely available as part of Google's public datasets program:
https://t.co/7MK5ntSHjB https://t.co/fIuUBvwv7V

241

117

78K

markjwebb retweeted

Mike Pritchard

@SciPritchard

almost 2 years ago

Another new preprint from @nvidia Earth-2 research. I love this ML paradigm for data assimilation pinned on skillful unconditional diffusion based state generators guided by obs during inference alone. Grateful to learn from inspiring AI researchers and a very productive intern!

markjwebb retweeted

Richard McElreath 🐈‍⬛ @rlmcelreath

almost 2 years ago

Looking for a mind-growing distraction? How about my 3 part intro to Bayesian causal inference. It's like a condensed version of my book, 10 weeks of causal computation in 3 short blog posts. Take with plenty of water. https://t.co/s5DtK9BfNu

698

108

737

71K

markjwebb retweeted

Stephan Hoyer

@shoyer

almost 2 years ago

The WeatherBench2 paper has been published in JAMES! https://t.co/xubQbVV9Eb Big congrats to @raspstephan, who did all the heavy lifting on this project.

markjwebb retweeted

Mat Collins @mat_collins

almost 2 years ago

New paper “Possible Shift in Controls of the Tropical Pacific Surface Warming Pattern” by Watanabe et al. - @Nature https://t.co/jnt40LNoLm

mat_collins's tweet photo. New paper “Possible Shift in Controls of the Tropical Pacific Surface Warming Pattern” by Watanabe et al. - @Nature https://t.co/jnt40LNoLm https://t.co/ubOmmF6hSu

107

17K

Mark J. Webb

@markjwebb

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users