Tom Rainforth

@tom_rainforth

Associate Professor in Machine Learning at the University of Oxford, Head of RainML Research Lab (

Oxford, England

Joined November 2016

293 Following

5.8K Followers

372 Posts

Pinned Tweet

Tom Rainforth @tom_rainforth

over 3 years ago

We've got really good at utilizing data. But methods for acquiring that data are often still rudimentary. Our new review paper shows how Bayesian experimental design has recently transformed to now provide a powerful mechanism to acquire data intelligently https://t.co/cAdxp3jI3b

tom_rainforth's tweet photo. We've got really good at utilizing data. But methods for acquiring that data are often still rudimentary. Our new review paper shows how Bayesian experimental design has recently transformed to now provide a powerful mechanism to acquire data intelligently https://t.co/cAdxp3jI3b https://t.co/ELtolAyx1J

107

15K

tom_rainforth retweeted

Freddie Bickford Smith @fbickfordsmith

6 months ago

Active testing enables label-efficient model evals but can be computationally expensive. We show how to reduce costs and scale up to LLMs. https://t.co/rXkpQrJ7DY Work led by Gabrielle Berrada. Find her at EurIPS, or @janundnik and me at NeurIPS in San Diego.

fbickfordsmith's tweet photo. Active testing enables label-efficient model evals but can be computationally expensive.

We show how to reduce costs and scale up to LLMs.

https://t.co/rXkpQrJ7DY

Work led by Gabrielle Berrada. Find her at EurIPS, or @janundnik and me at NeurIPS in San Diego. https://t.co/qDQrM3bCaT

tom_rainforth retweeted

Jackson Atkins

@JacksonAtkinsX

9 months ago

Apple and Oxford just made AI 6.5x better at problem-solving. The secret: it teaches AI agents to ask perfect questions. This rockets success rates from 14% to 91%. No need for fine-tuning or retraining. It runs on current models. Here's how it works: It's a strategic loop designed for multi-turn conversations. At every step, the agent works to find the shortest path to the right answer. Hypothesize: The agent creates an internal list of all possible solutions to the problem. Score Questions: It simulates asking various questions and scores each one on "Expected Information Gain" (EIG). This number represents how much a question is mathematically likely to shrink the list of possibilities. Ask the Best Question: It asks the user only the single, highest-scoring question. Update & Repeat: Based on the answer, it filters its list of hypotheses, getting smarter with each interaction, and then begins the loop again for the next turn. Why this matters for your AI strategy: This marks a shift from building passive "oracles" to proactive, question-asking agents Business Leaders: A 6.5x multiplier on task success is a lever for efficiency. This translates to fewer failed customer interactions, faster diagnostics, and more accurate personalization, a clear ROI on smarter AI. Practitioners: This is a deployment-time framework, not a new model. You can build this agent on top of existing LLMs today. It provides a principled way to overcome common multi-turn issues like inconsistency and context loss without fine-tuning or retraining. Researchers: This paper is a victory for information theory. It proves that a full EIG calculation is superior to heuristics like predictive entropy. It sets a new standard for how to build intelligent information-seeking agents.

JacksonAtkinsX's tweet photo. Apple and Oxford just made AI 6.5x better at problem-solving.

The secret: it teaches AI agents to ask perfect questions. This rockets success rates from 14% to 91%.

No need for fine-tuning or retraining. It runs on current models.

Here's how it works:

It's a strategic loop designed for multi-turn conversations. At every step, the agent works to find the shortest path to the right answer.

Hypothesize: The agent creates an internal list of all possible solutions to the problem.

Score Questions: It simulates asking various questions and scores each one on "Expected Information Gain" (EIG). This number represents how much a question is mathematically likely to shrink the list of possibilities.

Ask the Best Question: It asks the user only the single, highest-scoring question.

Update & Repeat: Based on the answer, it filters its list of hypotheses, getting smarter with each interaction, and then begins the loop again for the next turn.

Why this matters for your AI strategy:
This marks a shift from building passive "oracles" to proactive, question-asking agents

Business Leaders: A 6.5x multiplier on task success is a lever for efficiency. This translates to fewer failed customer interactions, faster diagnostics, and more accurate personalization, a clear ROI on smarter AI.

Practitioners: This is a deployment-time framework, not a new model. You can build this agent on top of existing LLMs today. It provides a principled way to overcome common multi-turn issues like inconsistency and context loss without fine-tuning or retraining.

Researchers: This paper is a victory for information theory. It proves that a full EIG calculation is superior to heuristics like predictive entropy. It sets a new standard for how to build intelligent information-seeking agents.

827

136

89K

Tom Rainforth @tom_rainforth

10 months ago

I have an opening for a 2-year postdoc in probabilistic machine learning and/or experimental design. The application deadline is the 3rd of September. See here for details and how to apply: https://t.co/ht9n9cEviw

Who to follow

Shimon Whiteson

@shimon8282

Research Director at Google DeepMind | Professor of Computer Science at Oxford.

Brandon Amos

@brandondamos

🧙 RL @Reflection_AI past: @MetaAi @GoogleDeepmind @SCSatCMU @Cornell_Tech

Jakob Foerster

@j_foerst

Associate Prof in ML @UniofOxford. Something Something Research Scientist @MetaAI. Something @FLAIR_Ox. Always #teamhuman. Opinions belong to the world.

Tom Rainforth @tom_rainforth

almost 2 years ago

@BlackHC @AdaptiveAgents @AndreyMalinin @kschweig_ @aichberger @HochreiterSepp @roydanroy @fbickfordsmith @WmLisa Given models are never really perfectly specified, this is very important in practice.

169

Tom Rainforth @tom_rainforth

almost 2 years ago

@BlackHC @AdaptiveAgents @AndreyMalinin @kschweig_ @aichberger @HochreiterSepp @roydanroy @fbickfordsmith @WmLisa 2) it can be less stable compared to things like stacking (see Eg https://t.co/3CvcoOXdwV). Even if the model is well specified, you can get big variability in predictions with the exact data you happen to see

358

Tom Rainforth @tom_rainforth

almost 2 years ago

@BlackHC @AdaptiveAgents @AndreyMalinin @kschweig_ @aichberger @HochreiterSepp @roydanroy @fbickfordsmith @WmLisa 3) it doesn't widen your model class. This is sort of the same as 1), but there also is a general principle that a combination of hypotheses is more powerful than one, and BMA will always collapse to one. There are methods of "Bayesian model combination" that do this.

170

Tom Rainforth @tom_rainforth

almost 2 years ago

@BlackHC @AdaptiveAgents @AndreyMalinin @kschweig_ @aichberger @HochreiterSepp @roydanroy @fbickfordsmith @WmLisa 1) the optimality goes away when the model is misspecified (Eg Bayesian decision trees are usually significantly inferior to random forests)

190

Tom Rainforth @tom_rainforth

almost 2 years ago

I have an opening for a 2.5-year postdoc position in the RainML lab as part of my ERC grant on probabilistic machine learning and intelligent data acquisition. Application deadline 10th July 2024. See here for details and to apply: https://t.co/BWlBYBHMEv

11K

Tom Rainforth @tom_rainforth

almost 2 years ago

I'm delighted to announce that from September I will officially be an Associate Professor (remaining at the Oxford stats department)

169

16K

Tom Rainforth @tom_rainforth

about 2 years ago

All credit goes to my fantastic coauthors @nmstatistics and @yeewhye

588

Tom Rainforth @tom_rainforth

about 2 years ago

Our new #ICLR2024 paper shows how LLMs can successfully check their own change of thought reasoning without any fine-tuning or even examples, using an approach we call SelfCheck. Join me at poster 125 this afternoon to learn more Paper: https://t.co/47MXrN4xDi

tom_rainforth's tweet photo. Our new #ICLR2024 paper shows how LLMs can successfully check their own change of thought reasoning without any fine-tuning or even examples, using an approach we call SelfCheck.

Join me at poster 125 this afternoon to learn more

Paper: https://t.co/47MXrN4xDi https://t.co/IdcQeylyjb

Tom Rainforth @tom_rainforth

about 2 years ago

In-context learning can learn novel input-output relationships beyond what can be picked up from input context alone, but doesn't behave like conventional learning algorithm. Find out more at our ICLR poster #129 this afternoon. Paper: https://t.co/NoJWC3Ws9J, led by @janundnik

tom_rainforth retweeted

Jannik Kossen @janundnik

about 2 years ago

Are you at ICLR? Have you heard that In-Context Learning in LLMs does not learn label relationships? Well that's not true. Visit our poster TODAY to find out how LLMs incorporate label information. Spoiler: it's not Bayesian inference. Poster #129, May 7, 4.30 pm

janundnik's tweet photo. Are you at ICLR?

Have you heard that In-Context Learning in LLMs does not learn label relationships?

Well that's not true.

Visit our poster TODAY to find out how LLMs incorporate label information.

Spoiler: it's not Bayesian inference.

Poster #129, May 7, 4.30 pm https://t.co/82jdcsU54r

tom_rainforth retweeted

Tim Reichelt @TimReichelt3

about 2 years ago

I will be presenting our work on "Beyond Bayesian Model Averaging over Paths in Probabilistic Programs with Stochastic Support" at AISTATS in Valencia tomorrow (details in thread below). If you are interested in probabilistic programming, come and say hi at poster session 1!

tom_rainforth retweeted

Freddie Bickford Smith @fbickfordsmith

about 2 years ago

The current default recipe for Bayesian active learning doesn’t really work beyond MNIST scale. We suggest why that is and identify a simple fix. https://t.co/TgmxX2RonT @aistats_conf with @adamefoster @tom_rainforth 1/5

Tom Rainforth @tom_rainforth

over 2 years ago

Link for author author guidelines: https://t.co/NpnlTkWQYI

467

Tom Rainforth @tom_rainforth

over 2 years ago

We are delighted to announce an ACM-TOPML special issue on "Probabilistic Programming". Please see the attached call for papers for details

tom_rainforth's tweet photo. We are delighted to announce an ACM-TOPML special issue on "Probabilistic Programming". Please see the attached call for papers for details https://t.co/HNCxzMTnqK

Tom Rainforth @tom_rainforth

over 2 years ago

@haus_cole We did this as a pretty direct follow up: https://t.co/8CiZiM48hq. I think unfortunately the reality is that disentanglement is not generally viable without either strong inductive biases or some degree of supervision

215

tom_rainforth retweeted

Christian Weilbach [email protected] @wh1lo

over 2 years ago

It is @NeurIPS time again! I am excited to present our trans-dimensional jump diffusion work with @AndrewC_ML @willarvey @ValentinDeBort1 @tom_rainforth and @ArnaudDoucet1 ! Come over on Thursday 2nd poster session, https://t.co/sHFojNAZXp. https://t.co/MR1SnLV6k2 #NeurIPS2023

Tom Rainforth

@tom_rainforth

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users