Anshuk Uppal

@sigmabayesian

PhD student @DTUtweet. Probabilistic ML 🧠 diffusion and sampling🧠. previously intern @MSFTResearch @SonyAI_global, visitor @NYU_Courant.

Copenhagen, Denmark

Joined March 2012

1.2K Following

293 Followers

459 Posts

sigmabayesian retweeted

Elon Litman

@elon_lit

4 months ago

Fun fact: your transformer's attention weights are the unique solution (transport plan) to an optimal transport problem regularized by entropy. https://t.co/BhZGDKWoJd

111

205K
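One common reading of the retweeted claim about attention weights, offered here as a sketch rather than as the formulation in the linked photo: for a single query, the softmax weights are the unique maximizer of the inner product with the scores plus Shannon entropy over the probability simplex (a transport problem with only the row-sum constraint). The scores below are hypothetical, and the check against random simplex points is a numerical sanity test, not a proof.

```python
import math
import random

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def entropic_objective(p, scores):
    """<p, scores> + H(p), using the convention 0 * log 0 = 0."""
    linear = sum(pi * si for pi, si in zip(p, scores))
    entropy = -sum(pi * math.log(pi) for pi in p if pi > 0.0)
    return linear + entropy

def random_simplex_point(n, rng):
    """Draw a random probability vector by normalizing exponential variates."""
    draws = [rng.expovariate(1.0) for _ in range(n)]
    total = sum(draws)
    return [d / total for d in draws]

# Hypothetical scaled query-key scores for one query against four keys.
scores = [0.5, -1.0, 2.0, 0.1]
attn = softmax(scores)
best = entropic_objective(attn, scores)

# Compare against many other candidate weight vectors on the simplex.
rng = random.Random(0)
others = [
    entropic_objective(random_simplex_point(len(scores), rng), scores)
    for _ in range(1000)
]

# At the softmax, the objective equals log-sum-exp of the scores.
log_sum_exp = math.log(sum(math.exp(s) for s in scores))
```

For any other weight vector q, the objective equals log-sum-exp minus KL(q || softmax), which is why no candidate in `others` can exceed `best`.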

Anshuk Uppal @sigmabayesian

about 1 month ago

@JCJesseLai @marikgoldstein This is like 2 of my worlds merging lol. Hope you had fun in NYC!

Anshuk Uppal @sigmabayesian

about 2 months ago

@torchcompiled Can't we always get an equivalent x0? In other words, as long as there's a predicted eps, using the forward/noising process eqn, we can get a predicted x0.
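The conversion described in the reply above can be sketched as follows, assuming the usual variance-preserving forward process x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps. The function name `x0_from_eps` and the value of `alpha_bar_t` are hypothetical, and the true noise stands in for a model's eps prediction.

```python
import math
import random

def x0_from_eps(x_t, eps_pred, alpha_bar_t):
    """Solve x_t = sqrt(a) * x0 + sqrt(1 - a) * eps for x0, elementwise."""
    scale = math.sqrt(alpha_bar_t)
    sigma = math.sqrt(1.0 - alpha_bar_t)
    return [(xt - sigma * e) / scale for xt, e in zip(x_t, eps_pred)]

rng = random.Random(0)
alpha_bar_t = 0.3  # hypothetical cumulative signal level at step t

# Toy clean sample and noise; a real model would supply eps_pred instead.
x0 = [rng.gauss(0.0, 1.0) for _ in range(5)]
eps = [rng.gauss(0.0, 1.0) for _ in range(5)]

# Apply the assumed forward/noising equation.
x_t = [
    math.sqrt(alpha_bar_t) * a + math.sqrt(1.0 - alpha_bar_t) * b
    for a, b in zip(x0, eps)
]

# With an exact eps, the clean sample is recovered exactly.
x0_hat = x0_from_eps(x_t, eps, alpha_bar_t)
```

The division by sqrt(alpha_bar_t) grows large as alpha_bar_t approaches 0, so any error in the predicted eps is amplified at high noise levels.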

sigmabayesian retweeted

Nathan Lambert

@natolambert

about 2 months ago

Excited to launch the accompanying free RLHF Course for my book. To kick it off, I've released:

- Welcome video
- Lecture 1: Overview of RLHF & Post-training
- Lecture 2: IFT, Reward Models, Rejection Sampling
- Lecture 3: RL Math
- Lecture 4: RL Implementation

I'm going to add question & answer videos throughout the lecture to go deeper on topics that need it, and potentially cover some topics that are too recent and in flux to go in print. I expect 10-15 videos in total over the next few months.

At the same time, development around the code for the book is picking up. It's a great time to build the foundation for post-training methods.

YT playlist and course landing page below.

236

190K

sigmabayesian retweeted

Nicholas Boffi

@nmboffi

2 months ago

🤯 big update to our flow map language models paper! we believe this is the future of non-autoregressive text generation.

read about it in the blog: https://t.co/DfBXrYmJc8
full details in the paper: https://t.co/coiNXj4ucC

we introduce a new class of continuous flow-based language models and distill them into their corresponding flow map for one-step text generation. we beat all discrete diffusion baselines at ~8x speed!

v2 gives a complete theory of the flow map over discrete data, with three equivalent ways to learn it (semigroup, lagrangian, eulerian). it turns out you can train these with cross-entropy objectives that look very similar to standard discrete diffusion, but without the factorization error that kills discrete methods at few steps.

beyond improving results across the board, we showcase properties that are unique to continuous flows. in particular, inference-time steering and guidance become straightforward. autoguidance brings generative perplexity down to 51.6 on LM1B, while discrete baselines completely collapse at the same guidance scale.

we also show reward-guided generation for steering topic, sentiment, grammaticality, and safety at inference time, and it works even at 1-2 steps with our flow map model.

simple, well-understood techniques from continuous flows just work incredibly well in practice for language. we’re extremely excited about the future of this class of models. stay tuned for results on scaling, reasoning, and reinforcement learning-based fine-tuning. 🚀

478

332

76K

sigmabayesian retweeted

Sophia Tang @_sophia_tang_

3 months ago

New tutorial paper on the “Foundations of Schrödinger Bridges for Generative Modeling” is out on arXiv! 🧩

📖 arXiv: https://t.co/ce4feGdXZT
🔮 Project Website: https://t.co/dyNr5TRijq

With 220 pages and 24 figures, this guide builds the theoretical foundations of Schrödinger bridges from the ground up, unifying the broad field of generative modeling with a single guiding principle: construct an optimal stochastic bridge between distributions while minimizing deviation from a reference process.

The rapid progress in generative modeling has made the field increasingly difficult to navigate from a foundational perspective, which motivated me to develop a resource that builds the core concepts needed to understand and contribute to new advances.

This guide contains intuitive explanations and step-by-step proofs covering:

🧩 The dynamic Schrödinger bridge formulation, lifting optimal transport to continuous-time stochastic processes between distributions, with direct connections to diffusion models, score-based methods, and flow matching.
🧩 A comprehensive toolkit for constructing Schrödinger bridges from first principles, describing stochastic optimal control, forward–backward SDEs, Doob’s h-transform, and Markov and reciprocal projections.
🧩 Extensions to complex and real-world problem settings, including the multi-marginal, unbalanced, discrete SB problems, highlighting the flexibility of the Schrödinger bridge framework in describing complex dynamical systems.
🧩 Practical, scalable algorithms for training and inference of dynamic Schrödinger bridges across modern generative modeling tasks.

More details in the thread 👇🏻

884

146

751

45K

Anshuk Uppal @sigmabayesian

3 months ago

@wgrathwohl 🤩🤐🤐

sigmabayesian retweeted

Ronen Tamari @rtk254

over 1 year ago

Video models != world models

"We find that across a range of current models (Sora, Runway, Pika, Lumiere, Stable Video Diffusion, and VideoPoet), physical understanding is severely limited, and unrelated to visual realism" https://t.co/BOaHfyd4k1

885

120

472

176K

sigmabayesian retweeted

Gautam Kamath @thegautamkamath

3 months ago

Fantastic post by Colin Raffel, "We Are Over-Indexing on Paper Acceptance," drafted in May 2021 (!) but only posted now. The more things change..

Last sentence: "If you want to judge a researcher’s quality, the only meaningful way is to read their papers and judge for yourself." https://t.co/uA1jgMQ6VS

124

10K

sigmabayesian retweeted

Amir Zamir

@zamir_ar

3 months ago

Swiss AI Visiting PhD Program at EPFL. Deadline Feb 28. "The program provides a fellowship contribution of CHF 2,500 ($3200) per month, access to the Alps supercomputer, and eligibility for a post-visit continuation grant of up to 50k GPU hours. The call is open to PhD students enrolled outside Switzerland, with applications supported by EPFL PIs contributing to the Swiss AI Initiative. The application closes on February 28." https://t.co/ZlB7Mm1SFj @ICepfl @EPFL_AI_Center

284

269

31K

sigmabayesian retweeted

yingzhen @liyzhen2

5 months ago

GeMSS 2026 will come to London from Mar 23 - Mar 27, apply by Jan 23
- imho no.1 research school in Europe on generative models
- 3-day crash course on foundations
- 2-day frontier talks (diffusion, LLM for math, AI4Science, etc.)
@jesfrellsen @pamattei @jmtomczak @bguedj https://t.co/dh2tfoNzUk

Anshuk Uppal @sigmabayesian

5 months ago

@isskoro Learning the structure seems crucial but when training pixel level models wouldn't it also make sense to spend time/model capacity on lower noise levels so as to generate sharper images since we don't have an adversarially trained decoder?

Anshuk Uppal @sigmabayesian

5 months ago

@alec_helbling Can you link your favourite 2-3 papers/ works here?

247

sigmabayesian retweeted

Jiatao Gu@CVPR2026

@thoma_gu

6 months ago

🎁 Christmas gift: let’s learn deep generative models through 101 papers!

Advanced Topics of Deep Generative Models: a hybrid of lectures + student talks, spanning foundations to applications. Today, the slides are public: https://t.co/shWPV5CHUc

Huge thanks to @xxunhuang @YizheZhangNLP @du_yilun--our guest speakers for sharing their latest research and perspectives!

This is also the first full course I’ve ever taught! Super grateful to learn alongside amazing students at Penn, and deeply thankful to our TAs for all their hard work! @TongMutianTMT @tyao923

372

371

28K

sigmabayesian retweeted

Jeff Dean

@JeffDean

6 months ago

Performance Hints

Over the years, my colleague Sanjay Ghemawat and I have done a fair bit of diving into performance tuning of various pieces of code. We wrote an internal Performance Hints document a couple of years ago as a way of identifying some general principles and we've recently published a version of it externally.

We'd love any feedback you might have!

Read the full doc at: https://t.co/jej95g236P

106

14K

sigmabayesian retweeted

aahlad puli @aahladpuli

6 months ago

Heading to neurips soon! I'm on the job market and I work on making AI reliable, with a focus on healthcare. Would love to meet people and hear about opportunities!

332

Anshuk Uppal @sigmabayesian

6 months ago

@liyzhen2 @FrancoisRozet @EurIPSConf Let's do it!

Anshuk Uppal @sigmabayesian

6 months ago

@itsbautistam @StefanABaumann Sounds v reasonable. Though a counter argument I have here is that image distributions live on manifolds where all degrees of freedom can be captured using low dim latents but noise distributions occupy ambient space so compression might be lossy.

sigmabayesian retweeted

yingzhen @liyzhen2

6 months ago

@sedielem OK, I can organise re diffusion circle at EurIPS, and tweet, if you can RT 🙏 then this would reach more people.

536

sigmabayesian retweeted

Arnaud Doucet @ArnaudDoucet1

7 months ago

🔥 WANTED: Student Researcher to join me,@ValentinDeBort1,@thjashin,@liwenliang,@ArthurGretton in DeepMind London. You'll be working on Multimodal Diffusions for science. Apply here https://t.co/owR6KoCQII

321

238

84K
