Tyler LaBonte

@tmlabonte

ML PhD student @GeorgiaTech, Math BS @USC. Deep learning theory, generalization, robustness.

Atlanta, GA

Joined December 2019

699 Following

896 Followers

914 Posts

Pinned Tweet

Tyler LaBonte @tmlabonte

about 1 year ago

Excited to present at the first #AISTATS2025 poster session on May 3! Ever wondered how LLMs can generalize to new tasks in-context despite only training on token completion? We formalize this phenomenon as "task shift" and investigate a linear version: https://t.co/RoYippuZVi

tmlabonte's tweet photo. Excited to present at the first #AISTATS2025 poster session on May 3!

Ever wondered how LLMs can generalize to new tasks in-context despite only training on token completion? We formalize this phenomenon as "task shift" and investigate a linear version: https://t.co/RoYippuZVi https://t.co/tIQgGU9uJ1

1

23

2

4

3K

Tyler LaBonte @tmlabonte

13 days ago

@guilhermeotina @TmlrOrg Yep! Most methods which infer group structure use properties of the model representation; we argue feature learning is key for understanding & interpreting them (e.g. Izmailov 2022). Our TMLR paper is more specific to unbalanced LLR, but feature learning papers are in the works!

0

0

0

0

43

Tyler LaBonte @tmlabonte

13 days ago

The updated version of this paper has been accepted at @TmlrOrg 🚨🚀 Very excited about implications of our results for SOTA robustness algorithms & understanding spurious correlations more generally. Journal version link: https://t.co/XkLMSbw5ua

Tyler LaBonte @tmlabonte

about 1 year ago

Heading to #ICLR2025 to present our SCSL workshop paper on understanding how last-layer retraining methods mitigate spurious correlations! https://t.co/6kQ1HG0WVI Stop by on Monday, April 28 to chat and learn more 🙂

tmlabonte's tweet photo. Heading to #ICLR2025 to present our SCSL workshop paper on understanding how last-layer retraining methods mitigate spurious correlations! https://t.co/6kQ1HG0WVI

Stop by on Monday, April 28 to chat and learn more 🙂 https://t.co/6CYVugeZEs

1

29

3

7

5K

1

11

3

3

2K

Tyler LaBonte @tmlabonte

about 1 month ago

@matheusmaldaner @iStaridium @LAHacks Go to Tacos 1986!

0

1

0

0

50

Who to follow

Verified account

Assistant Professor MIT @medialab @MITEECS @nlp_mit || Foundations of self-evolving multisensory AI to enhance the human experience.

Bodhisattwa Majumder

Verified account

I lead AI x (Data-driven) Discovery @allen_ai. 🧬 Agents + Search. @AdobeResearch Fellow. Prev Google, MSR, Meta. PhD @ucsd_cse.

Verified account

Associate Professor CS/stats UC Berkeley. Former Research Scientist at Google DeepMind. ML/AI Researcher working on LLMs and deep learning. PhD at Stanford.

Tyler LaBonte @tmlabonte

about 2 months ago

@matheusmaldaner @NSF Congrats!!!

0

1

0

0

37

Tyler LaBonte @tmlabonte

2 months ago

@etash_guha Killing it Etash!

1

1

0

0

101

Tyler LaBonte @tmlabonte

3 months ago

Cramér on Lindeberg: "When he was reproached for not being sufficiently active in his scientific work, he said 'Well, I am really a farmer.' And if somebody happened to say that his farm was not properly cultivated, his answer was 'Of course my real job is to be a professor.'"

0

2

0

0

259

Tyler LaBonte @tmlabonte

3 months ago

@iamwaynechi Can't wait for more games in various shades of red! ("rougelikes"... ok I'll see myself out)

1

1

0

0

67

tmlabonte retweeted

Microsoft Research

3 months ago

Multimodal reasoning with Phi-4-reasoning-vision, new work on scaling LLM inference, benchmarking AI agents in network operations, cinematic video generation, adaptive evaluation for LLMs, and using AI to improve individual and population health. https://t.co/9Y0SyTlG5W

3

32

10

8

12K

Tyler LaBonte @tmlabonte

3 months ago

Our Phi-4-reasoning-vision-15B technical report is now available on arxiv: https://t.co/6ZPE7J6kz4

0

5

1

0

408

Tyler LaBonte @tmlabonte

3 months ago

Some nice coverage on our new model release, highlighting our hybrid approach to multimodal reasoning 🚀

3 months ago

Microsoft built Phi-4-reasoning-vision-15B to know when to think — and when thinking is a waste of time https://t.co/exMXVFEss6

0

9

3

4

2K

0

3

0

0

408

Tyler LaBonte @tmlabonte

3 months ago

It's been the privilege of my career to help build the newest Phi series model from @MSFTResearch! Phi-4-reasoning-vision-15B is open-weight & competitive on perf with 10X less compute/tokens. Read the blog for math and CUA case studies, hybrid reasoning, data insights, & more!

tmlabonte's tweet photo. It's been the privilege of my career to help build the newest Phi series model from @MSFTResearch!

Phi-4-reasoning-vision-15B is open-weight & competitive on perf with 10X less compute/tokens.

Read the blog for math and CUA case studies, hybrid reasoning, data insights, & more! https://t.co/34hDOufGzE

Microsoft Research

3 months ago

Vision-language models improve multimodal systems, but can make them slower, costlier, and harder to deploy. Learn how Phi-4-reasoning-vision-15B, a compact and fast multimodal reasoning model, blends strengths of different methods while reducing their limits: https://t.co/jP5L3AXRzX

MSFTResearch's tweet photo. Vision-language models improve multimodal systems, but can make them slower, costlier, and harder to deploy. Learn how Phi-4-reasoning-vision-15B, a compact and fast multimodal reasoning model, blends strengths of different methods while reducing their limits: https://t.co/jP5L3AXRzX

1

66

16

26

18K

0

10

0

2

887

Tyler LaBonte @tmlabonte

4 months ago

@bneyshabur Best of luck, Behnam! Looking forward to what comes next!

0

1

0

0

801

Tyler LaBonte @tmlabonte

5 months ago

Finally, thanks to @Kangwook_Lee's "Tenure Track Simulator" post for inspiring me to make the game public and write this up!

0

1

0

0

92

Tyler LaBonte @tmlabonte

5 months ago

Over the holidays, I stress-tested the AI coding hype by doing something concrete: I built a college football simulator game from scratch to see if agents actually deliver. Here’s what I learned 👇

tmlabonte's tweet photo. Over the holidays, I stress-tested the AI coding hype by doing something concrete: I built a college football simulator game from scratch to see if agents actually deliver. Here’s what I learned 👇 https://t.co/QnJIqqRezU

2

1

0

0

185

Tyler LaBonte @tmlabonte

5 months ago

Misc takeaways: • Copilot + GitHub was far more useful than I expected • Keeping code style consistent across humans + agents is painful • Overall: Claude was best for agentic coding; Gemini best for interactive pair-programming

2

0

0

0

134

Last Seen Users on Sotwe

Trends for you

Most Popular Users