Andrew Williams @cluelessandrew - Twitter Profile

11 days ago

New paper: We present a "Unified Neural Scaling Law" functional form that accurately models & extrapolates the multivariate scaling behaviors of artificial neural networks as the variables listed in this attached video are varied. (1/N)

10

476

63

408

45K

CluelessAndrew retweeted

Accepted papers at TMLR @TmlrPub

3 months ago

PipelineRL: Faster On-policy Reinforcement Learning for Long Sequence Generation Alexandre Piché, Ehsan Kamalloo, Rafael Pardinas, Xiaoyin Chen, Dzmitry Bahdanau. Action editor: Sebastian Tschiatschek. https://t.co/38fb1LpWCe #pipelinerl #accelerato

0

3

1

569

CluelessAndrew retweeted

Gaurav Sahu

@dem_fier

3 months ago

ever been here?

open overleaf → write a paragraph → "hmm...this needs a citation" → open 15 different tabs → skim 8 abstracts → find the 1 actually relevant paper → format bibtex → paste it back on overleaf

if so, i built a plugin just for you. meet openleaf:
→ reads your paper paragraph by paragraph
→ searches major academic databases
→ filters out irrelevant papers using ai
→ one click to add BibTeX to your .bib

you'll also find the 🤝 friendly and 🔥 fire reviewers there. i don't think i need to tell you what they do :)

free. open source. no account. no data collection. works with ollama, openrouter, openai api and more. https://t.co/XvX03iem38

dear algorithm, please show this to my fellow researchers in need 🙏 #overleaf #latex #opensource #academictwitter

27

815

106

1K

1M

CluelessAndrew retweeted

Andrei Mircea @mirandrom

8 months ago

I gave a talk on LLM zero-sum learning dynamics last week at MSR Montreal. I went over a few things that were not in the paper but that I'm particularly excited about; one of those is the connection between generalization and zero-sum learning. https://t.co/di3iLLytvO

1

35

9

24

5K

Andrew Williams @CluelessAndrew

8 months ago

I have found Mila to be a great place to do a PhD. Feel free to reach out if you have any questions!

Mila - Institut québécois d'IA

@Mila_Quebec

8 months ago

Mila's annual supervision request process is now open to receive MSc and PhD applications for Fall 2026 admission! For more information, visit https://t.co/r01eLcY1P4

3

123

63

68

106K

0

1

0

93

Andrew Williams @CluelessAndrew

12 months ago

Great opportunity for those new to ML research!

NewInML @ NeurIPS 2025 @NewInML

12 months ago

New to ML research? Never published at ICML? Don't miss this! Check out the New in ML workshop at ICML 2025 — no rejections, detailed feedback, awards, and ICML tickets for selected authors. Deadline: June 10 (AoE) Submit: https://t.co/xNiccKTelq Info: https://t.co/1dBY6bnGji

0

27

14

12

2K

0

4

0

1

258

CluelessAndrew retweeted

David Duvenaud

@DavidDuvenaud

over 1 year ago

LLMs have complex joint beliefs about all sorts of quantities. And my postdoc @jamesrequeima visualized them! In this thread we show LLM predictive distributions conditioned on data and free-form text. LLMs pick up on all kinds of subtle and unusual structure: 🧵

30

2K

196

1K

194K

CluelessAndrew retweeted

David Duvenaud

@DavidDuvenaud

over 1 year ago

This is fun because LLMs can condition on free-form side information, and make predictions about anything. This turns qualitative knowledge into quantitative predictions. Here we condition Llama 3 on two datapoints, plus text. Changing the text changes the meaning of the data.

3

167

16

48

11K

CluelessAndrew retweeted

Perouz Taslakian @PerouzT

over 1 year ago

🚀 We have released our paper on ReTreever! 🌳🔍 ReTreever organizes and represents documents in a binary tree across various granular levels, balancing cost & utility while enhancing retrieval transparency. 📜 Read it here: https://t.co/4VlePz5e1K #AI @ServiceNowRSRCH 🧵👇

1

31

20

3

4K

CluelessAndrew retweeted

Ahmed Masry @Ahmed_Masry97

over 1 year ago

Happy to announce AlignVLM📏: a novel approach to bridging vision and language latent spaces for multimodal understanding in VLMs! 🌍📄🖼️ 🔗 Read the paper: https://t.co/czaL8NrlZL 🧵👇 Thread

2

211

55

92

23K

CluelessAndrew retweeted

gian

@giansegato

over 1 year ago

this paper is kinda wild. turns out that if you simply ask an LLM to straight out predict a timeseries like this:

```
<history>
(t1, v1) (t2, v2) (t3, v3)
</history>
<forecast>
(t4, v4) (t5, v5)
</forecast>
```

making sure to prepend the prompt like this:

```
Here is some context about the task. Make sure to factor in any background knowledge, satisfy any constraints, and respect any scenarios.
<context>
((context))
</context>
```

it will just… do it? beating SOTA timeseries forecasting?!

llama 3.1 405b directly prompted is more precise at forecasting real-world series than:
- stats-based timeseries models (ARIMA, ETS)
- foundation models specifically trained for time series (eg. chronos)
- multimodal forecasting models (eg, time-LLM)

peak 'bitter lesson' behavior lol
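
For readers who want to try the prompt layout quoted above, here is a minimal sketch of how it might be assembled and how a completion might be parsed. The function names `build_forecast_prompt` and `parse_forecast` are hypothetical, and ending the prompt on an open `<forecast>` tag so the model fills in the pairs is an assumption, not something the tweet or the paper confirms. No model call is shown.

```python
import re

# Preamble text copied from the tweet's second code block.
CONTEXT_PREAMBLE = (
    "Here is some context about the task. Make sure to factor in any "
    "background knowledge, satisfy any constraints, and respect any scenarios."
)


def build_forecast_prompt(context, history):
    """Build the context block, then the history block, then an open <forecast> tag.

    `history` is a sequence of (timestamp, value) pairs.
    Leaving <forecast> open for the model to complete is an assumption.
    """
    history_line = " ".join(f"({t}, {v})" for t, v in history)
    return (
        f"{CONTEXT_PREAMBLE}\n"
        f"<context>\n{context}\n</context>\n"
        f"<history>\n{history_line}\n</history>\n"
        "<forecast>\n"
    )


def parse_forecast(completion):
    """Extract (timestamp, value) pairs from a completion, ignoring text after </forecast>."""
    body = completion.split("</forecast>")[0]
    pairs = re.findall(r"\(\s*([^,()]+?)\s*,\s*([^,()]+?)\s*\)", body)
    return [(t, float(v)) for t, v in pairs]
```

The returned string would be sent to whichever LLM endpoint is in use, and the raw completion passed to `parse_forecast`.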

48

2K

195

3K

248K

CluelessAndrew retweeted

Andrei Mircea @mirandrom

over 1 year ago

📢 New paper “Language model scaling laws and zero-sum learning” @scifordl #NeurIPS2024 ℹ️https://t.co/abMHR2C75M TL;DR: scaling improves LMs by mitigating zero-sum learning, a mechanism that could be targeted directly and independent of scale. W205 @ 4:30pm (1/12)🧵

2

54

17

32

8K

CluelessAndrew retweeted

Arjun Ashok @arjunashok37

over 1 year ago

Starting the workshop on Time Series in the Age of Large Models (TSALM) at #NeurIPS2024 with @tomaspfister's invited talk on Multimodal Time Series Modeling!

1

24

3

0

993

CluelessAndrew retweeted

Tianyu Zhang @tianyu_zh

over 1 year ago

🚀 Excited to present our work on VCR: Visual Caption Restoration – the 1st and unique VLM benchmark testing if VLMs can focus on tiny but crucial details! 📍 Join us at #NeurIPS24 on Sunday, Dec 15, West Ballroom B 🛠️ Dive into the details: https://t.co/GRUljO3TFg

0

5

2

0

359

CluelessAndrew retweeted

Tianyu Zhang @tianyu_zh

over 1 year ago

🚀A game-changer for open-access multimodal AI! BigDocs is paving the way for transparent, accountable, and innovative document reasoning and code generation. Check it out! 💡👏

1

8

4

1

700

CluelessAndrew retweeted

Joan Rodriguez

@joanrod_ai

over 1 year ago

🎉 Excited to introduce BigDocs!
An open, transparent multimodal dataset designed for:

📄 Documents
🌐 Web content
🖥️ GUI understanding
👨‍💻 Code generation from images

We’re also launching BigDocs-Bench, featuring 10 tasks to test models on:

➡️ Document, Web, GUI Visual reasoning
➡️ Converting images into JSON, Markdown, LaTeX, SVG, and more!

📜 Paper: https://t.co/NsXtsIennh https://t.co/EXhEFnQ622
🌍 Website https://t.co/kA9c6JH7L8

2

95

41

31

23K

CluelessAndrew retweeted

Joseph Suarez 🐡

@jsuarez

over 1 year ago

https://t.co/Ctyo1nVzJO

10

316

25

180

40K

CluelessAndrew retweeted

Benjamin Thérien @ MLSys 2026

@benjamintherien

over 1 year ago

Learned optimizers can’t generalize to large unseen tasks… Until now! Excited to present μLO: Compute-Efficient Meta-Generalization of Learned Optimizers! Don’t miss my talk about it next Sunday at the OPT2024 NeurIPS Workshop :) 🧵https://t.co/UiCr4EQ5s9 1/N

2

113

32

56

13K

CluelessAndrew retweeted

Will Bryk

@WilliamBryk

over 1 year ago

Spent the weekend hacking together Exa embeddings over 4500 NeurIPS 2024 papers - https://t.co/gazgno2hfk

Lets you:
- do otherwise impossible searches ("transformer architectures inspired by neuroscience")
- explore a 2D t-SNE plot
- chat with Claude about multiple papers

27

665

78

520

110K

CluelessAndrew retweeted

Perouz Taslakian @PerouzT

over 1 year ago

🌟🌟🌟 We just released BigDocs: An Open Multimodal Dataset — our latest work on scaling document understanding across diverse data types! 📄 👉 Dive into the details: https://t.co/KfOKZKARDS 🧠 or come see us at the #NeurIPS2024 RBFM workshop! #AI @ServiceNowRSRCH #bigdocs

0

17

15

3

1K
