Alicia Sun

@AliciaSun17

FAIR NYC; Formerly PhD @MIT

New York, USA

Joined April 2014

203 Following

53 Followers

5 Posts

AliciaSun17 retweeted

Dane Malenfant

@dvnxmvl_hdf5

about 1 month ago

🚨Excited to announce our workshop Context Beyond the Window hosted at COLM in SF! 🚨 LLMs have finite context windows, yet real-world tasks demand absorbing, retaining, and acting on information that far exceeds any single prompt. 1/3 We're looking for submissions across: https://t.co/6y1ILeeC9A • Context compression 🧃 — token compaction, recursive subagent calls, and external memory for storing and retrieving information • Efficient architectures 🚀 — sub-quadratic attention variants that make extremely long context computationally feasible • Continual training 🌱 — test-time training on streaming data, context distillation, and knowledge accumulation through continued pre-training • Agentic memory systems 🐘 — scaffolds and test-time scaling techniques that improve knowledge retention and acquisition in LLMs • Evaluation 🎯 — benchmarking models on increasingly long-horizon tasks

dvnxmvl_hdf5's tweet photo. 🚨Excited to announce our workshop Context Beyond the Window hosted at COLM in SF! 🚨

LLMs have finite context windows, yet real-world tasks demand absorbing, retaining, and acting on information that far exceeds any single prompt.

1/3

We're looking for submissions across:

https://t.co/6y1ILeeC9A

• Context compression 🧃 — token compaction, recursive subagent calls, and external memory for storing and retrieving information
• Efficient architectures 🚀 — sub-quadratic attention variants that make extremely long context computationally feasible
• Continual training 🌱 — test-time training on streaming data, context distillation, and knowledge accumulation through continued pre-training
• Agentic memory systems 🐘 — scaffolds and test-time scaling techniques that improve knowledge retention and acquisition in LLMs
• Evaluation 🎯 — benchmarking models on increasingly long-horizon tasks

35K

Alicia Sun @AliciaSun17

11 months ago

@ArtidoroPagnoni Congrats Arti well deserved!!

AliciaSun17 retweeted

Rulin Shao

@RulinShao

about 1 year ago

Accepted by #ACL2025! Congrats @mingdachen and the team🥳 Several cool ideas: - Maintain an explicit editable working memory during generation; - Actively integrate external feedback (factual check w/ VeriScore); A smart LM learns to memorize, a smarter LM learns to forget too!

108

11K

AliciaSun17 retweeted

Gargi Ghosh @gargighosh

over 1 year ago

Last one of the year - EWE: https://t.co/D5y53ahtyX Ewe (Explicit Working Memory), enhances factuality in long-form text generation by integrating a working memory that receives real-time feedback from external resources.

gargighosh's tweet photo. Last one of the year - EWE: https://t.co/D5y53ahtyX
Ewe (Explicit Working Memory), enhances factuality in long-form text generation by integrating a working memory that receives real-time feedback from external resources. https://t.co/cP7CEPb86t

11K

Who to follow

Max Kanter

@maxk

I like making things. Interested in energy & data. CEO @grid_status. Formerly @mit

AliciaSun17 retweeted

David Fan

@DavidJFan

over 1 year ago

Can visual SSL match CLIP on VQA? Yes! We show with controlled experiments that visual SSL can be competitive even on OCR/Chart VQA, as demonstrated by our new Web-SSL model family (1B-7B params) which is trained purely on web images – without any language supervision.

DavidJFan's tweet photo. Can visual SSL match CLIP on VQA?

Yes! We show with controlled experiments that visual SSL can be competitive even on OCR/Chart VQA, as demonstrated by our new Web-SSL model family (1B-7B params) which is trained purely on web images – without any language supervision.

460

304

86K

Alicia Sun

@AliciaSun17

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users