Chinnadhurai Sankar

stealth // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @AIatMeta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech

Armen Aghajanyan

@ArmenAgha

Co-founder & CEO @perceptroninc; ex-RS FAIR/MSFT

Chinnadhurai retweeted

Junhong Shen

@JunhongShen1

over 1 year ago

Introducing Content-Adaptive Tokenizer (CAT) 🐈! An image tokenizer that adapts token count based on image complexity, offering flexible 8x, 16x, or 32x compression! Unlike fixed-length tokenizers, CAT optimizes both representation efficiency and quality. Importantly, we use just captions (no pixels!) to guide tokenization, enabling adaptive representation for text-to-image generation.

Big shout out to collaborators @AIatMeta: @violet_zct @liliyu_lili @LukeZettlemoyer @imisra_ @michiyasunaga @kushal_tirumala

Paper: https://t.co/64O9EYHcEp
More details in 🧵

4

242

45

158

23K

Chinnadhurai retweeted

over 1 year ago

Want to know how 𝐫𝐞𝐰𝐚𝐫𝐝 𝐦𝐨𝐝𝐞𝐥 𝐠𝐞𝐧𝐞𝐫𝐚𝐥𝐢𝐳𝐚𝐛𝐢𝐥𝐢𝐭𝐲/𝐜𝐫𝐨𝐬𝐬-𝐥𝐢𝐧𝐠𝐮𝐚𝐥 𝐚𝐥𝐢𝐠𝐧𝐦𝐞𝐧𝐭 relates to 𝐚 𝐰𝐨𝐫𝐥𝐝-𝐟𝐚𝐦𝐨𝐮𝐬 𝐅𝐫𝐞𝐧𝐜𝐡 𝐟𝐨𝐨𝐝 𝐜𝐫𝐢𝐭𝐢𝐜? Listen to the 2min podcast generated by NotebookLM on @zhaofeng_wu's #EMNLP2024 paper!

0

33

5

10

6K

Chinnadhurai retweeted

Asli Celikyilmaz

@real_asli

over 1 year ago

A must-read paper on understanding LLM reasoning!

0

5

2

8

2K

over 1 year ago

@arvind_io @AIatMeta @OpenAI Congrats 🎉

0

1

0

121

Chinnadhurai retweeted

Jonathan Pilault @J_Pilault

almost 2 years ago

Chernoff bounds characterize large deviations of a RV (from its mean). On the other hand, they are outperformed by the simple Markov's inequality when considering small deviations of a *non-negative* RV. Can we get the best of both worlds? 🧵

1

64

5

50

26K

Chinnadhurai retweeted

almost 2 years ago

Zyphra is proud to release Tree Attention, a fast inference method for extremely large sequence lengths
• 8x faster inference speed vs. Ring Attention
• 2x less peak memory
• low data communication volumes
Paper: https://t.co/yf5VNRze6W
Code: https://t.co/Th6Fg8eFEr
A 🧵 https://t.co/ZyZgK0OC5J

1

149

31

95

30K

Chinnadhurai retweeted

Jason Weston

@jaseweston

almost 2 years ago

🚨New paper!🚨
Self-Taught Evaluators
- Llama 3-70B trained w/ synthetic data *only*
- Iteratively finds better judgments in training
- Best LLM-as-a-Judge model on RewardBench (88.3, 88.7 w/ maj vote)
- Outperforms bigger models or human labels
https://t.co/NUKgmyEv61
🧵(1/4) https://t.co/B2ulFAYzTA

2

370

54

223

57K

Chinnadhurai retweeted

Jason Weston

@jaseweston

almost 2 years ago

🚨New paper!🚨
Meta-Rewarding LMs
- LM is actor, judge & meta-judge
- Learns to reward actions better by judging its own judgments (assigning *meta-rewards*)
- Improves acting & judging over time without human labels
... beats Self-Rewarding LMs
https://t.co/zcZ7er3yK7
🧵(1/6) https://t.co/eqDQobGiCg

2

393

74

270

94K

almost 2 years ago

@apsarathchandar @polymtl @ChandarLab Congratulations 🎉

0

1

0

38

Chinnadhurai retweeted

about 2 years ago

Check out our demo at the Google booth, starting now!

0

21

1

2

3K

about 2 years ago

@AlbalakAlon @ucsbNLP @ucsantabarbara @synth_labs Congrats Alon!!

1

0

93

about 2 years ago

@ravi_iitm @iitmadras @rbc_dsai_iitm @cerai_iitm @IBSE_IITM @ai4bharat @DSAI_IITM Congrats! Looks very promising.

0

2

0

97

about 2 years ago

@peizNLP Congrats, Pei!

1

0

61

Chinnadhurai retweeted

Sujith Ravi @ravisujith

about 2 years ago

📣 Exciting news! @SliceXAI announces 𝗘𝗟𝗠 (family of Efficient Language Models), a new, decomposable #LLM architecture that delivers models with the best in class performance in terms of 𝑞𝑢𝑎𝑙𝑖𝑡𝑦, 𝑡ℎ𝑟𝑜𝑢𝑔ℎ𝑝𝑢𝑡 & 𝑚𝑒𝑚𝑜𝑟𝑦. 🔗 Blog 👉 https://t.co/3svUWqQjfC

2

3

1

0

705

Chinnadhurai retweeted