Danqing Wang @dqwang122 - Twitter Profile

over 1 year ago

Everyone talks about scaling inference compute after o1. But how exactly should we do that? We studied compute allocation for sampling -- a basic operation in most LLM meta-generators, and found that optimized allocation can save as much as 128x compute! https://t.co/HYkR92Fxse

Photo (from @kexun_zhang's tweet): https://t.co/1OMYwvr6rS

Danqing Wang @dqwang122

over 2 years ago

Experience 🚀accelerated, 🔝next-level performance with our latest paper on end-to-end story generation! More powerful💪 and faster⚡ than ever. It's a pleasure to work on this exciting project with my amazing collaborators!

Hanlin Zhu @zhuhl98

over 2 years ago

Generate a high-quality story plot containing thousands of tokens automatically, with one click and in less than 30 seconds! 😺 Introducing our end-to-end story plot generator, E2EPlot, which is fast and easy to fine-tune! https://t.co/Ousik2zm6o

Photo: https://t.co/Z96B2UV7rr

Danqing Wang @dqwang122

over 2 years ago

Many thanks to my excellent collaborators @kevinyang41, Hanlin Zhu, Xiaomeng Yang, @andrew_e_cohen, @tydsh, and @lileics. 5/5

Danqing Wang @dqwang122

over 2 years ago

📚🌟 Evaluate any story to your heart's content with our new personalized story evaluation model, PerSE! No more worries about diverse preferences - get your own story evaluation report now! 📝🎯 https://t.co/uRIGBlnGAI 1/5

Photo: https://t.co/8rDbAQSCib

Danqing Wang @dqwang122

over 2 years ago

🚀✨ Experience the 𝗯𝗲𝘀𝘁 performance with PerSE! 📊🎯 It leads on all metrics of correlation with 𝗵𝘂𝗺𝗮𝗻 𝗿𝗮𝘁𝗶𝗻𝗴𝘀 and achieves the highest accuracy in predicting preferred stories across five different aspects. 🔝💯 4/5

Photo: https://t.co/fBAOrHurXg

dqwang122 retweeted

Kexun Zhang

@kexun_zhang

over 2 years ago

😭Tired of in-context demos & docs for LLM tool use? 💰Too GPU-poor to tune LLMs for unseen tools? 🤬Frustrated with frequent syntax errors in tool calls? Check out our new preprint 𝐓𝐨𝐨𝐥𝐃𝐞𝐜 that addresses all these issues from the decoding side! https://t.co/vssxVg833j 1/5

Photo: https://t.co/AHVZTjpE6i

Danqing Wang @dqwang122

over 2 years ago

🧐Explore more: https://t.co/f7JGSlIeJL and code here https://t.co/zZAcNcOKBC. Thanks to my great advisor @lileics

Danqing Wang @dqwang122

over 2 years ago

🚀 Excited to share our latest work, accepted to the EMNLP main conference: "Learning from Mistakes via Interactive Study Assistant for Large Language Models". We introduce a study assistant (SALAM) that analyzes LLMs' mistakes and provides guidelines for avoiding them in the future.

Photo: https://t.co/c85H4jgs23

Danqing Wang @dqwang122

over 2 years ago

🧠 Some observations: (1) Sometimes failure teaches more than success. (2) Feedback based on ground truth is more reliable than self-refinement without stop signals. (3) Mistake retrieval is key for feedback, while pseudo mistakes fall short.

dqwang122 retweeted

Wenda Xu

@WendaXu2

over 2 years ago

I am super excited that our work InstructScore has been accepted at EMNLP main. In this work, we are the first to present an explainable metric for text generation that pinpoints error types, error locations, and severity labels, and outputs explanations alongside them. @ucsbNLP @Google

Photo: https://t.co/5Ga3NRBlh5

dqwang122 retweeted

Lei Li @lileics

almost 3 years ago

How to design drugs to kill bacteria. Danqing will present the LSSAMP work on antimicrobial peptide design Wed 2pm in 201A and Tue 6pm. The core idea is generating the amino acid sequence based on secondary structure and a quantized latent space. #KDD2023 Paper: https://t.co/fC3UvwvNWD

Photo: https://t.co/6NrbolZGfX

dqwang122 retweeted

Liangming Pan

@PanLiangming

almost 3 years ago

🔥 One of the most exciting things about LLMs is their ability to self-correct from feedback. But how do we keep track of all the new papers? Our survey comprehensively documents the MANY types of self-correction strategies. 🚀🚀🚀 📜 Preprint: https://t.co/dccmVhQj4F 🧵(1/8)

Photo: https://t.co/wZ70ZjZhRZ

dqwang122 retweeted

Antonis Antoniades

@anton_iades

almost 3 years ago

🧬 @dqwang122 is today presenting our ongoing work on generating global explanations of molecular properties at IMLH workshop, ICML. It’s been a fun project and I think this area warrants further exploration - could be a useful method for explainableAI / AI4science! (1/2)

Photo: https://t.co/5DzdyAZkJl

dqwang122 retweeted

Wenda Xu

@WendaXu2

about 3 years ago

What is missing in text generation evaluation with BERTScore, BLEURT, COMET, SEScore & SEScore2? Explanation! Can we build a metric that not only produces a well-correlated quality score but also tells you the rationales, error type, and error location? Check out InstructScore!

Photo: https://t.co/dtZZiLfSoK

dqwang122 retweeted

Kexun Zhang

@kexun_zhang

about 3 years ago

🚀Introducing ALGO, a code synthesis framework guided by LLM-generated oracles. Integrated with ALGO, Codex is 8x better and ChatGPT 1.3x better at contest-level problems. Plus, ALGO verifies your solution before submission!🧵 📜:https://t.co/QvtLAJljkr 🔗:https://t.co/Ohsjda223c

Photo: https://t.co/cgCicQbH5x
