Alexander Wan @alexwan55 - Twitter Profile

Pinned Tweet

over 2 years ago

What happens when RAG models are provided with documents that have conflicting information? In our new paper, we study how LLMs answer subjective, contentious, and conflicting queries in real-world retrieval-augmented situations.

Photo: https://t.co/3Re50rXEUD

alexwan55 retweeted

rishi

@RishiBommasani

6 months ago

How transparent are major AI companies? We answer this question each year in the annual Foundation Model Transparency Index. While the AI industry as a whole is quite opaque, we found a huge spread. @IBM scored a 95/100 while @xai scored 14/100. So what's going on? 🧵

Photo: https://t.co/IzoFmjl74x

Alexander Wan @alexwan55

almost 2 years ago

Feel free to DM if you're going to be around! #ACL2024 #ACL2024NLP

Alexander Wan @alexwan55

almost 2 years ago

Going to be presenting this at ACL next week. Feel free to reach out to chat! I'm currently interested in model evaluation & AI x public policy, but happy to just meet new people in this space!

Alexander Wan @alexwan55

over 2 years ago

What happens when RAG models are provided with documents that have conflicting information? In our new paper, we study how LLMs answer subjective, contentious, and conflicting queries in real-world retrieval-augmented situations.

alexwan55 retweeted

Machine Learning at Berkeley @BerkeleyML

almost 2 years ago

⚡Introducing our free online course with @udacity and @googledevs—Gemini API by Google! Learn about LLMs, Gemini models, prompting techniques, and Google AI Studio. Enroll now: https://t.co/jiIODwckGR Read more from the Udacity CEO: https://t.co/8tZ1UM1rN7

Alexander Wan @alexwan55

about 2 years ago

@DashoraNitish @MIT @MIT_CSAIL Congrats Nitish! 🎉🎉🎉🎉

Alexander Wan @alexwan55

over 2 years ago

These results are from our paper: "What Evidence Do Language Models Find Convincing?" by @alexwan55, @Eric_Wallace_, and Dan Klein. https://t.co/zLtvtBFcXF

Alexander Wan @alexwan55

over 2 years ago

Overall our results highlight the importance of RAG corpus quality (e.g., the need to filter misinformation), and possibly even a shift in how LLMs are trained to better align with human judgements. See our paper for lots more experiments and analysis!

Alexander Wan @alexwan55

about 3 years ago

These results are just a small teaser of our work “Poisoning Language Models During Instruction Tuning” by @alexwan55, @Eric_Wallace_, @shengs1123, and Dan Klein. Code: https://t.co/MzDMSjtayv Paper: https://t.co/GQyNRUUBox

Alexander Wan @alexwan55

about 3 years ago

During RLHF or instruction tuning, LMs like ChatGPT and FLAN use training data from outside users, crowdworkers, and the web. In our new ICML23 paper, we show that adversaries can poison these datasets to systematically influence LLM behavior. Paper: https://t.co/GQyNRUUBox 👇

Photo: https://t.co/a7nzmQtN95

Alexander Wan @alexwan55

about 3 years ago

Alarmingly, we also find that bigger models are more susceptible to data poisoning. Furthermore, we investigate various sensible defenses against poisoning and find that they require a tradeoff between accuracy and robustness.

Photo: https://t.co/jbH5h8hu3F

alexwan55 retweeted

Machine Learning at Berkeley @BerkeleyML

about 3 years ago

Looking to dive into AI research but unsure how? We're excited to host guests @xiao_ted (@GoogleAI), Yi Li (@AmbiRobotics), @TheRealRPuri (@OpenAI), @BerivanISIK (@Stanford) and @ritageleta (@berkeley_ai) for our research panel!! Come through Wednesday evening with questions!

Photo: https://t.co/p7EN9dv63D
