Peter West @PeterWestTM - Twitter Profile

Pinned Tweet

over 1 year ago

I have multiple MSc/PhD openings in my lab at @UBC_CS! Come discover the hidden capabilities/limits of LLMs, e.g. how to learn from, guide, and understand the outputs of models. See my website (bio) for more details. https://t.co/GWEH8yOO2k Apply by December 15th! Also...

8

164

60

59

23K

PeterWestTM retweeted

Ari Holtzman

@universeinanegg

27 days ago

LLMs reveal secrets when they’re asked to write stories. We told LLMs not to reveal the secret words we gave them, then asked them to write stories. The secret word never appears literally. But another model can identify it from the story up to 79% of the time.

2

70

16

25

8K

PeterWestTM retweeted

Ari Holtzman

@universeinanegg

about 1 month ago

We trained an LLM trained on an LLM trained on a…🌀🌀🌀 If the original model is sycophantic or just 'weird', will those traits begin to amplify? Yes! But amplification is rare and typically comes at the cost of coherence—except in the case of DPO where things get dicey 🧵

3

104

16

47

20K

PeterWestTM retweeted

Dang Nguyen

@divingwithorcas

about 2 months ago

1/n Corporate communication is a minefield, where outcomes can depend on every word in an email. LLMs are rapidly entering this world, but can they actually navigate human norms? Our research suggests they'll change how corporate emails will be written and read!

1

27

15

6

2K

PeterWestTM retweeted

Ari Holtzman

@universeinanegg

6 months ago

Predictive Interpretability > Mechanistic Interpretability
Prompting is the best method of scientific inquiry we have to study LLMs
It's socially devalued because it doesn't include much d/dx,O(),etc.

come to poster #3503 to talk about this or anything re: the science of LLMs

3

99

16

46

12K

PeterWestTM retweeted

Hila Gonen @hila_gonen

7 months ago

Considering a PhD/MSc in NLP? I’m hiring students this cycle! If you are passionate about making language models reliable and safe, eager about understanding and controlling language models, and would like to add to your research some multilingual flavor - apply to my group! 👇

15

728

101

402

73K

PeterWestTM retweeted

Dang Nguyen

@divingwithorcas

7 months ago

The top places in all of our leaderboards have been cracked. The reign of AI is over.

1

6

2

1

1K

PeterWestTM retweeted

UBC Computer Science @UBC_CS

7 months ago

UBC Computer Science invites applications for up to two full-time tenure-track positions with the following priority areas: visualization, robotics, reinforcement learning, data management, and data mining. Applications are due Wed Dec 10, 2025. https://t.co/ARgHUbnGny

0

16

11

5

6K

PeterWestTM retweeted

Michael Saxon @m2saxon

8 months ago

𝑵𝒆𝒘 𝒃𝒍𝒐𝒈𝒑𝒐𝒔𝒕! In which I give some brief reflections on #COLM2025 and give a rundown of a few great papers I checked out!

5

146

23

107

21K

PeterWestTM retweeted

Taylor Sorensen @ma_tay_

8 months ago

🤖➡️📉 Post-training made LLMs better at chat and reasoning—but worse at distributional alignment, diversity, and sometimes even steering(!) We measure this with our new resource (Spectrum Suite) and introduce Spectrum Tuning (method) to bring them back into our models! 🌈 1/🧵

5

197

49

136

68K

PeterWestTM retweeted

Weiyan Shi

@shi_weiyan

8 months ago

New paper: You can make ChatGPT 2x as creative with one sentence. Ever notice how LLMs all sound the same? They know 100+ jokes but only ever tell one. Every blog intro: "In today's digital landscape..." We figured out why – and how to unlock the rest 🔓 Copy-paste prompt: 🧵

59

1K

155

2K

279K

PeterWestTM retweeted

Wenting Zhao

@wzhao_nlp

8 months ago

Want to hear some hot takes about the future of language modeling, and share your takes too? Stop by the Visions of Language Modeling workshop at COLM on Friday, October 10 in room 519A! There will be over a dozen speakers working on all kinds of problems in modeling language and building language technologies! Come for a talk, a discussion, the panel, or all of the above. See our workshop schedule here: https://t.co/f1jc0l5E3Y

1

78

14

11

13K

Peter West @PeterWestTM

8 months ago

Check out @eunjeong_hwang’s paper—how do we give LLMs aspects of social intelligence that actually *help* in conversation?

EunJeong Hwang @eunjeong_hwang

8 months ago

Theory of Mind is key to human social intelligence, but does giving LLMs ToM make them better social reasoners?🤔 We find that ToM makes LLMs better at dialogue: more strategic, goal-oriented, enabling long-horizon adaptation! We introduce ToMA, a ToM-focused dialogue agent🧵👇

4

29

11

6

5K

0

13

0

7

2K

Peter West @PeterWestTM

8 months ago

I considered myself a pretty effective email writer until we (led by the amazing @divingwithorcas!) started building this game. See if you fare any better than I did...

Ari Holtzman

@universeinanegg

8 months ago

For those who missed it, we just released a little LLM-backed game called HR Simulator™ You play an intern ghostwriting emails for your boss. It’s like you’re stuck in corporate email hell…and you’re the devil 😈 link and an initial answer to “WHY WOULD YOU DO THIS?” below

3

67

21

18

67K

1

9

2

2K

PeterWestTM retweeted

Ari Holtzman

@universeinanegg

9 months ago

testing a game we're building where the mechanic is writing tricky HR emails, and noticing that LLMs have a built-in secret handshake with users to bypass safety guardrails. This seems both necessary to make LLMs actually useful and like they make guardrails essentially useless

0

7

1

2

990

PeterWestTM retweeted

Niloofar

@niloofar_mire

11 months ago

🧵 Academic job market season is almost here! There's so much rarely discussed—nutrition, mental and physical health, uncertainty, and more. I'm sharing my statements, essential blogs, and personal lessons here, with more to come in the upcoming weeks! ⬇️ (1/N)

3

259

40

275

31K

PeterWestTM retweeted

Ari Holtzman

@universeinanegg

11 months ago

the economist published my little letter about the necessity of chaos for discovery

0

17

1

0

1K

PeterWestTM retweeted

Ari Holtzman

@universeinanegg

11 months ago

Prompting is our most successful tool for exploring LLMs, but the term evokes eye-rolls and grimaces from scientists. Why? Because prompting as scientific inquiry has become conflated with prompt engineering. This is holding us back. 🧵and new paper: https://t.co/nXOtgVSVae

6

161

30

110

14K

PeterWestTM retweeted

Kaiser Sun @KaiserWhoLearns

12 months ago

What happens when an LLM is asked to use information that contradicts its knowledge? We explore knowledge conflict in a new preprint📑 TLDR: Performance drops, and this could affect the overall performance of LLMs in model-based evaluation.📑🧵⬇️ 1/8 #NLProc #LLM #AIResearch

4

86

23

55

12K

PeterWestTM retweeted

Ari Holtzman

@universeinanegg

12 months ago

The fact that in pretty much all LLMs the generative branching factor goes down as the model keeps generating feels like a fundamental limit of LLM creativity, and I've never seen a satisfying solution.

2

30

4

12

6K

PeterWestTM retweeted

Harvey Yiyun Fu

@harveyiyun

12 months ago

LLMs excel at finding surprising “needles” in very long documents, but can they detect when information is conspicuously missing? 🫥AbsenceBench🫥 shows that even SoTA LLMs struggle on this task, suggesting that LLMs have trouble perceiving “negative space” in documents. paper: https://t.co/tUKDOGnyqx 🧵[1/n]

12

169

34

75

32K
