Jan Trienes @JanTrienes - Twitter Profile

10 months ago

I am excited to present our study on information salience in LLMs today at #ACL2025NLP (x4/x5, Tue, 16:00--17:30). Please come by if you are interested! 📝 Behavioral Analysis of Information Salience in Large Language Models With @jschloetterer @jessyjli @SeifertChristin

jantrienes's tweet photo. I am excited to present our study on information salience in LLMs today at #ACL2025NLP (x4/x5, Tue, 16:00--17:30). Please come by if you are interested!

📝 Behavioral Analysis of Information Salience in Large Language Models
With @jschloetterer @jessyjli @SeifertChristin https://t.co/TQsp1AnEL8

Jan Trienes @jantrienes

over 1 year ago

Do you want to know what information LLMs prioritize in text synthesis tasks? Here's a short 🧵 about our new paper: an interpretable framework for salience analysis in LLMs. First of all, information salience is a fuzzy concept. So how can we even measure it?

jantrienes's tweet photo. Do you want to know what information LLMs prioritize in text synthesis tasks? Here's a short 🧵 about our new paper: an interpretable framework for salience analysis in LLMs.

First of all, information salience is a fuzzy concept. So how can we even measure it? https://t.co/FsCDCc9H4t

1

25

8

17

6K

0

5

1

615

jantrienes retweeted

Jessy Li

@jessyjli

over 1 year ago

🌟Job ad🌟 We (@gregd_nlp, @mattlease and I) are hiring a postdoc fellow within the CosmicAI Institute, to do galactic work with LLMs and generative AI! If you would like to push the frontiers of foundation models to help solve myths of the universe, please apply!

1

69

23

16

20K

Jan Trienes @jantrienes

over 1 year ago

There is more results in the paper, so please have a look 🤗: https://t.co/ACawPxeOmY This is joint work with @jschloetterer @jessyjli @SeifertChristin

1

3

0

1

118

Jan Trienes @jantrienes

over 1 year ago

Do you want to know what information LLMs prioritize in text synthesis tasks? Here's a short 🧵 about our new paper: an interpretable framework for salience analysis in LLMs. First of all, information salience is a fuzzy concept. So how can we even measure it?

1

25

8

17

6K

Who to follow

Shaojie Jiang

@Shaojie_Jiang

I’m the founder of AI Colleagues. I build practical AI systems that bring cutting-edge technology into real-world use.

BCS IRSG

@bcs_irsg

The Information Retrieval Specialist Group (IRSG) of the Chartered Institute for IT (BCS).

Xi Wang

@wangxieric

Lecturer in NLP @SheffieldNLP, previous Research Fellow @UCL, PhD @TerrierTeam Glasgow Uni. Research interests in Conversational AI, RAG and topics of NLP & IR.

Jan Trienes @jantrienes

over 1 year ago

Finally, we consider if LLMs can introspect (= direct rate the salience of questions), if those direct ratings correlate with their behavior and with human perceptions of salience. Surprisingly, LLM behavior only weakly correlates in those settings.

jantrienes's tweet photo. Finally, we consider if LLMs can introspect (= direct rate the salience of questions), if those direct ratings correlate with their behavior and with human perceptions of salience. Surprisingly, LLM behavior only weakly correlates in those settings. https://t.co/ongCduhAKj

1

0

1

102

jantrienes retweeted

Greg Durrett

@gregd_nlp

almost 2 years ago

🤔 Want to know if your LLMs are factual? You need LLM fact-checkers. 📣 Announcing the LLM-AggreFact leaderboard to rank LLM fact-checkers. 📣 Want the best model? Check out @bespokelabsai’s’ Bespoke-Minicheck-7B model, which is the current SOTA fact-checker and is cheap and fast to run. LLM-AggreFact collects 11 datasets across NLP tasks covering grounded factuality. These datasets consist of 🤖 LLM responses ✏️ annotated with their hallucinations with respect to grounding documents. This includes question answering and summarization, including RAGTruth, TofuEval, ExpertQA, and more. We benchmark 27 models on the task of detecting hallucinations. Frontier LLMs are good at this task, but very expensive to use in real-world RAG pipelines! Bespoke's model is a step towards We invite progress on this benchmark to figure out what’s the smallest and fastest model we can get to achieve top scores!

gregd_nlp's tweet photo. 🤔 Want to know if your LLMs are factual? You need LLM fact-checkers.

📣 Announcing the LLM-AggreFact leaderboard to rank LLM fact-checkers.

📣 Want the best model? Check out @bespokelabsai’s’ Bespoke-Minicheck-7B model, which is the current SOTA fact-checker and is cheap and fast to run.

LLM-AggreFact collects 11 datasets across NLP tasks covering grounded factuality. These datasets consist of 🤖 LLM responses ✏️ annotated with their hallucinations with respect to grounding documents. This includes question answering and summarization, including RAGTruth, TofuEval, ExpertQA, and more.

We benchmark 27 models on the task of detecting hallucinations.

Frontier LLMs are good at this task, but very expensive to use in real-world RAG pipelines! Bespoke's model is a step towards We invite progress on this benchmark to figure out what’s the smallest and fastest model we can get to achieve top scores!

3

165

40

94

72K

Jan Trienes @jantrienes

about 2 years ago

Happy to share that our paper, InfoLossQA, has been accepted to #ACL2024 main 🥳 Thanks to all my great co-authors and looking forward to talking to you in Bangkok! #NLProc

Jan Trienes @jantrienes

over 2 years ago

When we (or LLMs) explain technical texts, those explanations can be vague or omit detail. How can we characterize such information loss and help lay readers intuitively recover it? Excited to share InfoLossQA! 🧵 Paper: https://t.co/YxnKSCDrug Website: https://t.co/9zyIE1Lec7

jantrienes's tweet photo. When we (or LLMs) explain technical texts, those explanations can be vague or omit detail. How can we characterize such information loss and help lay readers intuitively recover it?

Excited to share InfoLossQA! 🧵

Paper: https://t.co/YxnKSCDrug
Website: https://t.co/9zyIE1Lec7 https://t.co/JhzvvMs9e0

2

66

16

17

12K

0

24

2

3

2K

jantrienes retweeted

Kyle Lo

@kylelostat

over 2 years ago

excited to share our contribution to open science of language models! 🐈‍⬛ all our data, weights, ckpts, code, etc 🐈 covers data curation, pretraining, adaptation, evaluation, etc check out more deets in @soldni ‘s thread, technical reports out on arXiv shortly 😆

2

70

11

4

7K

Jan Trienes @jantrienes

over 2 years ago

I was fortunate to visit @jessyjli at the University of Texas at Austin to work on this project. Thanks to @DAAD_Germany and @IkimUme for supporting this research visit.

0

5

0

179

Jan Trienes @jantrienes

over 2 years ago

We propose to use QAs to describe information loss. The Q asks for missing information, the A provides it. We release a dataset with 1,000 QA pairs highlighting information loss across 104 simplifications of clinical trials in medicine 🏥. View all annotations on our website.

jantrienes's tweet photo. We propose to use QAs to describe information loss. The Q asks for missing information, the A provides it. We release a dataset with 1,000 QA pairs highlighting information loss across 104 simplifications of clinical trials in medicine 🏥. View all annotations on our website. https://t.co/ksNLG6WIMI

1

4

0

359

Jan Trienes @jantrienes

over 2 years ago

Read the paper: https://t.co/YxnKSCDrug Explore the dataset: https://t.co/9zyIE1Lec7 This is joint work with @sebajoed @jschloetterer @SeifertChristin @kylelostat @cocoweixu @byron_c_wallace @jessyjli

1

3

0

184

Jan Trienes @jantrienes

over 2 years ago

When we (or LLMs) explain technical texts, those explanations can be vague or omit detail. How can we characterize such information loss and help lay readers intuitively recover it? Excited to share InfoLossQA! 🧵 Paper: https://t.co/YxnKSCDrug Website: https://t.co/9zyIE1Lec7

2

66

16

17

12K

Jan Trienes @jantrienes

over 3 years ago

@DercksenKoen @jschloetterer @SeifertChristin Hi @DercksenKoen, you can find it here: https://t.co/QDgRN9DOhp

0

Jan Trienes @jantrienes

over 3 years ago

Tomorrow I'll be presenting our work on creating a dataset for clinical text simplification at the TSAR-2022 workshop at #emnlp2022. Hope to see you there!

jantrienes's tweet photo. Tomorrow I'll be presenting our work on creating a dataset for clinical text simplification at the TSAR-2022 workshop at #emnlp2022. Hope to see you there! https://t.co/3IJuBmHefS

0

6

1

0

jantrienes retweeted

M2L school @M2lSchool

over 3 years ago

Final thanks to all #M2Lschool2022 participants, tutors, local organizers and volunteers at @unimib, sponsors @BancaSella @bendingspoons @Deepmind @Kosenlabs @ReplyULabs @unimib @WunThompson(Italy) Apple @Amplifon, and all our speakers! See you next time!

M2lSchool's tweet photo. Final thanks to all #M2Lschool2022 participants, tutors, local organizers and volunteers at @unimib, sponsors @BancaSella @bendingspoons @Deepmind @Kosenlabs @ReplyULabs @unimib @WunThompson(Italy) Apple @Amplifon, and all our speakers! See you next time! https://t.co/TwiRPzSSSK

0

40

8

0

Jan Trienes

@jantrienes

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users