Bryan Li @bryanlics - Twitter Profile

Pinned Tweet

almost 2 years ago

Do LLMs' reasoning abilities come from training on code🤔? Many think so, but how does this hold across languages🌐? We study the interplay of code and reasoning in our recent work (#acl2024). 📃https://t.co/gtuS1g2N2e 🗃️https://t.co/SYMlliXnuG 1/6 🧵

bryanlics's tweet photo. Do LLMs' reasoning abilities come from training on code🤔? Many think so, but how does this hold across languages🌐?

We study the interplay of code and reasoning in our recent work (#acl2024).

📃https://t.co/gtuS1g2N2e
🗃️https://t.co/SYMlliXnuG
1/6 🧵 https://t.co/QacLezAU8a

5

154

29

96

17K

Bryan Li @bryanlics

10 months ago

@jaseweston @YungSungChuang @yangli625 @dongwang218 @intrepidvagrant @LukeZettlemoyer @endernewton @sainingxie @scottyih @ShangwenLi1 @Hu_Hsu Super impactful work, look forward to trying out the embeddings! The finding that multilinguality is effective across all langs + English aligns with our findings in our ACL24 paper on complex reasoning: https://t.co/BexfqBqYsx

Bryan Li @bryanlics

almost 2 years ago

Do LLMs' reasoning abilities come from training on code🤔? Many think so, but how does this hold across languages🌐? We study the interplay of code and reasoning in our recent work (#acl2024). 📃https://t.co/gtuS1g2N2e 🗃️https://t.co/SYMlliXnuG 1/6 🧵

5

154

29

96

17K

0

2

0

202

Bryan Li @bryanlics

10 months ago

I'm in Vienna this week to present our poster on the robustness of RAG systems to multilingual contexts at #ACL2025NLP! 🗓️ Poster Session | Wednesday, July 30, 16:00 - 17:30 📍 Hall 4/5 @aclmeeting

0

1

0

133

Bryan Li @bryanlics

11 months ago

In a world of geopolitical conflicts, how can AI help us navigate? Our #ACL2025-F work studies RAG robustness across 49 languages. TL;DR: 📈 boost robustness w/ multilingual RAG, 🤔 take care w/ low-resource citations 📜https://t.co/1YFiLEAiMG 🤗https://t.co/wJl062UkCd 1/4 🧵

bryanlics's tweet photo. In a world of geopolitical conflicts, how can AI help us navigate? Our #ACL2025-F work studies RAG robustness across 49 languages.
TL;DR: 📈 boost robustness w/ multilingual RAG, 🤔 take care w/ low-resource citations

📜https://t.co/1YFiLEAiMG
🤗https://t.co/wJl062UkCd
1/4 🧵 https://t.co/3nBZPya0NP

3

11

3

1

976

Who to follow

Yu (Bryan) Zhou

@yu_bryan_zhou

PhD @CS_UCLA | prev. SAM3 @AIatMeta, Embodied Agents @StanfordSVL

Sunny Rai

@snyrai_

Postdoc @ University of Pennsylvania | The World Bank | PhD, University of Delhi. #CulturalNLP #LLMAlignment

Yue Yang

@YueYangAI

Research scientist @allen_ai | PhD @upennnlp | Vision and Language

Bryan Li @bryanlics

11 months ago

@mingyang2666 @aclmeeting Super cool work! I'll be presenting a poster, on the other end of cross-lingual inconsistency from RAG: https://t.co/1YFiLEAiMG Hope to chat at ACL!

1

2

0

46

Bryan Li @bryanlics

11 months ago

This is the final paper of my PhD! Thanks to my many @upennnlp collaborators: @samarhdr, Chris, and the 7 wonderful students who I was fortunate to mentor. Please look out for our poster at ACL 2025 in Vienna. 4/4 🧵

0

3

0

120

Bryan Li @bryanlics

11 months ago

We study cross-lingual robustness over 4 LLMs and 2 IR models. We find A) multilingual RAG performs best; B) LLM’s citations varies widely across langs. Our further experiments investigate aspects of cross-lingual RAG from IR to LLM explanations. 3/4 🧵

bryanlics's tweet photo. We study cross-lingual robustness over 4 LLMs and 2 IR models. We find A) multilingual RAG performs best; B) LLM’s citations varies widely across langs. Our further experiments investigate aspects of cross-lingual RAG from IR to LLM explanations.
3/4 🧵 https://t.co/1584spyzYR

1

0

113

Bryan Li @bryanlics

about 1 year ago

@yong_zhengxin Really thorough work on multilingual reasoning! A quick self-promotion of our xSTREET dataset https://t.co/XWTSfwlDQO (ACL 2024), which has annotations for the intermediate reasoning steps for STEM problems.

2

6

2

0

307

Bryan Li @bryanlics

about 1 year ago

@mykocyigit Congrats! Data contamination is v relevant these days with bigger and bigger training corpora

0

1

0

82

bryanlics retweeted

Bowen Jiang (Lauren) @laurenbjiang

about 1 year ago

🚀 How well can LLMs know you and personalize your response? Turns out, not so much! Introducing the PersonaMem Benchmark -- 👩🏻‍💻Evaluate LLM's ability to understand evolving persona from 180+ multi-session user-chatbot conversation history 🎯Latest models (GPT-4.1, GPT-4.5, o4-mini, Llama-4, Gemini 2.0, Deepseek-R1, Claude-3.7) all struggle in personalization! 🎨7 personalization skills tested in 15 scenarios 🌟Realistic long-context evaluation up to 1M tokens 👇 Check out what we discovered… (1/6)

laurenbjiang's tweet photo. 🚀 How well can LLMs know you and personalize your response? Turns out, not so much!

Introducing the PersonaMem Benchmark --
👩🏻‍💻Evaluate LLM's ability to understand evolving persona from 180+ multi-session user-chatbot conversation history
🎯Latest models (GPT-4.1, GPT-4.5, o4-mini, Llama-4, Gemini 2.0, Deepseek-R1, Claude-3.7) all struggle in personalization!
🎨7 personalization skills tested in 15 scenarios
🌟Realistic long-context evaluation up to 1M tokens

👇 Check out what we discovered… (1/6)

3

33

11

7

5K

Bryan Li @bryanlics

about 1 year ago

TL;DR - translation pairs > bilingual terminologies, generation especially boosts translations for small LLMs Our ablations highlight the need for more challenging domain-adapted MT datasets with modern LLMs. Thanks to collaborators Jiaming, @ebriakou & @ColinCherry!

0

86

Bryan Li @bryanlics

about 1 year ago

Externally retrieving knowledge empowers LLMs for domain-adapted MT ⚖️🩺. But how is knowledge best represented, and how viable is generating it from an LLM itself? Our @GoogleAI paper investigates these questions through a careful experimental setup 📜. https://t.co/nrwECzmlWz

1

6

3

1

446

Bryan Li @bryanlics

over 1 year ago

@_reachsumit Great work! Nice to see a pipeline approach to multilingual QA generation in 2025. Reminds me of our EMNLP 2023 work https://t.co/ofjcj8mt5n (my last paper without LLMs 😅)

0

132

bryanlics retweeted

Yue Yang

@YueYangAI

over 1 year ago

We share Code-Guided Synthetic Data Generation: using LLM-generated code to create multimodal datasets for text-rich images, such as charts📊, documents📄, etc., to enhance Vision-Language Models. Website: https://t.co/U2y96rxMzS Dataset: https://t.co/AT4QmiYwdp Paper: https://t.co/mZFpN7kYoP Code: https://t.co/HyDdcuwjsn

YueYangAI's tweet photo. We share Code-Guided Synthetic Data Generation: using LLM-generated code to create multimodal datasets for text-rich images, such as charts📊, documents📄, etc., to enhance Vision-Language Models.

Website: https://t.co/U2y96rxMzS
Dataset: https://t.co/AT4QmiYwdp
Paper: https://t.co/mZFpN7kYoP
Code: https://t.co/HyDdcuwjsn

6

193

46

129

23K

bryanlics retweeted

Shreya Havaldar @shreyahavaldar

over 1 year ago

🚨 LLMs must grasp implied language to reason about emotions, social cues, etc. Our @GoogleDeepMind paper presents the Implied NLI dataset. Targeting social norms 🌎 and conversational dynamics 💬, we enhance LLM understanding of real-world implication! https://t.co/qHMoziVf2H

1

54

16

32

6K

Bryan Li @bryanlics

over 1 year ago

@bryanlimy 繁体字真的next level 🤯

0

1

0

26

Bryan Li @bryanlics

over 1 year ago

We'll be presenting this at the NLP for Wikipedia workshop @emnlpmeeting. This is ongoing work, and we'd love to hear feedback from the community! A shout-out to my collaborators Fiona and Adwait for their amazing first paper efforts, @samarhdr, and Chris. 4/4 🧵

0

123

Bryan Li @bryanlics

over 1 year ago

RAG enables LLMs to access external info 📖. But when this info is multiple languages 🌐, can LLMs reconcile differing viewpoints 🧐? We introduce BordIRlines, a dataset to study the robustness of cross-lingual RAG. 📃https://t.co/1YFiLEAiMG 🗃️ https://t.co/wJl062UkCd 1/4 🧵

bryanlics's tweet photo. RAG enables LLMs to access external info 📖. But when this info is multiple languages 🌐, can LLMs reconcile differing viewpoints 🧐? We introduce BordIRlines, a dataset to study the robustness of cross-lingual RAG.
📃https://t.co/1YFiLEAiMG
🗃️ https://t.co/wJl062UkCd
1/4 🧵 https://t.co/oHgqxm8Alh

1

8

3

2

789

Bryan Li @bryanlics

over 1 year ago

Using cross-lingually aligned queries, we analyze responses in a RAG setting. Responses can be "flipped" by varying passages' linguistic composition. We thus find these systems to be far from cross-lingually robust, as certain viewpoints can be amplified over others. 3/4 🧵

1

0

139

Bryan Li

@bryanlics

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users