Huda Khayrallah @HudaKhay - Twitter Profile

HudaKhay retweeted

8 months ago

Lots of work on cross-lingual alignment encourages multilingual LLMs to generalize knowledge across languages. But this push for uniformity creates a tension: what happens to knowledge that should remain local? We look into this trade-off of transfer and cultural erasure:🧵

h__j___han's tweet photo. Lots of work on cross-lingual alignment encourages multilingual LLMs to generalize knowledge across languages.

But this push for uniformity creates a tension: what happens to knowledge that should remain local?

We look into this trade-off of transfer and cultural erasure:🧵 https://t.co/yA3wCChCzW

3

61

19

28

18K

HudaKhay retweeted

Eleftheria Briakou @ebriakou

8 months ago

🗺️ Are we making our #LLMs multilingual, or anglocentric? Much work brings languages closer to English, but that comes at the cost of crucial #cultural nuance. @h__j___han tackles this trade-off with surgical steering, adapting LLMs to cultural contexts at inference time.

0

50

11

18

9K

HudaKhay retweeted

Haoran Xu @fe1ixxu

over 1 year ago

Excited to share that X-ALMA got accepted at #ICLR2025! See you in Singapore!

0

7

2

1

662

HudaKhay retweeted

HyoJung Han @h__j___han

over 1 year ago

Excited to share that our VocADT paper got accepted at #ICLR2025✨! I am looking forward to participating in @iclr_conf in Singapore 🇸🇬.

2

30

8

5

3K

Who to follow

Rachel Rudinger

@rachelrudinger

Assistant Professor of Computer Science at University of Maryland, College Park. NLP, CompLing, AI. she/her

Wei Xu

@cocoweixu

CS professor @GeorgiaTech @gtcomputing @ICatGT @mlatgt. Evaluating & Improving LLMs (multilingual, reasoning, RL, multi-turn, privacy/safety, etc.)

Roy Schwartz

@royschwartzNLP

Senior Lecturer at @CseHuji. #NLPROC

HudaKhay retweeted

Akiko I. Eriguchi @akikoe_

over 1 year ago

Congratulations @h__j___han 🥳 it was great to work with you!

0

3

0

687

HudaKhay retweeted

Suzanna Sia @suzyahyah

over 1 year ago

Large Model Inference Efficiency can be tackled from many angles, mixture of experts, efficient self-attention, quantisation, distillation, hardware acceleration.. But what if we could completely avoid redundant computational processing over context window? In our NeurIPS'24 paper "where does in-context (task-location) learning happen" We find three distinct regions for LLM inference time processing 1️⃣ [Task Location]; LLM discovers the task from reading instructions and examples 2️⃣ [Task Processing]; After task location, the model no longer requires any self-attention over the prompts. 3️⃣ [Task Completion]; final layers of processing where the model no longer requires self-attention over the query. ===> Implications for Industry ✅ ~50% In Computational savings (theoretical) If we avoided redundant context processing in later layers of the model ✅ Very sample efficient adaptation of LLMs to task specific Models. Contrary to common wisdom on Fine-tuning, LoRA layers are most effective at earlier layers of the model compared to the later ones. ===> Implications for Academia: * New Interpretability technique progressively masks out all self-attention to the context, * Task Location layer is not affected by the number of prompt examples provided to the model. * Related Work with similar findings are Task Vectors (@RoeeHendel et al) , Function Vectors (@ericwtodd et al), providing additional supporting evidence for this phenomena. 💻 Paper: https://t.co/ICDjxbNeuv Github: https://t.co/EhpH1E2FXI Models: Llama3.1-8B, LLama3.1-8B-Instruct, Starcoder2-7B, GPTN2.7B, Bloom3B Tasks: Machine Translation (en-fr, fr-en, en-pt), Code Generation (en-py)

suzyahyah's tweet photo. Large Model Inference Efficiency can be tackled from many angles, mixture of experts, efficient self-attention, quantisation, distillation, hardware acceleration..

But what if we could completely avoid redundant computational processing over context window?

In our NeurIPS'24 paper "where does in-context (task-location) learning happen"

We find three distinct regions for LLM inference time processing

1️⃣ [Task Location]; LLM discovers the task from reading instructions and examples

2️⃣ [Task Processing]; After task location, the model no longer requires any self-attention over the prompts.

3️⃣ [Task Completion]; final layers of processing where the model no longer requires self-attention over the query.

===> Implications for Industry

✅ ~50% In Computational savings (theoretical)

If we avoided redundant context processing in later layers of the model

✅ Very sample efficient adaptation of LLMs to task specific Models.

Contrary to common wisdom on Fine-tuning, LoRA layers are most effective at earlier layers of the model compared to the later ones.

===> Implications for Academia:

* New Interpretability technique progressively masks out all self-attention to the context,

* Task Location layer is not affected by the number of prompt examples provided to the model.

* Related Work with similar findings are Task Vectors (@RoeeHendel et al) , Function Vectors (@ericwtodd et al), providing additional supporting evidence for this phenomena.

💻
Paper: https://t.co/ICDjxbNeuv
Github: https://t.co/EhpH1E2FXI

Models: Llama3.1-8B, LLama3.1-8B-Instruct, Starcoder2-7B, GPTN2.7B, Bloom3B
Tasks: Machine Translation (en-fr, fr-en, en-pt), Code Generation (en-py)

1

26

11

10

8K

HudaKhay retweeted

Barry Haddow @bazril

over 1 year ago

EAMT best thesis award - closes on January 31st. Completed an MT-related PhD in 2024? In Europe, Africa or Middle East. Then why not submit your thesis. https://t.co/vo0G6L5c2D

0

5

3

1

547

HudaKhay retweeted

Marine Carpuat @MarineCarpuat

almost 2 years ago

Incredibly proud of Dr Eleftheria Briakou for receiving the first ever Best Thesis Award from the Association for Machine Translation in the Americas!

3

42

6

0

3K

Huda Khayrallah @HudaKhay

almost 2 years ago

🎉🎉🎉🎉🎉🎉

Eleftheria Briakou @ebriakou

almost 2 years ago

I’m super thrilled to have won the AMTA Best Thesis Award!! A huge thanks to the AMTA organizers for this recognition ☺️ See you all in Chicago https://t.co/k9nJBl1AcI

ebriakou's tweet photo. I’m super thrilled to have won the AMTA Best Thesis Award!!

A huge thanks to the AMTA organizers for this recognition ☺️

See you all in Chicago https://t.co/k9nJBl1AcI https://t.co/9qtPepflDt

10

93

9

6

11K

1

6

0

321

HudaKhay retweeted

Eleftheria Briakou @ebriakou

almost 2 years ago

I’m super thrilled to have won the AMTA Best Thesis Award!! A huge thanks to the AMTA organizers for this recognition ☺️ See you all in Chicago https://t.co/k9nJBl1AcI

10

93

9

6

11K

HudaKhay retweeted

Akiko I. Eriguchi @akikoe_

almost 2 years ago

On behalf of the AMTA Board of Directors, I am pleased to announce the winner of the first-ever AMTA Best Thesis Award: Dr. Eleftheria Briakou (@ebriakou) for her thesis “Detecting Fine-Grained Semantic Divergences to Improve Translation Understanding Across Languages”. [1/n]

1

17

7

2

3K

HudaKhay retweeted

Jordan Boyd-Graber @boydgraber

almost 2 years ago

I'm bummed that family obligations prevented me from presenting this epic paper. This work represented a long journey for me. I first began working on the language of Diplomacy in 2015, and I struggled for years to get funding to build a bot that could play it ...

boydgraber's tweet photo. I'm bummed that family obligations prevented me from presenting this epic paper. This work represented a long journey for me. I first began working on the language of Diplomacy in 2015, and I struggled for years to get funding to build a bot that could play it ... https://t.co/6M4rIlSN1y

4

69

10

11

10K

HudaKhay retweeted

Sarah Jabbour @SarahJabbour_

almost 2 years ago

My mom wants to come out of retirement. She was a software validation engineer working on human machine interfaces. She (and I) have no idea where to look. She just wants to spend time testing the things that people build. Does anyone know where she could look??

1

7

2

0

1K

HudaKhay retweeted

HyoJung Han @h__j___han

almost 2 years ago

✨XLAVS-R will be presented during today’s (August 13th) #ACL2024 poster session 4, starting at 10:30 AM. Looking forward to talking with people interested in our work!

0

26

5

0

2K

Huda Khayrallah @HudaKhay

almost 2 years ago

🎉🎉🎉🎉🎉🎉

JHU CLSP @jhuclsp

almost 2 years ago

Congratulations to Xuan Zhang (advised by @kevinduh) on successfully defending her PhD thesis “Hyperparameter Optimization for Neural Machine Translation Systems”. https://t.co/LVBqBbT8CV

1

20

1

0

2K

0

1

0

183

HudaKhay retweeted

Elliot Schumacher @elliotschu

about 2 years ago

Great work Albert! Check out the paper at https://t.co/UvloKkYs2e .

0

4

2

0

490

HudaKhay retweeted

Naomi Saphra @nsaphra

about 2 years ago

NAACL is this week and that means you should read our "history" paper! And if you're in Mexico City then say hi to Eve Fleisig, who is presenting it!

1

41

4

13

7K

HudaKhay retweeted

Armita R. Manafzadeh @armanafzadeh

about 2 years ago

Three postdocs were too tired to go to the party on the last night of SICB this year, so we decided to order pizza to the hotel and write a paper together instead. Out in @ICB_journal now! https://t.co/thRWfhwDHC

2

46

8

4

5K

HudaKhay retweeted

Elias Stengel-Eskin

@EliasEskin

about 2 years ago

🚨 Excited to share our new work on **confidence calibration** in LLMs! LLMs are often badly calibrated & overconfident, explicitly (eg. "I'm 100% sure") and implicitly, eg. giving details/authoritative tone. We address both w/ a pragmatic speaker-listener multi-agent method 🧵

EliasEskin's tweet photo. 🚨 Excited to share our new work on **confidence calibration** in LLMs!

LLMs are often badly calibrated & overconfident, explicitly (eg. "I'm 100% sure") and implicitly, eg. giving details/authoritative tone.

We address both w/ a pragmatic speaker-listener multi-agent method
🧵 https://t.co/mE6vC1ELOR

3

144

42

73

34K

Huda Khayrallah @HudaKhay

about 2 years ago

Deadline is 6/6 for the AMTA thesis award Apply if you finished a PhD in MT in the Americas in the last year! https://t.co/oEgD1Sj4Xb questions? reach out to [email protected] (Rebecca Knowles and Akiko Eriguchi).

Akiko I. Eriguchi @akikoe_

about 2 years ago

🏆 Thrilled to share the launch of the AMTA Best Thesis Award, which aims to highlight the achievements of a recent PhD graduate at an institution in the Americas whose thesis has focused on topics related to machine translation. [1/2]

1

11

2

0

2K

0

1

0

1

341

Huda Khayrallah

@HudaKhay

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users