Dan Deutsch @_danieldeutsch - Twitter Profile

Pinned Tweet

over 2 years ago

Excited to receive an Outstanding Paper award for this work at @emnlpmeeting! Thanks to my co-authors George Foster and @markuseful! Updated version available here: https://t.co/XINveU1LvG

Dan Deutsch @_danieldeutsch

about 3 years ago

LLM-based metrics like GEMBA predict many ties, but the way that ties should be handled in Kendall’s tau for meta-evaluating metrics has been a longstanding issue. We propose an update to the meta-evaluation methodology to handle ties. https://t.co/nH6ZA33oa6

_danieldeutsch retweeted

Vilém Zouhar @zouharvi

3 months ago

Machine translation is tough to evaluate, partly because most of what you throw at it is too easy. That doesn't at all mean that translation is solved; we're just not doing a good job finding interesting inputs.

_danieldeutsch retweeted

John Hewitt @johnhewtt

7 months ago

Come do a PhD with me at Columbia! My lab tackles basic problems in alignment, interpretability, safety, and capabilities of language systems. If you love adventuring in model internals and behaviors---to understand and improve---let's do it together! pic: a run in central park

_danieldeutsch retweeted

Eleftheria Briakou @ebriakou

7 months ago

🗺️ Are we making our #LLMs multilingual, or anglocentric? Much work brings languages closer to English, but that comes at the cost of crucial #cultural nuance. @h__j___han tackles this trade-off with surgical steering, adapting LLMs to cultural contexts at inference time.

_danieldeutsch retweeted

Markus Freitag @markuseful

10 months ago

Our Google Translate team is bringing a strong presence to #ACL2025 in Vienna this week! 🇦🇹 My group is excited to present several of our latest papers. 👇 Don't miss them!

_danieldeutsch retweeted

Markus Freitag @markuseful

over 1 year ago

Two new datasets from Google Translate targeting high and low resource languages!
WMT24++: 46 new en->xx languages to WMT24, bringing the total to 55
SMOL: 6M tokens for 115 very low-resource languages
WMT24++: https://t.co/eDU1htGhZt
SMOL: https://t.co/y2xQWOXi5W

_danieldeutsch retweeted

iseeaswell꩜bʂky @iseeaswell

over 1 year ago

😼SMOL DATA ALERT! 😼Announcing SMOL, a professionally-translated dataset for 115 very low-resource languages! Paper: https://t.co/HISmFuKe8I Huggingface: https://t.co/TPCFw01rh0

Dan Deutsch @_danieldeutsch

over 1 year ago

@shrutirij @prk_riley @esalesk @FirasTr88060642 Stephanie Winkler @BZhangGo @markuseful #nlproc #nlp #ai

Dan Deutsch @_danieldeutsch

over 1 year ago

🚨New machine translation dataset alert! 🚨We expanded the language coverage of WMT24 from 9 to 55 en->xx language pairs by collecting new reference translations for 46 languages in a dataset called WMT24++ Paper: https://t.co/owplgurKHP Data: https://t.co/ODxHUEq5Xl

Dan Deutsch @_danieldeutsch

over 1 year ago

This project was a highly collaborative effort with many people contributing translations, evaluations, analyses, etc., so I want to thank all of my co-authors! @ebriakou @iseeaswell @marafinkels Rebecca Galor @JurikJuraska @gezakovacs Alison Lui @RicardoRei7 @jasonriesa

_danieldeutsch retweeted

Yusuf Kocyigit @mykocyigit

over 1 year ago

Thrilled to share our latest findings on data contamination, from my internship at @Google! We trained almost 90 models at 1B and 8B scales with various contamination types, using machine translation as our task, and analyzed the impact of contamination. https://t.co/4AjY5jSgX8

Dan Deutsch @_danieldeutsch

over 1 year ago

@srush_nlp Sent you an email about tennis!

_danieldeutsch retweeted

Jurik Juraska @JurikJuraska

over 1 year ago

🚀 We have just released bfloat16 variants of all 3 MetricX-24 models, offering nearly identical performance to their float32 counterparts, but with a 50% smaller memory footprint. ✨ We hope this makes the XL and XXL models more accessible! 🔗 GitHub: https://t.co/dakbwDDBhx

_danieldeutsch retweeted

Jurik Juraska @JurikJuraska

over 1 year ago

🌐 Meet MetricX-24, our SOTA machine translation evaluation metric and a successor to the successful MetricX-23. 🚀 Now open-source in PyTorch/Transformers! 🎉 Ready to take this top performer in the WMT24 Metrics Shared Task for a spin? 🔗 Code: https://t.co/dakbwDDBhx

Dan Deutsch @_danieldeutsch

over 1 year ago

Super simple and effective way of significantly increasing the performance of your evaluation metric!

Mara Finkelstein @marafinkels

over 1 year ago

LLMs are typically evaluated w/ automatic metrics on standard test sets, but metrics + test sets are developed independently. This raises a crucial question: Can we design automatic metrics specifically to excel on the test sets we prioritize? Answer: Yes! https://t.co/EeJvoXHn0w

Dan Deutsch @_danieldeutsch

over 1 year ago

@psingh522 Unfortunately this role requires that you are enrolled in a PhD program. But there are plenty of roles at Google for Master's students that you can find on the Google Careers page https://t.co/NjJrvQRsdL

Dan Deutsch @_danieldeutsch

over 1 year ago

New application link! https://t.co/tujlYYz3OL I am at EMNLP/WMT this week. Please come find me if you want to learn more about this role!

Dan Deutsch @_danieldeutsch

over 1 year ago

Interested in doing research on Google Translate and Gemini? Good news! I’m hiring for full-time roles on the Google Translate Research Team! Apply here: https://t.co/RCojsAMYFD
