Seth Aycock

@sethjsa

NLP PhD student in Low-resource Translation at @AmsterdamNLP @ltl_uva / Prev @InriaParisNLP / @InfAtEd / @Cambridge_Uni

Amsterdam, The Netherlands

Joined September 2015

597 Following

141 Followers

25 Posts

Pinned Tweet

Seth Aycock @sethjsa

over 1 year ago

Our work “Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?” is now on arXiv! https://t.co/cRMX6fwEPg - in collaboration with @davidstap, @diwuNLP, @c_monz, and Khalil Sima'an from @illc_amsterdam and @ltl_uva 🧵

3

127

22

57

18K

Seth Aycock @sethjsa

9 months ago

MT Marathon this year organised by @HelsinkiNLP was a great week - I presented my research on chain-of-thought for machine translation, worked on a mini-research project, and explored the wonderful city of Helsinki including a few trips to the sauna 🫠

0

2

0

0

78

sethjsa retweeted

9 months ago

👉 https://t.co/4AzXeqlIxu New study finds that prompt engineering has limits 🚫 in #AI #translation. If a large language model has not learned the task, no prompt will make it perform better. #LLMs #LLM #xl8 #t9n @CharlesUniPRG @JohnsHopkins @LMU_Muenchen @PUT_Poznan @PSchmidtova @BafnaNiyati @sethjsa @AmsterdamNLP @ltl_uva @zouharvi

0

6

2

0

216

Seth Aycock @sethjsa

about 1 year ago

Our paper was accepted at ICLR 2025 as a Spotlight! I will present our poster on Saturday April 26, 3-5pm, Poster #241. See you there! https://t.co/cRMX6fxcEO

1

10

0

0

208

sethjsa retweeted

LTL-UvA @ltl_uva

over 1 year ago

LTL News: Happy to announce that Seth's paper got accepted by ICLR (spotlight) 🥳@sethjsa Paper Link: https://t.co/yjjUCL24YS

0

6

2

0

225

Seth Aycock @sethjsa

over 1 year ago

@SimonHiaubeng @ElliotMurphy91 The principle Maximise Minimal Means is part of one version of minimalist theory. But it's not UG - it's a third factor, domain-general constraint

0

1

0

1

79

Seth Aycock @sethjsa

over 1 year ago

@Linguist_UR @ElliotMurphy91 Merge, maybe Agree, maybe Labeling. Though I believe there's work ongoing to attribute Merge itself to third factor, domain-general constraints

0

1

0

0

192

Seth Aycock @sethjsa

over 1 year ago

@RaphaelMerx I'm a fan of this paper! We'd expect exactly the same for Kalamang (if we could collect an OOD test set). In the appendix we show too that the 100-example test set consists of short, easy sentences so a ChrF++ of ~30 is really not that proficient

0

1

0

0

57

Seth Aycock @sethjsa

over 1 year ago

@ylecun https://t.co/c5KuqFGlye And do not confuse learning from grammar with learning from parallel sentences!

0

1

0

0

70

sethjsa retweeted

over 1 year ago

We show that a grammar book provides little or even no help for translation in LLMs, questioning the recent "truly zero-shot translation" --- no data no gain, still 🧐

0

8

1

0

673

Seth Aycock @sethjsa

over 1 year ago

@JeffDean (Plus, Kalamang parallel data has been online since November 2020!)

0

1

0

0

23

Seth Aycock @sethjsa

over 1 year ago

@JeffDean https://t.co/K5MN4xGEA9 Actually we find LLMs learn most/all translation ability from parallel sentences in the book, not the grammar. And we can predict translation performance just from prompts' test set vocab coverage! But we do find that grammar can help *linguistic* tasks

1

1

1

0

220

Seth Aycock @sethjsa

over 1 year ago

@jxmnop https://t.co/1OC5FtiTVb It turns out LLMs learn most or all translation ability from parallel sentences in the book, not the grammar. And fine-tuning a small translation model matches or beats long-context LLM results! (plus Kalamang parallel data has been online since Nov 2020)

0

3

0

0

71

Seth Aycock @sethjsa

over 1 year ago

More generally, we suggest that data collection efforts for multilingual XLR tasks like translation are best focused on parallel data over linguistic description, given the advantages in computational cost, token efficiency, and availability!

0

6

0

0

332

Seth Aycock @sethjsa

over 1 year ago

Our results emphasise the importance of task-appropriate data for XLR languages: parallel data for translation, and grammatical data for linguistic tasks.

1

4

0

0

345
