You want to train BERT/RoBERTa for your language in a day or two and you have only 1 GPU. Come to DeepLo workshop, #emnlp2019 to learn more about our work on transferring entire English pretrained models to foreign languages.
1/4 #ACL2024 Excited to share our new paper on the impact of fine-tuning on the qualitative advantages of LLMs in machine translation! 🤖 Our work highlights the importance of preserving LLM capabilities during fine-tuning.
https://t.co/9XknbQGNye
Does Vocabulary Selection cause human perceived translation quality degradations not visible in BLEU? Yes! Find out more in our #NAACL2022 paper:
https://t.co/B2Gk5lvEJM
Code: https://t.co/vNaaLVGvRv
joint work with @EvaHasler@sonytrenous@ketran@hifelix84@unattributed
I'm recruiting for 3 Phd + 2 post-doc positions in #NLProc on multilingual neural machine translation at the University of Amsterdam. Apply by Feb 13.
PhD positions: https://t.co/3EZNfxQZR5
Post-doc positions: https://t.co/WqLRmCIfsZ
For questions DM or email me.
@AmsterdamNLP
I'm looking for a bright and enthusiastic student that will join me and the @GroNlp group, to design more Interpretable Neural MT models! 4-year salaried PhD position in beautiful Groningen (northern Netherlands), deadline 22 March https://t.co/Y9mDkomMeG #NLProc@univgroningen
CALL FOR TASKS CAPTURING LIMITATIONS OF LARGE LANGUAGE MODELS
We are soliciting contributions of tasks to a *collaborative* benchmark designed to measure and extrapolate the capabilities and limitations of large language models. Submit tasks at https://t.co/eJJXFtqPpi
#BIGbench
@Wietsedv@GroNlp@MalvinaNissim nice to see smart initialization also works for GPT-2. We did the same for BERT/RoBERTa and managed to train foreign models on one GPU within a day https://t.co/V98Wi1ji7Y
Code: https://t.co/d7iq6TzGKg
I am extremely excited and proud to share with you the ELLIS PhD program. To students around the world: please apply. This will hopefully become one of the most competitive PhD programs in the world. https://t.co/ZMYs4RuWWi
Me: In this work, we bake a bread.
Reviewer 2a: Reject! Snorting coffee via ears is not novel.
Reviewer 2b: Wearing mask is not a good motivation for using ears. Also you can't snort two lines at the same time. Useless!
Me: ... 🤦♂️
We're looking for a talented PhD student to work on the Responsible Processing of Text Data, at the intersection of #NLP and #privacy, supervised by @turkmenf@brtvrh and myself at @univgroningen. Details here: https://t.co/yKz5JzMJc1 Application deadline: April 1st!
You want to train BERT/RoBERTa for your language in a day or two and you have only 1 GPU. Come to DeepLo workshop, #emnlp2019 to learn more about our work on transferring entire English pretrained models to foreign languages.
@tallinzen@jaaanaru@tyrell_turing we had a paper with a similar idea of separating syntax/semantics in the encoder and shared the attention between syntax and semantic representation. We applied it for machine translation thought https://t.co/wXCrcgX3PA