Happy to share that our work on evaluating tokenizers has been accepted to #ACL2025! This is one of two projects I worked on during my internship @Apple.
The Microsoft Translator group is looking for a Ph.D. student intern this summer to work with us in Redmond on machine translation. The application link can be found at https://t.co/sV0ZTS6Do1. Please share if you know of someone who might be interested!
I'm very excited to share this great example of applied research:
https://t.co/5SgiJ9Legb
Research (https://t.co/J3w4JnOoFf with data https://t.co/NtkfbiQj8N) applied in our product (https://t.co/eJ9Jopl2yR)
Congrats to the team!
Have you recently used COMET for MT evaluation? ☄️
- Did you report the specific model? ≥12% of papers don't!
- Did you report the package version? Makes a difference.
- `pip install sacrecomet` generates a nice version+model signature. Not too late for WMT/EMNLP camera-ready!
A new 🆕 approach from @Google mimics the human translation process, incorporating research, 🧐 drafting, ✍️ and iterative 🔄 refinement in the #LLM translation process and promises better #translation quality.
#xl8#t9n@ebriakou@ColinCherry@markuseful
https://t.co/HyM9Ykj9wH
Finishing up the #iOS18 update of @StudySnacksApp by adding support for the new Translation API! 🌐
The app will detect if you create a language learning list and translate the vocabularies for you automatically while you type. 🥳
#buildinpublic#iosdev
🥳 LLMs are changing the game, even for datasets! NewsPaLM, a publicly released LLM-generated dataset, outperforms larger web-crawled corpora for MT. It includes sentence & paragraph-level, MBR-decoded data. See paper for more, incl. LLM self-distillation. https://t.co/iqtiGD2gE1
I will soon be hiring PhDs/postdocs for a new university lab in Germany! If you have strong CL/NLP skills and are interested in modeling implicit/underspecified language, misunderstandings, or commonsense vs. individual background knowledge, let's connect. Please DM me or RT! :)
📢 WMT24 Metric Shared Task Starts NOW! 🎉
Calling all researchers and metrics enthusiasts! We invite you to submit your best evaluation metrics to the WMT Metric Shared Task. Assign quality scores to all WMT submissions and translations by top LLMs like Gemini, GPT4o, Claude, LLambda, and more. Let's push the boundaries of MT evaluation together! #WMT #MachineTranslation #NLProc
All details including information how to download the data:
https://t.co/RlZHYmWK3J
Deadline for submissions is 12:00 pm July 31 (AoE)
Let's go! :-)
🚀 How accurately can automatic metrics evaluate the best MT systems and cutting-edge LLM translations? 🌟
🔥 WMT24 Metrics Shared Task Alert: This year's challenge is all about pushing the boundaries of automatic metrics! 🎯 Your mission: assign quality scores to all WMT submissions and translations by top LLMs like GPT, Gemini, Claude, LLambda 3, and more. 💥 Are you ready to revolutionize the field? Let's do this! 🚀 #NLP #WMT #LLMs #MachineTranslation #EMNLP
All details here:
https://t.co/OvPxODv8X9 (1/n)
Big achievement and an important milestone! 🏆
Happy to see that my wife’s work on the success rate of intensive #aphasia therapy has been finally published in the Journal of Neurology! 🧠
Congratulations! 🚀
Excuse me internet, we now have a German adaptation of https://t.co/ykaNxvcdcO thanks to Lena Werner, @DoroPeitz and Katja Hußmann 🇩🇪🥳
This provides a free option to *self manage* therapy tasks for German speakers with #aphasia
Help us break LLMs! The test suite sub-task will be included for the sixth time in the General MT Shared Task of the Conference on Machine Translation (WMT24). This year's theme is to reveal weaknesses of LLMs when translating #LLMs#EMNLP#NLProc| https://t.co/FwFpXfnIv0
We're looking for a final-year PhD intern passionate about working on automatic metrics for machine translation and NLP in Mountain View. If interested, please send an email to me and @_danieldeutsch.
The ideal candidate should have experience working on automatic evaluation and be familiar with metrics like Metric-X, COMET, BLEURT and human evaluation frameworks like MQM.