Wanna check how well a model can share knowledge between languages? Of course you do! 🤩
But can you do it without access to the model’s weights? Now you can with ECLeKTic 🤯
📢#SIGTYP2026 has AMAZING Keynote talks:
🔥Jennifer Culbertson: Language Universals in Individual Minds: Experimental Tests of Linguistic Hypotheses
🔥Terry Regier: Boas, Shannon, and the Origin of Semantic Categories
Join us at #EACL2026: https://t.co/arAKEcXABq
#EACL#NLPRoc
Four years ago, NLLB set a milestone with MT for 200 languages. Today we present OMT: a family of models that extend support to 1600 languages while delivering competitive results in high/mid-resource language, with our 1B-8B models matching frontier and open 70B LLMs.
🧵(1/n)
📢I'm organizing a BoF session at #EACL2026 called Tokenization & Beyond, aiming to gather researchers exploring tokenization and alternatives such as byte-level and pixel-based approaches. Sign up using the form if you're interested! #NLProc@eaclmeeting
New paper:
We are often told that reasoning tokens aren't faithful explanations. But to have a useful metaphor for their operation we need a characterization of what they are, not what they are not.
To that end, we suggest "State over Tokens" (SoT) 👇🧵
🧑🔬I’m recruiting PhD students in Natural Language Processing @UniLeipzig Computer Science, together with @Sca_DS!
Topics include, but aren’t limited to:
🔎Linguistic Interpretability
🌍Multilingual Evaluation
📖Computational Typology
Please share!
#NLProc#NLP
Producing reasoning texts boosts the capabilities of AI models, but do we humans correctly understand these texts? Our latest research suggests that we do not.
This highlights a new angle on the "Are they transparent?" debate: they might be, but we misinterpret them. 🧵
First contributed talks are under way for Phonology, Morphology, and Syntax! Hall M1 Level 1.
Schedule here: https://t.co/c3BDfhP605
#ACL2025NLP#CoNLL2025
🚨 New paper alert! 🚨
We propose an IQ Test for LLMs — a new way to evaluate models that goes beyond benchmarks and uncovers their core skills.
Think: 🧠🤖 psychometrics for LLMs.
👇
(1/6)
Wanna check how well a model can share knowledge between languages? Of course you do! 🤩
But can you do it without access to the model’s weights? Now you can with ECLeKTic 🤯
On my way to #ACL2025 ! 🤗
Find me talking about crosslingual transfer at the Google booth, morphology - at @conll_conf , and tokenization - just at the coffee breaks
🚨 RAG is a popular approach but what happens when the retrieved sources provide conflicting information?🤔
We're excited to introduce our paper:
“DRAGged into CONFLICTS: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs”🚀
A thread 🧵👇
🤯 MIND-BLOWN! A new paper just SHATTERED everything we thought we knew about AI reasoning!
This is paradigm-shifting. A MUST-READ. Full breakdown below 👇
🧵 1/23
I really wanted to see the review details. It's clearly above the acceptance threshold of findings for me. When you fall into the cycle of rejection from ARR, it's hard to come out.