Our @Cardiff_NLP team and collaborators have been releasing amazing NLP models and datasets over the last few years. We've now organised the content as @huggingface collections 🤗
A short thread summarising some of these NLP resources, in particular related to social media! 🧵
This was an interesting and fun project, led by @joseba_fdl.
The findings are quite intriguing as well, and probably not what one would immediately expect! 👇
🚨 New paper alert!
🇯🇵🤔 Why are all LLMs Obsessed with Japanese Culture? On the Hidden Cultural and Regional Biases of LLMs 🇯🇵🤔
👉 We find that LLMs show strong regional preferences, often disproportionately favoring certain countries, such as Japan. 🇯🇵
Everyone thinks AI is culturally biased towards the West.
But this new study finds evidence that LLMs show a clear cultural bias towards countries like Japan.
Link in the replies below:
@PHChess @GMJacobAagaard Hopefully we can improve these models soon or at least they become more transparent and self-aware of their own limitations - currently working on this but definitely not easy!
Reference for anyone interested: https://t.co/BKfBSpy1Dn
@PHChess @GMJacobAagaard And this was an easy case where all the data was provided to the LLMs and they were asked to perform very basic counting operations.
In the draw analysis case, the model also had to collect the data itself, etc. which may introduce further errors.
See you tomorrow at the WASSA workshop! 👋
I will be talking about social media from an interdisciplinary perspective, and lessons learned on how we NLP'ers can contribute to other disciplines (e.g. social sciences) more effectively.
⏳ Only two days until #WASSA2026 on March 29 at #EACL2026!
📜 The program is now available. See the full schedule, including the invited talk by @CamachoCollados:
“Social Media Analysis in the Language Model Era: An Interdisciplinary Perspective”
🔗 https://t.co/fHTjCBRHu8
@petergostev This is very cool, thanks for sharing! 🙏
May I ask which LLM you used to co-create the questions? Sorry if the info is there somewhere, couldn't find it!
Pleased to announce the 5th Cardiff #NLProc Workshop!
📍Cardiff 📅 22–23 June 2026
🌐 More information (to be regularly updated): https://t.co/1NDGa2Xpb0
👉 If you’re interested in attending, please complete the EoI by 11 April: https://t.co/wyWg6Fkhw5
Registration is ✨free✨ 1/
@DaveFuertes Indeed, I hadn't even been to Wales in 2017 😅 (arrived for the first time in 2018 and transferred my federation in 2022 if I recall correctly)
3️⃣ AI and Data Science for Electoral Integrity and Democratic Governance (Federico Liberatore)
4️⃣ Designing Benchmarks that Reflect Real-World NLP Use (@feralvam)
@Cardiff_NLP is hiring PhD students!
🇬🇧🇪🇺 Full PhD scholarship open to UK/EU students
⏲️ Application deadline: February 13th
List of projects and supervisors below (find them in FindAPhD for more details) 👇
1️⃣ Interpretability-Guided Compression and Acceleration of Large Language Models (@tpilehvar)
2️⃣ Calibrated Multimodal Verification to Prevent Hallucination in Robot Planning and Manipulation (@jodieyzhou)
Some good suggestions on how to make your research useful for others (accessible/memorable/usable). For me, this has always been a major driver of how I approach research.
✅ As a bonus, this will likely increase your citation count!
Our latest work was featured in The Guardian, both in the tech and comedy sections! 🤣
LLMs are good at recognising puns, but they often think everything is a pun, as long as it looks like one. They just try too hard, and the reasons provided are often deep and hilarious! 👇