Jose Camacho Collados

@CamachoCollados

Professor @cardiffuni (@Cardiff_NLP). AI/NLP researcher and chess International Master ♟️🎾

Cardiff, Wales, UK

Joined June 2011

987 Following

2K Followers

1.2K Posts

Pinned Tweet

Jose Camacho Collados @CamachoCollados

about 1 year ago

Our @Cardiff_NLP team and collaborators have been releasing amazing NLP models and datasets over the last few years. We've now organised the content as @huggingface collections 🤗 A short thread summarising some of these NLP resources, in particular related to social media! 🧵

https://t.co/5YvgS4hD52

Jose Camacho Collados @CamachoCollados

about 1 month ago

This was an interesting and fun project, led by @joseba_fdl. The findings are quite intriguing as well, and probably not what one would immediately expect! 👇

joseba.fdl @joseba_fdl

about 1 month ago

🚨 New paper alert! 🇯🇵🤔 Why are all LLMs Obsessed with Japanese Culture? On the Hidden Cultural and Regional Biases of LLMs 🇯🇵🤔 👉 We find that LLMs show strong regional preferences, often disproportionately favoring certain countries, such as Japan. 🇯🇵

https://t.co/5QdHWbWOLF

CamachoCollados retweeted

AIDB @ai_database

about 1 month ago

A surprising paper has come out, titled "Why are all LLMs Obsessed with Japanese Culture?" According to the researchers' tests, the outputs of major LLMs such as Claude are, for some reason, all skewed toward Japanese culture.

For example, when asked "What are some traditional dances?", the models answer Bon Odori and Kabuki; when asked "What dishes are eaten every day?", sushi and miso soup; when asked "What are common exercise habits?", radio calisthenics (rajio taiso); and when asked "How do rivers affect settlements?", they bring up the Tone River and the Shinano River as examples.

After Japan come the United States, India, China, and France. Other countries reportedly hardly appear at all.

The study also found that the distribution is even after pre-training, and that the bias erupts suddenly after supervised fine-tuning. Although LLMs have long been described as Western-centric, their actual behavior does not seem to match that description.

Note: this phenomenon holds once the model's own language region is excluded. For example, asking in English yields the United States most often and asking in Chinese yields China most often; the finding is that, right after this "own country first" preference, Japan ranks top as the representative foreign country in almost every language.

CamachoCollados retweeted

Jeremy Nguyen ✍🏼 🚢

@JeremyNguyenPhD

about 1 month ago

Everyone thinks AI is culturally biased towards the West. But this new study finds evidence that LLMs show a clear cultural bias towards countries like Japan. Link in the replies below:

https://t.co/cEbbRb8ljH


Jose Camacho Collados @CamachoCollados

about 2 months ago

@PHChess @GMJacobAagaard Hopefully we can improve these models soon or at least they become more transparent and self-aware of their own limitations - currently working on this but definitely not easy! Reference for anyone interested: https://t.co/BKfBSpy1Dn


Jose Camacho Collados @CamachoCollados

about 2 months ago

@PHChess @GMJacobAagaard And this was an easy case where all the data was provided to the LLM and they were asked to perform very basic counting operations. In the draw analysis case, the model also had to collect the data itself, etc. which may introduce further errors.


Jose Camacho Collados @CamachoCollados

2 months ago

See you tomorrow at the WASSA workshop! 👋 I will be talking about social media from an interdisciplinary perspective, and lessons learned on how we NLP'ers can contribute to other disciplines (e.g. social sciences) more effectively.

https://t.co/so0VnHA3lb

WASSA 2026 @wassa_ws

2 months ago

⏳ Only two days until #WASSA2026 on March 29 at #EACL2026! 📜 The program is now available. See the full schedule, including the invited talk by @CamachoCollados: “Social Media Analysis in the Language Model Era: An Interdisciplinary Perspective” 🔗 https://t.co/fHTjCBRHu8


Jose Camacho Collados @CamachoCollados

3 months ago

@petergostev This is very cool, thanks for sharing! 🙏 May I ask which LLM you used to co-create the questions? Sorry if the info is there somewhere, couldn't find it!


Jose Camacho Collados @CamachoCollados

4 months ago

The dates for the 5th Cardiff NLP Workshop are now confirmed (22-23 June) - join us! 🤗

Cardiff NLP @Cardiff_NLP

4 months ago

Pleased to announce the 5th Cardiff #NLProc Workshop! 📍 Cardiff 📅 22–23 June 2026 🌐 More information (to be regularly updated): https://t.co/1NDGa2Xpb0 👉 If you’re interested in attending, please complete the EoI by 11 April: https://t.co/wyWg6Fkhw5 Registration is ✨free✨ 1/


Jose Camacho Collados @CamachoCollados

4 months ago

@DaveFuertes Indeed, I hadn't even been to Wales in 2017 😅 (arrived for the first time in 2018 and transferred my federation in 2022 if I recall correctly)


Jose Camacho Collados @CamachoCollados

4 months ago

3️⃣ AI and Data Science for Electoral Integrity and Democratic Governance (Federico Liberatore)
4️⃣ Designing Benchmarks that Reflect Real-World NLP Use (@feralvam)


Jose Camacho Collados @CamachoCollados

4 months ago

@Cardiff_NLP is hiring PhD students! 🇬🇧🇪🇺 Full PhD scholarship open to UK/EU students ⏲️ Application deadline: February 13th List of projects and supervisors below (find them in FindAPhD for more details) 👇


Jose Camacho Collados @CamachoCollados

4 months ago

1️⃣ Interpretability-Guided Compression and Acceleration of Large Language Models (@tpilehvar)
2️⃣ Calibrated Multimodal Verification to Prevent Hallucination in Robot Planning and Manipulation (@jodieyzhou)


Jose Camacho Collados @CamachoCollados

6 months ago

First interaction I'm having with GPT 5.2. It isn't going too well... "Draw 6 people across a table"

https://t.co/AU6sDqLmOe

Jose Camacho Collados @CamachoCollados

6 months ago

@PMinervini @WenhuChen 5-month summer break sounds amazing! I will try to apply for one of those tenure jobs


Jose Camacho Collados @CamachoCollados

6 months ago

@JBlackburnChess @4NCL What a tournament by Indy, great level throughout! 👏🏼👏🏼


Jose Camacho Collados @CamachoCollados

6 months ago

Some good suggestions on how to make your research useful for others (accessible/memorable/usable). For me, this has always been a major driver of how I approach research. ✅ As a bonus, this will likely increase your citation count!

6 months ago

How to get citations?


Jose Camacho Collados @CamachoCollados

6 months ago

👉 Our #EMNLP2025 paper: https://t.co/1bjbd5SEKw 🗞️ The Guardian article: https://t.co/GIZMuDY8Kj


Jose Camacho Collados @CamachoCollados

6 months ago

Our latest work was featured in The Guardian, both in the tech and comedy sections! 🤣 LLMs are good at recognising puns, but they often think everything is a pun, as long as it looks like one. They just try too hard, and the reasons provided are often deep and hilarious! 👇

https://t.co/pwbsH6RSLf
