James Michaelov @jamichaelov - Twitter Profile

2 months ago

Had a great first day at #HSP2026 yesterday! Looking forward to presenting on the relationship between reading time, n-grams, and language model scaling at the 12.10-2pm poster session today!

James Michaelov @jamichaelov

6 months ago

Presenting this at the poster session this morning (11-2pm) at #5109

James Michaelov @jamichaelov

6 months ago

Excited to announce that I’ll be presenting a paper at #NeurIPS this year! Reach out if you’re interested in chatting about LM training dynamics, architectural differences, shortcuts/heuristics, or anything at the CogSci/NLP/AI interface in general! #Neurips2025

James Michaelov @jamichaelov

6 months ago

Looking forward to #NeurIPS25 this week 🏝️! I'll be presenting at Poster Session 3 (11-2 on Thursday). Feel free to reach out!

James Michaelov @jamichaelov

6 months ago

I'll also be presenting this paper with @linguist_cat at #CogInterp! https://t.co/oHJLl16syg

Catherine Arnett @linguist_cat

6 months ago

@jamichaelov and I will be presenting our paper at the @CogInterp workshop 13:15 - 14:45 on Dec 7th. We show how disaggregating grammatical benchmarks over the course of training reveals stages of training where models learn heuristics before learning more generalizable patterns.

James Michaelov @jamichaelov

6 months ago

Preprint link: https://t.co/AtMeWfA9eA

James Michaelov @jamichaelov

12 months ago

See the full paper here: https://t.co/F8lMhSomHP

James Michaelov @jamichaelov

12 months ago

New paper accepted at Findings of ACL! TL;DR: While language models generally predict sentences describing possible events to have a higher probability than impossible (animacy-violating) ones, this is not robust for generally unlikely events + is impacted by semantic relatedness

James Michaelov @jamichaelov

12 months ago

In the most extreme case, LMs assign sentences such as ‘the car was given a parking ticket by the explorer’ (unlikely but possible event) a lower probability than ‘the car was given a parking ticket by the brake’ (impossible event, related final word) over half of the time.

James Michaelov @jamichaelov

about 1 year ago

Excited to share the second paper of this research project!

Catherine Arnett @linguist_cat

about 1 year ago

✨New pre-print✨ Crosslingual transfer allows models to leverage their representations for one language to improve performance on another language. We characterize the acquisition of shared representations in order to better understand how and when crosslingual transfer happens.

James Michaelov @jamichaelov

over 1 year ago

Also generally interested in chatting about cognitive modeling, scaling, and language comprehension/understanding in humans and machines! @COLM_conf #COLM2024

James Michaelov @jamichaelov

over 1 year ago

Excited to present this at COLM this week! Reach out if you want to meet/chat!

James Michaelov @jamichaelov

about 2 years ago

New preprint with @linguist_cat and Ben Bergen! We’ve all heard of the new wave of recurrent language models, but how good are they for modeling human language comprehension? Quite good, it turns out! 🧵 https://t.co/ADtxfcDVBb

James Michaelov @jamichaelov

almost 2 years ago

This paper is now accepted to be presented at @COLM_conf! Updated version is on arXiv. Feeling excited for the conference, let me know if you want to meet!

James Michaelov @jamichaelov

about 2 years ago

@linguist_cat And the current wave of recurrent architectures has just started! As we see more and more new architectures and developments, it will be interesting to see how they compare. One thing does seem clear though: recurrent models are back with a vengeance!

James Michaelov @jamichaelov

about 2 years ago

@linguist_cat With reading time, the results are more variable between experiments, and this seems like it might be related to the difference in stimuli (see paper for more details)

James Michaelov @jamichaelov

about 2 years ago

Exciting to see our paper (with @MeganBardolph, Cyma K. Van Petten, Benjamin K. Bergen, and @CoulsonSeana) 'in print' at @jneurolang!

Ev (like in 'evidence', not Eve) Fedorenko 🇺🇦 @ev_fedorenko

about 2 years ago

5️⃣Michaelov et al. find surprisal explains N400s to sentence-final words varying in predictability, plausibility, and relation to the likely completion better than sem. similarity. The results support lexical predictive coding accounts. https://t.co/EnuOJeNO9O @jamichaelov 7/n

James Michaelov @jamichaelov

about 2 years ago

This is concerning, and I wouldn't be surprised if it leads to some students having to withdraw their papers from the conference

Eve Fleisig @enfleisig

about 2 years ago

NAACL 2024 seems to charge $750 for students to register if they're a presenter (every paper requires at least one registered presenter). @naacl am I reading this right? Seems like a major burden on students, especially if (as is common) only a paper's student authors attend.

James Michaelov @jamichaelov

about 2 years ago

Really enjoyed the @babyLMchallenge talks and posters hosted by @conll_conf/@CMCL_NLP at @emnlpmeeting last year! Looking forward to seeing what people come up with this time round!

babyLM @babyLMchallenge

about 2 years ago

👶 BabyLM Challenge is back! Can you improve pretraining with a small data budget? BabyLMs for better LLMs & for understanding how humans learn from 100M words. New: how vision affects learning; bring your own data; paper track. https://t.co/uU12YWwLTe 🧵
