Andy J Yang @pentagonalize - Twitter Profile

19 days ago

Given black-box access to a Transformer's output, can we efficiently recover its parameters? We analyse the learnability of attention-based models with query access in our new work. Accepted at #ICML2026 🎉 Work done with @shahkulin98, @mhahn29 and Varun Kanade. 🧵

satwik1729's tweet photo. Given black-box access to a Transformer's output, can we efficiently recover its parameters?

We analyse the learnability of attention-based models with query access in our new work. Accepted at #ICML2026 🎉

Work done with @shahkulin98, @mhahn29 and Varun Kanade.

🧵 https://t.co/FdSBn3cJK2

8

161

23

117

27K

pentagonalize retweeted

Gašper Beguš

@begusgasper

about 1 month ago

Can syntax begin to emerge from speech alone? In our new paper, we show that unsupervised neural networks trained only on individual spoken words can spontaneously generate two- and even three-word sequences — without ever seeing multi-word examples during training. We call this phenomenon spontaneous concatenation. It can help model an early step in both language acquisition and the evolution of syntax. We also propose a possible neural mechanism behind this behavior: disinhibition, which offers a pathway from raw speech representations toward compositional structure. Using AI interpretability techniques, we can begin to identify neural mechanisms behind operations that resemble basic symbolic processes such as concatenation — a precursor to operations like Merge.

begusgasper's tweet photo. Can syntax begin to emerge from speech alone?

In our new paper, we show that unsupervised neural networks trained only on individual spoken words can spontaneously generate two- and even three-word sequences — without ever seeing multi-word examples during training.

We call this phenomenon spontaneous concatenation. It can help model an early step in both language acquisition and the evolution of syntax.

We also propose a possible neural mechanism behind this behavior: disinhibition, which offers a pathway from raw speech representations toward compositional structure.

Using AI interpretability techniques, we can begin to identify neural mechanisms behind operations that resemble basic symbolic processes such as concatenation — a precursor to operations like Merge.

6

113

21

73

12K

pentagonalize retweeted

Tom McCoy @RTomMcCoy

about 1 month ago

Now out in TiCS: "Whither symbols?" Neural networks can now do many things long thought to require symbols. What does this mean for the role of symbols in CogSci? Read the paper for our answer!

0

35

6

7

3K

pentagonalize retweeted

Aleksandra Bakalova @abakalova13175

about 1 month ago

Will be presenting this at the Latent & Implicit Thinking Workshop (https://t.co/8cytLakurb) at #ICLR2026 Come by our poster! Always happy to chat :)

0

30

9

13

4K

Who to follow

r i c h 💣

@imm_richh

時々の折り紙・描き・Arcaea (13.09) レゼ💣とこいし👒は好きですよ ⚠️ 多めのRTs. BSky: https://t.co/per2tYSX6s Insta: hoangminh117 Booth: https://t.co/jfzryJlKSV

Allen Su

@AllenSu15828937

Views are entirely my own. likes, comments, follows ≠ endorsements.

Xiao Yao

@XiaoYao707

折纸爱好者（origami player）。尝试设计，染色，绘图中[email protected]

pentagonalize retweeted

breandan

@breandan

about 1 month ago

New work! Introduces a parallel RASP variant highly suited for SIMD architectures. I implement a VM bytecode and lower a heapless array language onto it, demonstrating significant speedups over serial evaluation on a massive multitenancy benchmark with millions of concurrent VMs.

1

9

2

3

4K

pentagonalize retweeted

Pete Shaw @ptshaw2

about 2 months ago

I will be presenting this paper at ICLR next week! 🇧🇷 Come chat about Kolmogorov complexity, the MDL principle, and what this all means for training better models! 🧵

3

104

12

55

10K

pentagonalize retweeted

Michael Hahn @mhahn29

2 months ago

We have 1-2 more extra spots due to new funding -- apply by end of April!

5

161

31

102

28K

Andy J Yang @pentagonalize

2 months ago

Last day to apply for financial support! Find the registration and financial support forms here: https://t.co/FRJlkLr4oH

Andy J Yang @pentagonalize

4 months ago

📣 FLaNN 2026 at Yale 🍮 Invited talks+posters (non-archival): expressivity, computation, and learning in neural nets/LLMs Speakers: Pablo Barceló, David Chiang, Will Merrill, Naomi Saphra, Gail Weiss Abstracts due Feb 12, 2026 Details: https://t.co/AzgF1TMyOS

pentagonalize's tweet photo. 📣 FLaNN 2026 at Yale 🍮

Invited talks+posters (non-archival): expressivity, computation, and learning in neural nets/LLMs

Speakers: Pablo Barceló, David Chiang, Will Merrill, Naomi Saphra, Gail Weiss

Abstracts due Feb 12, 2026
Details: https://t.co/AzgF1TMyOS https://t.co/XNpk53Burb

1

9

5

2

5K

0

2

788

pentagonalize retweeted

Satwik Bhattamishra @satwik1729

2 months ago

Given access to a language model, can we extract an interpretable object like a DFA that captures which strings a language model is likely to generate? Our new work on automata learning theory studies this question. To be presented at ##ICLR2026 🎉

satwik1729's tweet photo. Given access to a language model, can we extract an interpretable object like a DFA that captures which strings a language model is likely to generate?

Our new work on automata learning theory studies this question. To be presented at ##ICLR2026 🎉 https://t.co/bh58Tu5Gme

1

85

16

48

11K

pentagonalize retweeted

Yash Sarrof @yashYRS

2 months ago

Most work on Transformer length generalization assumes a fixed vocabulary. But in real tasks, longer inputs may have new symbols (e.g. more objects in planning). Our new paper introduces C-RASP* to study this and explains the inconsistent performance of Transformers in planning.

yashYRS's tweet photo. Most work on Transformer length generalization assumes a fixed vocabulary. But in real tasks, longer inputs may have new symbols (e.g. more objects in planning). Our new paper introduces C-RASP* to study this and explains the inconsistent performance of Transformers in planning. https://t.co/KV6pVqYfhU

1

54

12

29

9K

Andy J Yang @pentagonalize

3 months ago

📍 New Haven, CT, USA 📅 May 11-13, 2026 🌐 https://t.co/AzgF1TMyOS 📩 [email protected]

0

66

Andy J Yang @pentagonalize

3 months ago

A reminder to register for the FLaNN workshop! The financial support application is now open to all attendees, not limited to graduate students. Find the registration and financial support forms here: https://t.co/FRJlkLr4oH See you there!

Andy J Yang @pentagonalize

4 months ago

📣 FLaNN 2026 at Yale 🍮 Invited talks+posters (non-archival): expressivity, computation, and learning in neural nets/LLMs Speakers: Pablo Barceló, David Chiang, Will Merrill, Naomi Saphra, Gail Weiss Abstracts due Feb 12, 2026 Details: https://t.co/AzgF1TMyOS

1

9

5

2

5K

1

5

3

0

997

pentagonalize retweeted

Aleksandra Bakalova @abakalova13175

3 months ago

Can we rewrite Transformers as a human-readable code? In this paper, we decompile Transformers trained on algorithmic and formal language tasks into D-RASP – a programming language that mirrors Transformer architecture. 🧵

abakalova13175's tweet photo. Can we rewrite Transformers as a human-readable code?

In this paper, we decompile Transformers trained on algorithmic and formal language tasks into D-RASP – a programming language that mirrors Transformer architecture. 🧵 https://t.co/hRboPQwUFM

2

236

39

182

28K

Andy J Yang @pentagonalize

4 months ago

We also have (limited) financial support available on a need basis for graduate students who are not able to attend otherwise.

0

2

0

70

Andy J Yang @pentagonalize

4 months ago

The FLaNN Workshop submission deadline has been extended to Feb 19! Invited talks + posters (non-archival): expressivity, computation, and learning in neural nets/LLMs. Previous work welcome. Graduate students encouraged to submit! 📍 Yale University 🗓️ May 11-13, 2026

Andy J Yang @pentagonalize

4 months ago

📣 FLaNN 2026 at Yale 🍮 Invited talks+posters (non-archival): expressivity, computation, and learning in neural nets/LLMs Speakers: Pablo Barceló, David Chiang, Will Merrill, Naomi Saphra, Gail Weiss Abstracts due Feb 12, 2026 Details: https://t.co/AzgF1TMyOS

1

9

5

2

5K

1

10

7

2

1K

Andy J Yang @pentagonalize

4 months ago

Call for Submissions: https://t.co/tSof6Ajvbp Registration: https://t.co/FRJlkLrCef Contact: [email protected]

1

2

1

0

106

Andy J Yang @pentagonalize

4 months ago

We welcome posters on the formal expressivity, computational properties, and learning behavior of neural nets (incl. LLMs). Graduate students are especially encouraged to submit! Contact: [email protected]

0

147

Andy J Yang @pentagonalize

4 months ago

📣 FLaNN 2026 at Yale 🍮 Invited talks+posters (non-archival): expressivity, computation, and learning in neural nets/LLMs Speakers: Pablo Barceló, David Chiang, Will Merrill, Naomi Saphra, Gail Weiss Abstracts due Feb 12, 2026 Details: https://t.co/AzgF1TMyOS

1

9

5

2

5K

Andy J Yang @pentagonalize

4 months ago

Deadline in just under two weeks!

0

2

0

173

Andy J Yang @pentagonalize

4 months ago

Inviting submissions to the first Workshop on Formal Languages and Neural Networks! We welcome posters dicussing the formal expressivity, computational properties, and learning behavior of neural networks! Call for posters: https://t.co/tSof6AiXlR Deadline: February 12, 2026

Andy J Yang @pentagonalize

6 months ago

Announcing the first Workshop on Formal Languages and Neural Networks (FLaNN) 🍮! We invite the submission of abstracts for posters that discuss the formal expressivity, computational properties, and learning behavior of neural network models, including large language models.

pentagonalize's tweet photo. Announcing the first Workshop on Formal Languages and Neural Networks (FLaNN) 🍮!

We invite the submission of abstracts for posters that discuss the formal expressivity, computational properties, and learning behavior of neural network models, including large language models. https://t.co/HZeuHqSp3B

1

35

14

8

11K

2

21

11

5

7K

Andy J Yang

@pentagonalize

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users