commotum @commotum - Twitter Profile

commotum retweeted

2 days ago

Introducing paper replications for arXiv Research codebases are notoriously hard to set up We deploy autoresearch agents to ingest popular arXiv repos, resolve setup issues, and get the core claim running. You can now sort papers by ease of implementation 🚀

10

420

50

259

56K

commotum retweeted

BURKOV

@burkov

2 days ago

Transformers keep their entire input history available and can look back into it for any fact, which suits retrieval but doesn't work well with a task the authors call state tracking. Imagine a model playing a guessing game: after each guess it says "higher" or "lower," so it must carry along the range the hidden number could still be in and narrow that range with every guess. That range is a running summary the model has to keep and revise as the conversation goes; each guess moves it one step forward; and each narrowed range is computed from the previous range plus the latest guess, so the values form a chain, every one depending on the last. In this recent arXiv paper from Google, the authors show transformers handle this badly: a model gives contradictory hints in this game or could flips between the two meanings of "bank" mid-conversation. The reason is in how the information flows in a transformer. As a transformer reads input, it holds a separate vector of numbers for every word at every layer of the network, and information only ever flows upward through the layers, never back down—so once the model computes the current range and parks it at some layer, the next range, which depends on that one, can only land at the same layer or deeper, never shallower. Each step pushes the running summary one level up the stack, and since the stack has a fixed number of layers, a long enough conversation runs it off the top with nowhere left to put the next update. The author argue that a proper state tracking needs recurrence—feeding a deep representation back down to a shallow one, as older recurrent networks did—and explain why this implicit recurrence beats making a model "think out loud" in extra tokens for routine bookkeeping. https://t.co/uCmLHZxRDI

7

109

20

83

6K

commotum retweeted

Sam Whitmore

@sjwhitmore

2 days ago

been thinking about why I don't feel comfortable with AI being a true personal assistant still, even though the models have gotten so much better. i realized a lot of the reason for me is actually git / version control. using an agent that is 99.999999% accurate vs 97% accurate doesn't really matter to me when the errors are high impact. with code, i can yolo a PR & know that its will not be destructive to my existing codebase. there's no similar staging mechanism for my calendar, shopping cart, etc. in theory you can create calendar event drafts, but very few of the other useful activities (esp spending money) have a draft / easy review paradigm. the solves here will most likely be on the infra / scaffolding level. once you start making a universal shopping cart draft, you're basically inventing a new web browser, or an agent-first wallet / credit card.

20

188

4

69

18K

commotum retweeted

Jacob X. Li

@jacobli99

2 days ago

Continual learning is widely discussed right now, but mostly as improving on the job or avoiding catastrophic forgetting. But it has a different, difficult, and already urgent form: Given nothing but a corpus of documents, how should AI systems develop expertise in a new, unfamiliar domain? We call this problem Machine Studying.

jacobli99's tweet photo. Continual learning is widely discussed right now, but mostly as improving on the job or avoiding catastrophic forgetting. But it has a different, difficult, and already urgent form:

Given nothing but a corpus of documents, how should AI systems develop expertise in a new, unfamiliar domain? We call this problem Machine Studying.

10

483

86

569

108K

Who to follow

commotum retweeted

2 days ago

the only two skills that matter: - kernel engineering - rl environment design

24

1K

43

876

140K

commotum retweeted

AluanWang

@IOivm

2 days ago

This system's key feature: direct SVG export. What looks 3D is really just a 2D projection — so you edit it straight in Illustrator (right). The whole thing thinks in paths. Let it follow the brush, and it takes on any style. Not a filter !! teaching a computer how humans draw.

1

44

4

20

2K

commotum retweeted

Arip

@machinestein

3 days ago

ICML 2026: Latent Reasoning in TRMs is Secretly a Policy Improvement Operator Why does recursive reasoning, especially latent reasoning, actually work? The theory is still young, and even mechanistic explanations are limited. We close part of this gap by showing that latent reasoning is secretly doing policy improvement. Each recursion pushes the model steadily toward the target. Based on this view, we propose an algorithm that boosts learning and inference efficiency by up to 18x.

9

376

48

399

24K

commotum retweeted

himanshu @retr0sushi_

3 days ago

anybody with a world models background is not surprised! pretty neat work

2

445

22

383

54K

commotum retweeted

Jayden Teoh

@jayden_teoh_

3 days ago

6/ Free Lunch: Speculative Decoding. Because NextLat learns a latent dynamics, it unlocks 𝘃𝗮𝗿𝗶𝗮𝗯𝗹𝗲-𝗹𝗲𝗻𝗴𝘁𝗵 𝘀𝗽𝗲𝗰𝘂𝗹𝗮𝘁𝗶𝘃𝗲 𝗱𝗲𝗰𝗼𝗱𝗶𝗻𝗴: we can draft a flexible number of future tokens recursively in latent space. 🚀 This accelerates inference by up to 3.3x, much faster than MTP-style drafting!

1

56

2

17

5K

commotum retweeted

Adam Shuaib

@adamshuaib

4 days ago

A strong predictor of who does extraordinary work is being terrible with everyday life; they get people’s names wrong, forget meetings, constantly lose things or don’t remember to eat because all their mental bandwidth is going elsewhere. We met a researcher with 87000+ unread emails and a passport that expired the day before an AI conference he was speaking at, and a founder who had worn pretty much the same outfit every day for 10yrs because choosing took up way too much mental effort. Maintaining a well run life is a part-time job hours-wise. For someone who is juggling a hard problem in their head 24-7, there is no spare capacity to run these background processes. Society treats this as a flaw to coach out (the command is “be more present”), but this level of absorption is necessary to solve a problem nobody else has.

108

2K

169

1K

286K

commotum

@commotum

4 days ago

@notch @igelblork Nailed it. For me the big trip was realizing that 3D APIs usually write q*v, but mean rotate(v) = q v q^-1. So i/j/k are 180° rotors there. But Hamilton’s multiplication is a 90° turn between perpendicular units, which makes ij=k and ijk=-1 way more intuitive.

0

2

0

1

176

commotum retweeted

Eddie Lee

@eddietree

6 days ago

built a tiny RL rock-stacking experiment trained on 200k sims via PufferLib (@jsuarez) observes rock geometry, mass, friction, velocity, stack contour, support intervals, roll torque, balance, etc

2

118

12

63

14K

commotum retweeted

Guodong Zhang

@Guodzh

9 days ago

spent half of my PhD working on optimization research, I only published one negative result paper showing beating SGD with momentum is hard especially when noise dominated regime 🤣

7

326

5

72

40K

commotum retweeted

dax

@thdxr

8 days ago

it's not done if it's not implemented it's not done if the implementation is ugly it's not done if it's not documented it's not done if users can't discover it it's not done if you can't market it

130

3K

246

788

97K

commotum

@commotum

9 days ago

@owl_posting for me it was when HBO pulled Westworld from the catalogue

1

25

0

1

4K

commotum retweeted

HomeMadeGarbage

@H0meMadeGarbage

10 days ago

Codexで強化学習生き物つくってもうた。。

109

6K

443

2K

819K

commotum retweeted

LoLNothingMatters @DastDn

10 days ago

What a sentence.

9

6K

260

63

97K

commotum retweeted

Dhruv π

@dhruv31415

10 days ago

New personal blog! (n, k)-Local Polynomial Simplicial Attention, which generalizes LLA & 2-simplicial attention as two orthogonal design axes that jointly shape the regression landscape.