chittem @AdithyaChittem - Twitter Profile

Pinned Tweet

3 months ago

New Paper: mmWave Radar Aware Dual-Conditioned GAN for Speech reconstruction of Signals with Low SNR. 🧵(1/4) Demo Website with audios and spectrograms: https://t.co/v7X1EobN8y Paper: https://t.co/DXTi4Wb0CD

1

15

0

1

1K

chittem @AdithyaChittem

1 day ago

@TexasPercy 10/10

0

17

AdithyaChittem retweeted

ohshin

@ohshinbhat

6 days ago

i dont understand this codebase anymore man

36

166

3

7

10K

chittem @AdithyaChittem

6 days ago

@sarvagya_kul Was this through backdoor too lmao

0

119

chittem @AdithyaChittem

6 days ago

@sdianahu Grading reasoning traces is probably one of the hardest things to do considering how much traces vary across model providers and even releases across the same model provider. You'd just have to keep coming up with new measures/evals even if there's a small weight change.

0

43

AdithyaChittem retweeted

Xiao Ma

@infoxiao

11 days ago

I just open sourced my “lazy or craftsman?” simple test.

10

750

43

93

51K

chittem @AdithyaChittem

11 days ago

@Ishansharma7390 What the fuck

0

14

AdithyaChittem retweeted

sean lee

@infinitefun_

15 days ago

most of what is considered "taste" (read: design) is in the realm of zero sum signaling games

your tasteslop will just be the next AI slop and then your anti-tasteslop will become the next tasteslop

taste is defined in terms of slop and therefore can never transcend slop.

92

875

41

187

180K

chittem @AdithyaChittem

18 days ago

Is anyone at OpenAI still working on Prism? Was such a great idea, I cannot believe they messed it up. Overleaf HAS to go.

0

2

0

103

AdithyaChittem retweeted

maharshi

@maharshii

22 days ago

is it just me or has claude fable 5 gotten noticeably worse?

186

3K

82

86

278K

AdithyaChittem retweeted

David K 🎹

@DavidKPiano

24 days ago

Too many developers don't understand what "compounding slop" is. A loop that prompts agents is a great way to automate slop creation. Constrain the state-action space so the loop can't drift, then automate inside it. Human-in-the-loop = feature, not bottleneck.

42

414

34

58

29K

chittem @AdithyaChittem

27 days ago

AUSTRALIA TIMEEEEEEE LFGGGGGGGGG

chittem @AdithyaChittem

3 months ago

New Paper: mmWave Radar Aware Dual-Conditioned GAN for Speech reconstruction of Signals with Low SNR. 🧵(1/4) Demo Website with audios and spectrograms: https://t.co/v7X1EobN8y Paper: https://t.co/DXTi4Wb0CD

1

15

0

1

1K

2

16

0

1K

AdithyaChittem retweeted

K.Bourdieu @infohazard_lol

29 days ago

Anthropic has seriously shattered my AI psychosis, so now I feel extremely anxious about the sudden lack of perceived hyperproductivity. I don't trust any of my agents, Claude or otherwise, will get a single thing even partially correct. Even for vibe code fun the magic is gone

117

4K

69

616

383K

chittem @AdithyaChittem

about 1 month ago

If a year from now we are still writing 3000 line markdown files we will have completely lost the plot

Rohan Paul

@rohanpaul_ai

about 1 month ago

This Meta + Stanford + Illinois survey paper argues that AI agents work better when code becomes their main working layer.

The problem is that an LLM by itself is mostly a text predictor, so long tasks can lose state, hide mistakes, and turn plans into actions in fragile ways.

The real advance is not “AI writes code,” but “AI uses code as the environment it thinks inside.”

The authors call the surrounding system an agent harness, meaning the tools, memory, sandboxes, checks, and feedback loops that turn a model into an agent.

Their core idea is that code should sit at the center of that harness, because code can be run, inspected, checked, saved, edited, and shared.

Tests become sensors.

Repositories become memory.

Logs become history.

Sandboxes become boundaries.

A generated script is no longer merely an answer; it is a handle the system can run, check, revise, share, and roll back.

The main finding is a pattern across many fields: code helps agents reason through executable steps, act through tool calls or control programs, and model environments through tests, traces, logs, repositories, and simulators.

----

Paper Link – arxiv. org/abs/2605.18747

Paper Title: "Code as Agent Harness"

20

169

42

157

10K

0

117

chittem @AdithyaChittem

about 1 month ago

@redrodeo03 @c_engines Congratsss

0

66

chittem @AdithyaChittem

about 1 month ago

Sloppification is going to be a large scale problem soon enough and whoever builds smthng that can balance abstraction without missing out on nuance that actually solves issues in production grade code wins no doubt

0

1

0

29

chittem @AdithyaChittem

about 1 month ago

I do think long term the health of repos everywhere will be near dogshit. The instant gratification from seeing the sloppy code "work" makes most devs completely ignorant of anything long term. Moreover the increased expectations in terms of output don't help this either

Mario Zechner

@badlogicgames

about 1 month ago

recommended reading. https://t.co/2GgZb04PoE

17

1K

87

2K

74K

1

0

85

chittem @AdithyaChittem

about 1 month ago

"let's build a review agent" sure that'll report 100 bugs in its report. Will YOU fix it? No. You feed it back to an agent that does the same thing all over again

1

0

38