vish @vishctx - Twitter Profile

vish

@vishctx

about 6 hours ago

@pritopian Gemini does the same thing. It links everything to my work interests as well.

0

1

0

44

vish

@vishctx

2 days ago

I don't think a lot of input tokens go into producing code. Almost all of the tokens spent in a mature product or codebase go to investigation. For this patch: https://t.co/k7ludOHRr5 I ended up using roughly 1.2 million tokens across debugging verbose kernel debug logs, writing reproducers that never got checked in, and understanding the codebase.

0

2

0

712

vish

@vishctx

3 days ago

Everyone wants multi-agent workflows, but the mental model required to build them is not straightforward. It's the same roadblock as multithreaded programming where you're trying to manage a bunch of distinct contexts at once. Until you nail the communication layer, the shared goals just get lost in translation.

0

5

0

123

vish

@vishctx

5 days ago

Is anyone in my network currently working on Optimal Transport for generative modeling of multimodal data? I'm expanding my scope into this space and would love some recommendations for introductory papers and resources. What should I be reading?

0

170

vish

@vishctx

5 days ago

@charvispeaks 'Claude, please resolve this Sev 2 so I can go back to sleep. Make no mistakes.'

0

2

0

1K

vish

@vishctx

6 days ago

There’s a massive blind spot in the benchmarks. By the time an issue makes it to GitHub with a reproducible state, 80% of the hardest engineering work is already done. Current benchmarks hand models extremely precise problem statements.

But in the real world, like when debugging the Linux kernel, you rarely start knowing what the problem actually is. All a user will report is “the app is OOMing, and increasing memory doesn’t help.” Digging into that requires intuition built from past issues. The root cause could be memory leaks, memory fragmentation, or a race condition where threads acquire memory and never release it, leading to starvation.

We desperately need benchmarks with highly ambiguous starting conditions to test whether a model can navigate a state with multiple distinct root-cause scenarios. Right now, models like Opus easily get stuck in loops during open-ended investigations. They rarely move forward unless I ask them to check hypotheses A, B, or C. The next frontier for SWE evals should also include cases where the model is trying to figure out what's actually broken in the first place.

0

1

0

77

vish

@vishctx

11 days ago

@championswimmer I think they recently launched an ad feature, maybe that gets a boost because of this?

1

2

0

367

vish

@vishctx

11 days ago

@charles_irl I always found KV Cache to be like naan bread. KV and Cache kinda mean the same thing!

2

17

0

1

4K

vish

@vishctx

13 days ago

@amitpr Wait until you reach the KVM guest clocks, fun begins then! :)

0

88

vish

@vishctx

14 days ago

C is the most secure language right now because it has no package manager. Zero supply chain attacks. You want to know if a number is even? You don't npm install. You write x & 1 and manipulate those bits yourself.

6

20

0

1

6K
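
As a minimal sketch of the x & 1 check mentioned in the post above: the lowest bit of an unsigned integer is 0 exactly when the value is even. The helper name is_even and the choice of unsigned int are assumptions made for illustration, not something taken from the original post.

```c
#include <stdbool.h>

/* Hypothetical helper (the name is illustrative): returns true when x is even.
 * The lowest bit of an unsigned integer is 0 for even values and 1 for odd
 * values, so masking with 1 isolates exactly the parity bit. */
bool is_even(unsigned int x) {
    return (x & 1u) == 0u;
}
```

With this sketch, is_even(4) is true and is_even(7) is false. An unsigned type is used here so the example does not depend on how negative values are represented.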

vish

@vishctx

15 days ago

@hahnbeelee At this point every company should start offering a headless mode

0

1

0

324

vish

@vishctx

16 days ago

@qianl_cs Thank you!!

0

1

0

107

vish

@vishctx

16 days ago

@ivanleomk Thanks man

0

69

vish

@vishctx

16 days ago

@agentic_matt Yeah, I am open to any roles in that space.

0

1

0

50

vish

@vishctx

17 days ago

@LeylaKuni @upInYerCommentz Assuming they could figure out a way to do cabling, a lot of old office buildings are in downtown areas. There’s only so much power that can be drawn from the power lines today, and getting more power into DCs means all of the other buildings around them lose capacity.

0

73

vish

@vishctx

17 days ago

@oanaolt I work with model trajectories and logs, I can help with it!

1

0

67

vish

@vishctx

18 days ago

@championswimmer @badlogicgames Vibe coding will become Vibe Clauding

0

1

0

88

vish

@vishctx

18 days ago

LLMs are just like me during my entrance exam: look at the options, reason that a human definitely can’t walk more than 10 km in 20 minutes, and then derive an answer!

Nishant Balepur @NishantBalepur

19 days ago

🚨 New Paper! 🚨 One of my first Ph.D. papers found that LLMs can answer multiple-choice questions without seeing the question 🤔 At #ACL2026, I'm presenting a follow-up showing that current reasoning LLMs can still do this! And quite similarly to a clever test-taker 🧑‍🎓🧵

https://t.co/X25UnlSJY2

50

2K

110

829

1M

1

5

1

7K

vish

@vishctx

19 days ago

@ivanburazin Move portfolio to Daytona! ;)

0

1

0

171
