Manjunath @MIrukulla - Twitter Profile

Pinned Tweet

1 day ago

Ask GPT how many letters there are in the word "STRAWBERRY". Watch it fail! It's not that it can't count; there's a much bigger reason behind this. To understand it, you need to know about Tokenization, my friend...! It's not that long a story, really.
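A minimal sketch of the idea, assuming a tiny made-up vocabulary and a greedy longest-match split. Real GPT tokenizers use byte-pair encoding with a much larger learned vocabulary, so `TOY_VOCAB` and `tokenize` here are hypothetical stand-ins for illustration only. The point is that the model receives a few token IDs, not ten separate letters.

```python
# Toy illustration only: this vocabulary is hypothetical, not GPT's real one.
TOY_VOCAB = {"str": 0, "aw": 1, "berry": 2}


def tokenize(word, vocab):
    """Greedy longest-match split of `word` into pieces found in `vocab`."""
    tokens = []
    i = 0
    while i < len(word):
        # Try the longest remaining substring first, then shrink it.
        for j in range(len(word), i, -1):
            piece = word[i:j]
            if piece in vocab:
                tokens.append(piece)
                i = j
                break
        else:
            raise ValueError(f"no vocabulary piece matches at position {i}")
    return tokens


word = "strawberry"
pieces = tokenize(word, TOY_VOCAB)
ids = [TOY_VOCAB[p] for p in pieces]

print(pieces)     # ['str', 'aw', 'berry']
print(ids)        # [0, 1, 2]  <- what the model is actually given
print(len(word))  # 10         <- the letter count it is asked about
```

Under this toy split, the individual letters are hidden inside three opaque IDs, which is one plausible reason letter-counting questions trip the model up.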

8

249

30

278

30K

Manjunath

@MIrukulla

about 5 hours ago

@Machi091 That's my dad's work setup. I use it at nights mostly 😅

2

1

0

35

Manjunath

@MIrukulla

about 5 hours ago

That’s all for the day.

✅ Revision Deep Learning
✅ Revision LLMs
✅ Prepare for an interview.
✅ Applied for the Google DeepMind hackathon.
✅ Project research and architecture planning. https://t.co/pa3tgIiaxN

3

14

2

3

292

Manjunath

@MIrukulla

about 5 hours ago

@SiO2ey All the best!

0

18

Manjunath

@MIrukulla

about 14 hours ago

@unknowexistence Yeah, maybe that helps, but the question is how much is enough to achieve that?

0

25

Manjunath

@MIrukulla

about 16 hours ago

Interview Question -

Is the context window a software problem?

A hardware problem?

Or both?

Your answer tells me how deeply you understand LLMs. 🧵 https://t.co/b8Hzf4yR2X

2

10

4

0

238

Manjunath

@MIrukulla

about 16 hours ago

This exact tension, needing to hold growing context without paying growing memory cost, is precisely why KV caching exists. A full breakdown of how it actually works is coming next Monday. What's your take? Should context windows scale via smarter software, or are we just waiting on better hardware?
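A rough sketch of the memory side of this trade-off, under stated assumptions: a KV cache avoids recomputing keys and values for earlier tokens, but the cache itself grows linearly with context length. The model shape in `HYPOTHETICAL_MODEL` (32 layers, 32 KV heads, head dimension 128, 2-byte values) is an illustrative assumption, not a figure from the thread.

```python
def kv_cache_bytes(seq_len, n_layers, n_kv_heads, head_dim,
                   bytes_per_elem=2, batch_size=1):
    """Bytes needed to store keys and values for `seq_len` tokens."""
    # The leading 2 accounts for storing both K and V in every layer.
    return (2 * n_layers * n_kv_heads * head_dim
            * seq_len * bytes_per_elem * batch_size)


# Hypothetical model shape, chosen only to make the arithmetic concrete.
HYPOTHETICAL_MODEL = dict(n_layers=32, n_kv_heads=32, head_dim=128)

for seq_len in (4096, 8192, 16384):
    gib = kv_cache_bytes(seq_len, **HYPOTHETICAL_MODEL) / 2**30
    print(f"{seq_len:>6} tokens -> {gib:.1f} GiB")
# Prints 2.0, 4.0 and 8.0 GiB: doubling the context doubles the cache.
```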

0

3

0

40

Manjunath

@MIrukulla

about 16 hours ago

So, the verdict is that software created the shape of the problem: quadratic compute growth and trained positional limits. Hardware decides how far you're allowed to push before physically running out of memory or bandwidth. Neither one is "the" bottleneck. They're a coupled system: software writes the check, and hardware decides if it clears.
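To make "quadratic compute growth" concrete, here is a back-of-the-envelope sketch. It assumes standard full self-attention over the whole sequence and counts only the two large matrix products per layer (scores and the weighted sum over values). The function name and the model width of 4096 are hypothetical choices for illustration.

```python
def attention_matmul_flops(seq_len, d_model):
    """Approximate FLOPs for one layer's attention matmuls over a full sequence."""
    # Q @ K^T: seq_len x seq_len scores, each a d_model-long dot product
    # (counting one multiply and one add per element, hence the factor 2).
    score_flops = 2 * seq_len * seq_len * d_model
    # softmax(scores) @ V: same shape of work again.
    weighted_sum_flops = 2 * seq_len * seq_len * d_model
    return score_flops + weighted_sum_flops


D_MODEL = 4096  # hypothetical model width

base = attention_matmul_flops(1024, D_MODEL)
for seq_len in (1024, 2048, 4096):
    ratio = attention_matmul_flops(seq_len, D_MODEL) / base
    print(f"{seq_len:>5} tokens -> {ratio:.0f}x the compute of 1024 tokens")
# Prints 1x, 4x, 16x: each doubling of context quadruples this cost.
```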

1

0

41

Manjunath

@MIrukulla

about 17 hours ago

@virattt @ycombinator @findatasets Congratulations! But I'm curious: why do we need stock market infrastructure for agents, apart from latency issues?

0

47

Manjunath

@MIrukulla

about 17 hours ago

@dreamydoer_9 Woahhhh!!! Thanks for reposting. 🙌🏻❤️

0

29

MIrukulla retweeted

Praveen Kumar Verma

@Alacritic_Super

1 day ago

As an AI Infrastructure Engineer

Please learn:

- GPU architecture, VRAM fundamentals, CUDA & memory hierarchy
- Quantization (INT8/FP8/4-bit), batching & continuous batching
- vLLM, TensorRT-LLM, SGLang, llama.cpp & inference optimization
- KV caching, speculative decoding, prefix caching & token throughput
- Distributed training (DDP, FSDP, DeepSpeed, ZeRO)
- Model serving (Triton, vLLM, KServe, Ray Serve, SGLang)
- Kubernetes, Docker & GPU orchestration
- NCCL, InfiniBand & high-speed networking
- Multi-GPU & multi-node inference
- LoRA, QLoRA, PEFT & fine-tuning pipelines
- Vector databases, embeddings & RAG pipelines
- Prompt caching, semantic caching & cost optimization
- Observability (OpenTelemetry, Prometheus, Grafana, Langfuse, Phoenix)
- LLM evaluation, benchmarking & A/B testing
- Model routing & fallback strategies
- MCP, AI agents & workflow orchestration
- Data pipelines, Kafka & streaming inference
- Security, guardrails & prompt injection mitigation
- CI/CD for ML (MLOps), MLflow & model registries
- Linux, networking & storage fundamentals
- PyTorch internals, CUDA profiling & kernel optimization

47

2K

233

3K

130K