Prajjwal · building nanoserve @pdurdenj - Twitter Profile

Pinned Tweet

5 days ago

I'm building an LLM inference engine from scratch, in public. Not using vLLM or TGI. The goal: re-derive every serving trick until I understand how modern inference squeezes throughput from a GPU. I call it nanoserve: https://t.co/TbCxgWkUna


2

1

0

155

Prajjwal · building nanoserve @pdurdenj

about 1 hour ago

Anthropic has officially filed for its IPO, cementing its place as one of the most valuable AI startups. The move lands amid loud debate over an AI bubble, setting up a defining test of how much investors will pay for frontier AI. #AI #Anthropic #Claude #IPO #LLM


0

5

Prajjwal · building nanoserve @pdurdenj

about 2 hours ago

Vint Cerf, the 'Father of the Internet,' is finally retiring. The co-designer of TCP/IP has spent decades as Google's chief internet evangelist, shaping the protocols every AI system now runs on. End of an era. #Internet #TCPIP #Google #Tech #AI


0

5

Prajjwal · building nanoserve @pdurdenj

about 3 hours ago

The Trump administration is lifting US export controls on Anthropic's Mythos and Fable AI models, reversing curbs imposed over cybersecurity concerns and clearing the way for wider international access. #AI #Anthropic #ExportControls #LLM #AIPolicy


0

6


Prajjwal · building nanoserve @pdurdenj

about 4 hours ago

Microsoft Copilot had a critical flaw that let attackers steal users' 2FA codes, per Ars Technica. It exposed one-time login tokens before Microsoft patched it. A reminder that AI assistants widen the attack surface. #AI #Copilot #Microsoft #InfoSec #CyberSecurity


0

11

Prajjwal · building nanoserve @pdurdenj

about 5 hours ago

Meta quietly launched Pocket, a mobile gaming app built largely through vibe coding, using AI to generate much of it instead of handwritten code. A notable test of how far AI-assisted development can go for consumer products. #AI #Meta #VibeCoding #GenerativeAI


0

20

Prajjwal · building nanoserve @pdurdenj

about 6 hours ago

Notion is shutting down Notion Mail, its Skiff-influenced email app, saying most users now lean on AI agents to handle email instead of a dedicated inbox. A telling sign of how agentic tools are quietly reshaping productivity software. #AI #Notion #AIagents #Productivity #Skiff


1

0

14

Prajjwal · building nanoserve @pdurdenj

about 7 hours ago

A new startup says LLMs are stuck in a 'groupthink groove,' converging on the same safe answers and losing output diversity. Its pitch: methods to nudge models toward more varied, less homogenized responses. #AI #LLM #MachineLearning #AIResearch #Startups


0

1

0

3

Prajjwal · building nanoserve @pdurdenj

about 8 hours ago

Microsoft is building a bouncer for Teams: a new control that blocks unauthorized AI bots and notetaker agents from silently joining meetings as automated attendees pile up. Admins get to decide what gets a seat in the room. #AI #Microsoft #Teams #Bots #Enterprise


0

1

0

28

Prajjwal · building nanoserve @pdurdenj

about 9 hours ago

Oracle is cutting roughly 21,000 jobs to fund a massive, debt-fueled buildout of AI data centers and compute. The layoffs bankroll billions in capex as it leans into cloud AI infrastructure. A bold bet on AI. #AI #Oracle #Layoffs #CloudComputing #DataCenters


0

10

Prajjwal · building nanoserve @pdurdenj

about 10 hours ago

New platform Flare lets anyone publicly report AI systems behaving badly, flagging flaws, unsafe outputs and failures so researchers can track them. Basically a crowdsourced early warning system for AI risk. #AI #AISafety #LLM #MachineLearning #TechNews


0

1

0

10

Prajjwal · building nanoserve @pdurdenj

about 11 hours ago

Ashton Kutcher is leaving Sound Ventures, the firm he co-founded, to launch a new VC firm with ex-a16z partner Morgan Beller, who helped create Meta's Diem crypto project. Their fresh fund targets AI and frontier tech. #AI #VentureCapital #Startups #Tech #VC


0

1

0

22

Prajjwal · building nanoserve @pdurdenj

about 12 hours ago

Mark Zuckerberg reportedly told Meta staff that AI agents haven't progressed as quickly as he'd hoped. A candid admission from a CEO betting billions on AI, and a notable crack in the agent hype cycle. #AI #AIagents #Meta #Zuckerberg #MachineLearning


0

1

0

46

Prajjwal · building nanoserve @pdurdenj

about 13 hours ago

A new humanoid robot from Flexion is being pitched as a white-collar office intern, handling routine desk work and errands with unsettling competence, per WIRED. Automation is going physical. #AI #Robotics #Humanoids #Automation #FutureOfWork


1

2

0

11

Prajjwal · building nanoserve @pdurdenj

about 14 hours ago

Same trick vLLM uses to pack many sequences into one shared pool. Still matches HF on Llama-3.2-1B token for token. Week 5 done. 112 tests green. #AI #LLM #vLLM #BuildInPublic #Claude #OpenAI

0

18
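The "one shared pool" idea in the tweet above can be sketched roughly as follows. This is a minimal illustration, not nanoserve's or vLLM's actual code: the names `BlockPool`, `BLOCK_SIZE`, and `append_token` are hypothetical, and the block size of 16 is an assumed value. Each sequence keeps a small table of physical block ids, so sequences of different lengths can share one pool without reserving contiguous memory up front.

```python
BLOCK_SIZE = 16  # tokens per KV block; an assumed value for illustration


class BlockPool:
    """Hypothetical shared pool of fixed-size KV blocks (bookkeeping only)."""

    def __init__(self, num_blocks: int) -> None:
        self.free_blocks = list(range(num_blocks))
        # Per-sequence block table: logical block index -> physical block id.
        self.block_tables = {}

    def append_token(self, seq_id: int, position: int):
        """Return (physical block, offset) for a token appended in order."""
        table = self.block_tables.setdefault(seq_id, [])
        if position // BLOCK_SIZE == len(table):
            # The sequence crossed into a new logical block: grab a free one.
            if not self.free_blocks:
                raise MemoryError("KV pool exhausted")
            table.append(self.free_blocks.pop(0))
        return table[position // BLOCK_SIZE], position % BLOCK_SIZE


pool = BlockPool(num_blocks=4)
for pos in range(20):  # sequence 0 needs two blocks for 20 tokens
    pool.append_token(0, pos)
for pos in range(5):   # sequence 1 fits in a single block
    pool.append_token(1, pos)
print(pool.block_tables)  # {0: [0, 1], 1: [2]}
```

Only the mapping is shown here; a real engine would also store the key and value tensors in the physical blocks and have the attention kernel read through the table.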

Prajjwal · building nanoserve @pdurdenj

about 14 hours ago

Day 17 of building an LLM inference engine from scratch. Paged KV memory now reaches the path the server actually calls: sampling. And a finished sequence hands its blocks back so the next one reuses them. https://t.co/TbCxgWkUna


1

0

9

Prajjwal · building nanoserve @pdurdenj

about 14 hours ago

How I proved reuse instead of claiming it: one pool sized for a single sequence, handed to two runs back to back. The second only allocates because the first freed. If free-on-finish regressed, it would raise instead of quietly passing.

1

0

5
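The test described in the tweet above might look something like this. It is a sketch under stated assumptions, not the author's test: `TinyPool`, `PoolExhausted`, `run_sequence`, and the `free_on_finish` flag are all hypothetical names introduced for illustration. The point is the pool size: with room for exactly one sequence, the second run can only succeed if the first one returned its blocks.

```python
class PoolExhausted(RuntimeError):
    """Raised when a sequence asks for more blocks than are free."""


class TinyPool:
    """Hypothetical block pool that tracks free block ids only."""

    def __init__(self, num_blocks: int) -> None:
        self.free = list(range(num_blocks))

    def allocate(self, n: int):
        if n > len(self.free):
            raise PoolExhausted(f"need {n} blocks, only {len(self.free)} free")
        taken, self.free = self.free[:n], self.free[n:]
        return taken

    def release(self, blocks) -> None:
        self.free.extend(blocks)


def run_sequence(pool: TinyPool, blocks_needed: int, free_on_finish: bool = True):
    """Stand-in for one generation run: allocate, (decode), free on finish."""
    blocks = pool.allocate(blocks_needed)
    # ... decoding would happen here ...
    if free_on_finish:
        pool.release(blocks)
    return blocks


def reuse_holds(free_on_finish: bool) -> bool:
    """Run two sequences back to back through a pool sized for one."""
    pool = TinyPool(num_blocks=4)
    run_sequence(pool, 4, free_on_finish)
    try:
        run_sequence(pool, 4, free_on_finish)
    except PoolExhausted:
        return False
    return True


print(reuse_holds(True))   # True: the second run reuses the freed blocks
print(reuse_holds(False))  # False: without free-on-finish the pool is empty
```

Sizing the pool for a single sequence is what makes the check strict: a larger pool would let the second run pass even if nothing was ever freed.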

Prajjwal · building nanoserve @pdurdenj

about 14 hours ago

Report: Meta had contractors pose as teens to prompt rival chatbots about suicide, sex, and drugs, testing how competitors handle sensitive queries from minors. A messy look at how AI safety benchmarking really happens. #AI #Meta #AISafety #Chatbots #TechNews


0

1

0

53

pdurdenj retweeted

TensorTonic

@TensorTonic

1 day ago

If you want to actually understand LLMs (not just use them), read these in order: 1. Attention Is All You Need (transformers) 2. GPT-2 (scaling + zero-shot) 3. Scaling Laws (Kaplan, 2020) 4. GPT-3 (few-shot) 5. Chinchilla (how much data you actually need) 6. InstructGPT (RLHF, why ChatGPT works) 7. LoRA (fine-tuning without going broke) 8. FlashAttention (why it's fast) 9. Chain-of-Thought (reasoning) 10. DPO (RLHF without the pain)

9

889

79

1K

31K

