If you're using Claude Code for research: stop making it read directly from PDFs
We've introduced a SKILL.md that fetches structured, AI-friendly paper overviews from alphaXiv ๐
๐ถ๏ธ Some (perhaps) spicy thoughts. Itโs been a while since my last tweet, but I wanted to write about how disorienting it has been from academia to an LLM lab ๐
The kind of research I was trained to do during my PhD almost doesnโt exist here. The obsession with mathematical elegance and novelty is mostly gone. Everything is about scaling data and compute. For a while, that really got to me. At my lowest point, I felt like Iโd lost interest in building LLMs altogether. I didnโt feel intellectually challenged anymore.
What made this even stranger was that, at a technical level, things worked. If there was a capability I wanted to teach a model, scaling the right data and compute always got me there, no exception (so far).
But recently, I found a way to reconcile with myself..
I realized the real competition isnโt in the ML recipe anymore. Most teams do roughly the same thing. What actually matters is how fast you can iterate, test ideas, and recover from mistakes. And that speed is mostly backed by infrastructure ๐๏ธ Faster loops, fewer bugs, better tooling.
Seeing this made me excited again! Infra is its own deep, hard, and intellectually fun problem space.
In 2026, I want to become an ML researcher whoโs really good at infra. And I'll come back to ML problems with that edge, and will be excited to share what I find ๐
It will soon be PhD application season.
If you are applying for PhD programs in engineering, apply to groups that are building something cool and interesting, and are not just optimizing for the number of papers they publish every year.
CSHL is launching a new PhD program focusing on AI ร Bio/Neuro! If you already have a masterโs or equivalent and want a faster, more streamlined route to a PhD, check it out! #PhD
https://t.co/VorfVfzXrq
๐๐ณ๐๐ฒ๐ฟ ๐ญ๐ฌ+ ๐๐ฒ๐ฎ๐ฟ๐ ๐ถ๐ป ๐ฟ๐ผ๐ฏ๐ผ๐ ๐น๐ฒ๐ฎ๐ฟ๐ป๐ถ๐ป๐ด, from my PhD at Imperial to Berkeley to building the Dyson Robot Learning Lab, one frustration kept hitting me:
๐ช๐ต๐ ๐ฑ๐ผ ๐ ๐ต๐ฎ๐๐ฒ ๐๐ผ ๐ฟ๐ฒ๐ฏ๐๐ถ๐น๐ฑ ๐๐ต๐ฒ ๐๐ฎ๐บ๐ฒ ๐ถ๐ป๐ณ๐ฟ๐ฎ๐๐๐ฟ๐๐ฐ๐๐๐ฟ๐ฒ ๐ผ๐๐ฒ๐ฟ ๐ฎ๐ป๐ฑ ๐ผ๐๐ฒ๐ฟ ๐ฎ๐ด๐ฎ๐ถ๐ป?
๐ง๐ต๐ฒ ๐ฝ๐ฎ๐๐๐ฒ๐ฟ๐ป ๐ ๐ธ๐ฒ๐ฝ๐ ๐๐ฒ๐ฒ๐ถ๐ป๐ด:
โข New robotics team starts
โข Spends 6 months building data collection pipeline
โข Spends another 3 months debugging synchronization issues
โข Finally starts collecting task-specific data
โข Realizes their infrastructure choices limit their flexibility
โข Starts over
๐ง๐ต๐ถ๐ ๐ถ๐ ๐๐ต๐ฒ ๐๐ต๐ผ๐น๐ฒ ๐ฝ๐ผ๐ถ๐ป๐ ๐ผ๐ณ ๐ฟ๐ผ๐ฏ๐ผ๐ ๐น๐ฒ๐ฎ๐ฟ๐ป๐ถ๐ป๐ด: Robot learning is fundamentally data-driven. Whether you're picking strawberries or assembling electronics, the core infrastructure needs are identical. That's actually why I was so interested in pursuing data-driven robotics over a decade ago.
๐ฌ๐ผ๐ ๐ฎ๐น๐๐ฎ๐๐ ๐ป๐ฒ๐ฒ๐ฑ:
โข Multi-sensor data synchronization across different frequencies
โข Flexible storage that works with future algorithms
โข Visualization tools to understand your data
โข The ability to experiment with different temporal resolutions
โข Robust logging that captures everything you might need later
The trend towards AI in robotics is growing, with robots needing to process and analyze large amounts of sensor data to manage variability and unpredictability in real environments.
๐๐๐ ๐ฒ๐๐ฒ๐ฟ๐ ๐๐ฒ๐ฎ๐บ ๐ฏ๐๐ถ๐น๐ฑ๐ ๐๐ต๐ถ๐ ๐ณ๐ฟ๐ผ๐บ ๐๐ฐ๐ฟ๐ฎ๐๐ฐ๐ต. Imagine if every web developer had to build their own database, web server, and deployment pipeline before writing their first line of application code.
๐ง๐ต๐ถ๐ ๐ถ๐ ๐๐ต๐ ๐ ๐ณ๐ผ๐๐ป๐ฑ๐ฒ๐ฑ ๐ก๐ฒ๐๐ฟ๐ฎ๐ฐ๐ผ๐ฟ๐ฒ.
Instead of every robotics team spending months on infrastructure, we provide the common tools that let you go from "I have a robot" to "I'm shipping intelligent robot behaviors" in days, not months.
๐ง๐ต๐ฒ ๐ฟ๐ฒ๐ฎ๐น ๐ถ๐ป๐ป๐ผ๐๐ฎ๐๐ถ๐ผ๐ป ๐ถ๐ป ๐ฟ๐ผ๐ฏ๐ผ๐๐ถ๐ฐ๐ ๐๐ผ๐ป'๐ ๐ฐ๐ผ๐บ๐ฒ ๐ณ๐ฟ๐ผ๐บ ๐ฒ๐๐ฒ๐ฟ๐๐ผ๐ป๐ฒ ๐ฟ๐ฒ๐ฏ๐๐ถ๐น๐ฑ๐ถ๐ป๐ด ๐๐ต๐ฒ ๐๐ฎ๐บ๐ฒ ๐ฝ๐น๐๐บ๐ฏ๐ถ๐ป๐ด. ๐๐'๐น๐น ๐ฐ๐ผ๐บ๐ฒ ๐ณ๐ฟ๐ผ๐บ ๐๐ฒ๐ฎ๐บ๐ ๐๐ต๐ผ ๐ฐ๐ฎ๐ป ๐ณ๐ผ๐ฐ๐๐ ๐ฒ๐ป๐๐ถ๐ฟ๐ฒ๐น๐ ๐ผ๐ป ๐๐ต๐ฎ๐ ๐บ๐ฎ๐ธ๐ฒ๐ ๐๐ต๐ฒ๐ถ๐ฟ ๐ฎ๐ฝ๐ฝ๐น๐ถ๐ฐ๐ฎ๐๐ถ๐ผ๐ป ๐๐ป๐ถ๐พ๐๐ฒ.
Robot learning shouldn't be bottlenecked by infrastructure. It should be bottlenecked by creativity.
What's the longest you've spent building infrastructure before getting to the actual robotics problem you wanted to solve?
๐จ Googleโs MedGemma & OpenAIโs GPT-4o are impressive, but their openness is limitedโeither fully closed-source or releasing only weights without data/training code.
๐ฅ Meet MedVLThinker โ a fully open multimodal medical reasoning recipe that matches their performance.
Simple. Transparent. Reproducible.
๐ Project: https://t.co/DUarGuG7et
๐ Paper: https://t.co/wRcZQoxZPb
"My research group, the Interfaces-Oxides Nexus Electrochemical Laboratory (IONE Lab), at Arizona State University, has one more opening for a graduate student. If you are interested in electrochemical interfaces and epitaxial metal oxides, contact me via email ([email protected]). "-Prof. Sanaz Koohfar
Are you planning to apply for faculty positions in North America this Fall? Our team of researchers have synthesized 3 years of applicant survey data to create a dashboard to allow assistant professor candidates to see where they stand.
https://t.co/zgKXcD0mD1
Prospective MSc and PhD students can find in the link below a list of professors from various departments who recently received research grants, along with their grant titles.
https://t.co/wWFIULZVpH
LLMs for Engineering
Finds that when RL is used, a 7B parameter model outperforms both SoTA foundation models and human experts at high-powered rocketry design.
What a crazy week in AI ๐คฏ
- Kling 2.0 AI video
- Canva Visual Suite 2.0
- Microsoft Copilot Vision
- Grok Studio and Memories
- ChatGPT 4.1, o3, & o4-mini
- OpenAIโs new coding agent
- ByteDance Seaweed AI video
- Claude Autonomous Research
Hereโs EVERYTHING you need to know:
I asked o3, โWhat are the top 10 most important questions or challenges that need to be solved or answered in T cell immunobiology?โ This is what I have been studying for the past 35 years.
o3 came up with not one, not two, not three, but 10/10 top-notch questions that I could not improve upon!
Not only that, it explained why each of these questions matters, outlined the roadblocks and leverage points, and offered insights that were simply amazing! At the end it even unified and linked all these ๐คฏ (shared in the thread).
This is a topic I understand extremely deeply, yet o3 effortlessly identified the best 10 questions in my field-better rate than I could have myself!
Here are the first 5 grand questions; the rest are in the thread. Each one them is so densely packed with knowledge that unpacking them would require PhD-level courses! This is why I called o3 genius-level, itโs just too damn smart!
Top 10 open problems that will decide the next decade of Tโcell immunobiology:
1.Crack the โTCR codeโ at scale
Grand question: Can we reliably predict which peptideโMHC (or CD1/MR1) any given Tโcell receptor will recogniseโand design new receptors on demand?
Why it matters: Unlocking this would transform cancer neoantigen targeting, infectiousโdisease vaccines, autoimmunity screening and truly personalised immunology.
Roadblocks: Astronomical sequence space, crossโreactivity, sparse groundโtruth data, and contextโdependent recognition rules.
Leverage points: Deepโlearning models trained on multiโomic singleโcell datasets, highโthroughput yeast/RNA display pMHC libraries, and federated dataโsharing across clinics.
2.Rewire exhaustion and senescence instead of merely blocking checkpoints
Grand question: How do we durably restore proliferative capacity and killing potential in chronically stimulated or aged T cells without triggering cytokine storms?
Why it matters: Exhaustion limits CARโT efficacy, chronic infection control, and antiโtumour immunity; senescence fuels โinflammโagingโ.
Roadblocks: Layered transcriptional/epigenetic locks, mitochondrial dysfunction, and suppressive metabolites in the TME.
Leverage points: Metabolic rewiring (FAOโโโglycolysis toggling), chromatin editing, combinatorial cytokine cocktails, and synthetic โresetโ circuits.
3.Master tissueโresident memory (TโฏRM) formation and control
Grand question: What signals decide when effector T cells park longโterm in skin, gut, lung or brain, and how can we amplify or silence them at will?
Why it matters: TโฏRM cells underpin frontline immunity against respiratory and mucosal pathogens, drive relapse in psoriasis/vitiligo, and protectโor aggravateโtumours.
Leverage points: Decoding local cytokine + metabolic niches, microbiome crosstalk, and TRMโstromal cell interactions; epigenetic โTRM locksโ as drug targets.
4.Make engineered T cells work in solid tumours
Grand question: How do we ensure CAR/TCRโT cells infiltrate, persist and kill in hypoxic, cytokineโhostile, antigenโheterogeneous solid tumours?
Roadblocks: Physical exclusion, suppressive myeloid nets, onโtarget offโtumour toxicity, and antigen escape.
Leverage points: Logicโgated multiโantigen CARs, chemokineโarmoured cells, regional (e.g., intratumoural) delivery, and onโdemand suicide switches.
https://t.co/Hb9fZHaxcb truly universal, offโtheโshelf T cells
Grand question: Can we generate banked iPSCโderived or genomeโedited T cells that evade host rejection and GVHD yet retain vigorous function?
Why it matters: Autologous products are expensive, slow and inconsistent; scalable allogeneic cells would democratise cellular therapy and allow rapid pandemic response.
Roadblocks: HLA matching, alloreactivity, residual pluripotent cells, manufacturing QC, and regulatory hurdles.
Leverage points: MHCโknockout plus HLAโE/G overโexpression, multiplex baseโediting, orthogonal cytokine receptors, closedโsystem robotic production.
Weโre opening the doors to DAIR Academy for students.
Students can now get full access to our Academy for free for a month!
Reach out with a .edu email (or equivalent), and start learning how to use AI, build AI Agents, use LLMs for learning and work, and access our community with weekly events hosted by industry professionals building with AI.
The offer runs until the end of April, so reach out soon!
Email us at: [email protected] for access!
Libraries and tools that every deep learning project should use: loguru, tqdm, torchmetrics, einops, python 3.11, black. Optional: prettytable. Good for debugging: lovely_tensors. Any other ones I've missed?
Below a few words on each of them:
๐ Excited to launch my new course, Introduction to AI Agents!
Every AI dev and company Iโve worked with this year is keen to build with AI Agents.
Iโve worked hard to put all my learnings into this course so you too can learn how to build agentic AI systems.
Material ranges from fundamentals to practical tips to help you build advanced agentic workflows.
Enroll now: https://t.co/52C6RzuIKc
This course is for you if you are looking to apply AI agents in a professional setting. No programming is required for this course!
Use code AGENTS20 to get an extra 20% discount now. (Prices will increase soon so make sure to take advantage of the current offer)