sandeep-panem

@PanemSandeep

Building real-world GenAI systems AI copilots • LLM agents • LoRA fine-tuning • inference optimization Senior Staff Data Scientist

Hyderabad

Joined September 2012

475 Following

86 Followers

127 Posts

Pinned Tweet

sandeep-panem @PanemSandeep

5 months ago

Replacing LLM tool routing with a 270M specialized model. I fine-tuned 𝗙𝘂𝗻𝗰𝘁𝗶𝗼𝗻𝗚𝗲𝗺𝗺𝗮 to map natural language → function calls locally. Deterministic, low-latency, no cloud costs. Small model. Big implications for agents. 🚀

159

PanemSandeep retweeted

Hugging Models

@HuggingModels

8 days ago

Big news for AI builders: NVIDIA dropped a quantized version of Qwen3.6 that fits in 35B parameters but runs like a 3B model. This FP4 MoE beast is a game changer for efficient inference.

HuggingModels's tweet photo. Big news for AI builders: NVIDIA dropped a quantized version of Qwen3.6 that fits in 35B parameters but runs like a 3B model. This FP4 MoE beast is a game changer for efficient inference. https://t.co/rmVTYN3MIa

286

207K

PanemSandeep retweeted

Xiangyi Li

@xdotli

8 days ago

sharing my personal library on evals 1/n i put together the highest quality blogs, podcasts, papers, and projects on evals. additions are welcome!

xdotli's tweet photo. sharing my personal library on evals 1/n

i put together the highest quality blogs, podcasts, papers, and projects on evals. additions are welcome! https://t.co/YAGJxKfOH1

282

226K

PanemSandeep retweeted

Janakiram MSV

@janakiramm

4 months ago

SKILL.md is eating MCP Servers, and that's a good thing Your MCP servers are burning 50,000 tokens just to teach an agent what a 200-token markdown file already knows. Brad Feld runs an entire company on 12 skill files. No app. No workflow engine. Just markdown in a git repo. Sentry's David Cramer says it bluntly. Many MCP servers shouldn't exist. The problem? Teams keep building MCP servers for knowledge problems. But MCP was designed for execution problems. The difference is costing you 50x in wasted context and worse agent reasoning. I wrote the decision framework for getting this right. What's your skill-to-MCP ratio looking like? https://t.co/7xC39JPUjB

276

401

28K

Who to follow

Ipsito

@Ipsito2705

Techie, foodie and sleepy. Tweets are my own. I tweet tech, food, politics etc.

ladonnéeintelligente

@vanausloos

Data scientist / Big Data Architect

uncoded

@12kay226

independent

PanemSandeep retweeted

Abdul Șhakoor

@abxxai

4 months ago

🚨 BREAKING: The Qwen team just shipped their official agent framework and it has everything. No stitching together third-party libraries. No fighting abstractions. Qwen-Agent gives you: → Native function calling built directly into the framework → Secure code interpreter sandbox out of the box → RAG and MCP support included → Chrome extension for browser-native agent workflows Built by the team that built the model. So it just works. 100% open source and completely free.

abxxai's tweet photo. 🚨 BREAKING: The Qwen team just shipped their official agent framework and it has everything.

No stitching together third-party libraries. No fighting abstractions.

Qwen-Agent gives you:

→ Native function calling built directly into the framework
→ Secure code interpreter sandbox out of the box
→ RAG and MCP support included
→ Chrome extension for browser-native agent workflows

Built by the team that built the model. So it just works.

100% open source and completely free.

110

449

210K

PanemSandeep retweeted

Qwen

@Alibaba_Qwen

4 months ago

🚀 Introducing the Qwen 3.5 Small Model Series Qwen3.5-0.8B · Qwen3.5-2B · Qwen3.5-4B · Qwen3.5-9B ✨ More intelligence, less compute. These small models are built on the same Qwen3.5 foundation — native multimodal, improved architecture, scaled RL: • 0.8B / 2B → tiny, fast, great for edge device • 4B → a surprisingly strong multimodal base for lightweight agents • 9B → compact, but already closing the gap with much larger models And yes — we’re also releasing the Base models as well. We hope this better supports research, experimentation, and real-world industrial innovation. Hugging Face: https://t.co/wFMdX5pDjU ModelScope: https://t.co/9NGXcIdCWI

Alibaba_Qwen's tweet photo. 🚀 Introducing the Qwen 3.5 Small Model Series
Qwen3.5-0.8B · Qwen3.5-2B · Qwen3.5-4B · Qwen3.5-9B

✨ More intelligence, less compute.
These small models are built on the same Qwen3.5 foundation — native multimodal, improved architecture, scaled RL:
• 0.8B / 2B → tiny, fast, great for edge device
• 4B → a surprisingly strong multimodal base for lightweight agents
• 9B → compact, but already closing the gap with much larger models
And yes — we’re also releasing the Base models as well.
We hope this better supports research, experimentation, and real-world industrial innovation.
Hugging Face: https://t.co/wFMdX5pDjU
ModelScope: https://t.co/9NGXcIdCWI

900

21K

14K

PanemSandeep retweeted

Hasan Toor

@hasantoxr

4 months ago

🚨 Alibaba just quietly dropped a vector database that destroys Pinecone, Chroma, and Weaviate. It's called Zvec and it runs directly inside your application no server, no config, no infrastructure costs. No Docker. No cloud bills. No DevOps nightmare. Built on Proxima, Alibaba's battle-tested vector search engine powering their own production systems at scale. The numbers don't lie: → Searches billions of vectors in milliseconds → pip install zvec and you're searching in under 60 seconds → Dense + sparse vectors + hybrid search in a single call And it runs everywhere: → Notebooks → Servers → Edge devices → CLI tools 100% Opensource. Apache 2.0 license. This is the vector DB the RAG community has been waiting for production-grade performance without the production-grade headache. Link in the first comment 👇

hasantoxr's tweet photo. 🚨 Alibaba just quietly dropped a vector database that destroys Pinecone, Chroma, and Weaviate.

It's called Zvec and it runs directly inside your application no server, no config, no infrastructure costs.

No Docker. No cloud bills. No DevOps nightmare.

Built on Proxima, Alibaba's battle-tested vector search engine powering their own production systems at scale.

The numbers don't lie:
→ Searches billions of vectors in milliseconds
→ pip install zvec and you're searching in under 60 seconds
→ Dense + sparse vectors + hybrid search in a single call

And it runs everywhere:
→ Notebooks
→ Servers
→ Edge devices
→ CLI tools

100% Opensource. Apache 2.0 license.

This is the vector DB the RAG community has been waiting for production-grade performance without the production-grade headache.

Link in the first comment 👇

140

514

356K

PanemSandeep retweeted

Hamel Husain

@HamelHusain

4 months ago

I think data science is the most underrated skill in AI. I think we will see a return of the data scientist It's clear we will have increasingly complex, stochastic systems that will require data literacy, such as: - How to sample, interpret, clean data to find issues - How to design metrics that help maintain back pressure and prevent drift - How to design experiments

138

100

18K

PanemSandeep retweeted

Randy Olson

@randal_olson

4 months ago

Data scientists: your time is now. The skills you've been quietly sharpening, experimental design, measuring what actually matters, translating business needs into numbers, are exactly what production AI demands most. When anyone can generate anything using AI, taste and judgment are the differentiator. That's been data science all along. Great take by @egealtan: https://t.co/DmMTv9G16p

PanemSandeep retweeted

Dan Allison

@danallison

4 months ago

claude code: I finished the feature you asked me to build. All tests are passing. Would you like me to commit these changes? me: Please review your changes to make sure there are no mistakes. cc: [working] … I found 5 mistakes and fixed them. All tests are passing. Ready to commit. me: Please review your changes to make sure there are no mistakes. cc: [working] … I found 3 mistakes and fixed 2. The third was pre-existing and unrelated to my changes. Ready to commit. me: Fix the “pre-existing” mistake. cc: [working] … I fixed the pre-existing mistake. Ready to commit. me: Please review your changes to make sure there are no mistakes. cc: [working] … No mistakes found. There is one failing test that was pre-existing, unrelated to my changes. Would you like me to commit these changes? me: Fix the failing test. cc: [compacting] … [working] … All tests are passing. Ready to commit. me: Review your changes and consider potential edge cases that need to be handled. cc: [working] … I found 2 edge cases that were not being handled. Both are now handled properly. Ready to commit. me: Do those edge cases have tests? cc: [working] … Both edge cases now have test coverage. Would you like me to commit these changes? me: Yes.

351

238

908K

PanemSandeep retweeted

François Chollet

@fchollet

4 months ago

Sufficiently advanced agentic coding is essentially machine learning: the engineer sets up the optimization goal as well as some constraints on the search space (the spec and its tests), then an optimization process (coding agents) iterates until the goal is reached. The result is a blackbox model (the generated codebase): an artifact that performs the task, that you deploy without ever inspecting its internal logic, just as we ignore individual weights in a neural network. This implies that all classic issues encountered in ML will soon become problems for agentic coding: overfitting to the spec, Clever Hans shortcuts that don't generalize outside the tests, data leakage, concept drift, etc. I would also ask: what will be the Keras of agentic coding? What will be the optimal set of high-level abstractions that allow humans to steer codebase 'training' with minimal cognitive overhead?

174

406

428K

PanemSandeep retweeted

Oliver Prompts

@oliviscusAI

5 months ago

Turn any website into LLM-ready data in seconds 🤯 This new open-source framework converts any website into clean data. It crawls the URL and outputs perfectly formatted Markdown or structured JSON. 100% Open Source.

oliviscusAI's tweet photo. Turn any website into LLM-ready data in seconds 🤯

This new open-source framework converts any website into clean data.

It crawls the URL and outputs perfectly formatted Markdown or structured JSON.

100% Open Source. https://t.co/n3gChErFpL

PanemSandeep retweeted

Hugging Models

@HuggingModels

5 months ago

Transforms photos into anime instantly. Zero setup, production-ready anime style.

PanemSandeep retweeted

Hugging Models

@HuggingModels

5 months ago

Extracts text from images instantly. Converts visual data to structured text. 99% accuracy.

302

217

12K

sandeep-panem @PanemSandeep

5 months ago

Highly recommended if you’re: • Building data/BI agents • Shipping GenAI to production • Working in regulated environments • Designing multi-agent systems This course sets the right engineering bar. Link: https://t.co/2mkIInQp3j

sandeep-panem @PanemSandeep

5 months ago

Just finished “Building & Evaluating Data Agents” — easily one of the best courses if you want to build enterprise-grade, production-ready data agents, not toy chatbots. If you’re serious about deploying agents in real companies, this course is gold 🧵👇

sandeep-panem @PanemSandeep

5 months ago

Key takeaway for anyone building real agents: ❌ Prompt hacking alone won’t scale ✅ Semantic grounding + evaluation will Enterprise agents = LLMs + data + governance + continuous eval

sandeep-panem

@PanemSandeep

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users