Vedhas Deshpande

Verified account

@VedhasD

Engineer. I write what I find interesting. Keep some sense of humor. Not an investment advice, do your own due diligence. blog:

Joined January 2011

1K Following

749 Followers

5.8K Posts

Pinned Tweet

Vedhas Deshpande

about 1 month ago

I have been reading about how people are buying Mac Minis specifically for setting up LLMs and OpenClaw locally. That’s what prompted me to check if my own modest rig was sufficient to run LLMs. To my surprise, it is—specifically for Small Language Models (SLMs). On my modest rig: AMD Ryzen 5 (AM4) + MSI B550 - PCIE 4.0 16 GT/s, RAM - 32GB NVIDIA Titan Xp – 12GB VRAM (~2017 card 😐) I am able to run gemma4:E4B and with a context of ~32k can successfully develop simple scripts using opencode using ollama. <link below>

VedhasD's tweet photo. I have been reading about how people are buying Mac Minis specifically for setting up LLMs and OpenClaw locally. That’s what prompted me to check if my own modest rig was sufficient to run LLMs. To my surprise, it is—specifically for Small Language Models (SLMs).

On my modest rig:
AMD Ryzen 5 (AM4) + MSI B550 - PCIE 4.0 16 GT/s, RAM - 32GB

NVIDIA Titan Xp – 12GB VRAM (~2017 card 😐)

I am able to run gemma4:E4B and with a context of ~32k can successfully develop simple scripts using opencode using ollama.

<link below>

2

1

0

0

381

Vedhas Deshpande

about 8 hours ago

Perfect size for mid level VRAM GPUs

about 8 hours ago

Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇

googlegemma's tweet photo. Meet Gemma 4 12B!

A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.

Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇 https://t.co/gf4FZv0WZb

248

8K

1K

3K

1M

0

0

0

0

29

Vedhas Deshpande

2 days ago

New Money: Anthropic, OpenAI and SpaceX IPOs New Value Stocks: $GOOGL $AMZN $MSFT $AAPL

0

0

0

0

230

Vedhas Deshpande

3 days ago

@xadhish @Google Congratulations 🎉!

0

0

0

0

90

Who to follow

your local self-inflicted housing crisis ouroboros

Verified account

@itsahousingtrap

Solving your nearby housing crisis through participatory planning and angular planing.

This meal is BEEF-free

Verified account

||software engineering|| investing|| fitness|| movies||

Vedhas Deshpande

3 days ago

I have been playing around with llama.cpp on windows and wsl2 ubuntu. I was using the same model file downloaded in windows on wsl2 via /mnt/c. On windows llama would quickly load model files into the VRAM but not on wsl2. TIL wsl2 uses a network protocol 9P to bridge the file system between NTFS (win) and ext4(linux). Hence the system calls are slow when made in wsl2 on windows drive.

0

0

0

0

42

Vedhas Deshpande

3 days ago

@pitdesi @jaredhecht Same vibe:

VedhasD's tweet photo. @pitdesi @jaredhecht Same vibe: https://t.co/2qFJrJfryh

1

1

0

0

29

Vedhas Deshpande

3 days ago

It’s actually bought an RTX 3090 instead of Porsche.

Itamar Golan 🤓

3 days ago

Men in their 40s used to have cool midlife crises… now they just have agentic workflows. Bought a Claude subscription instead of a Porsche lmao.

65

2K

109

155

163K

0

1

0

0

61

Vedhas Deshpande

3 days ago

This is good advice. Even if you work at a higher level of abstraction, it is beneficial to know the space and time layout of the system.

4 days ago

The best thing you can do as an engineer is learn C and C++ deeply. once you see memory, CPUs, caches, threads, and I/O without layers of abstraction hiding them, you start appreciating how much of modern computing is actually systems programming. even a bit of assembly, hehe :)

20

812

69

338

28K

1

3

0

2

846

Vedhas Deshpande

4 days ago

I can't answer your original question. Regarding market share, that might be true until Q1 2026, I believe. Post-Q1, token spend is being rationed, and frontier model availability has also suffered. In comparison, I have been defaulting to Cursor 2 and 2.5, which are based on Kimi 2 and 2.6. There's real value in open-source models, but I don't know how sustainable it is. I am seeing a lot of on-prem capacity building with Gemma 4 and Qwen 3.6. I think it's going to be hybrid in the future, with corporates building on-prem OSS models and reserving frontier-level models for high-end, difficult, and non-sensitive tasks.

0

1

0

0

42

Vedhas Deshpande

4 days ago

What’s a lightweight PDF alternative for Adobe on Mac? I don’t like the Adobe overlays in the Chrome extension. It’s too bloated, and the app is sticky—I always have to force quit it when installing a macOS update.

0

0

0

0

42

Vedhas Deshpande

4 days ago

@typesfast My version: When my selected options translate correctly across codeshare international flights.

0

1

0

0

110

Vedhas Deshpande

4 days ago

@NoahKingJr It depends on which harness you say hello 😂

0

0

0

0

62

Vedhas Deshpande

4 days ago

Trimble $TRMB linked SketchUp with Anthropic's Claude AI to bring conversational, AI-powered capabilities directly into 3D modeling workflows. https://t.co/Tm8wlWJw7m

0

1

0

0

93

Vedhas Deshpande

4 days ago

I remember playing with Google SketchUp software. Today I checked if it was in the Google Graveyard list, and it turns out it is not. It was sold to Trimble Inc. in 2012. It used to be my favorite indoor pastime during monsoon rains.

VedhasD's tweet photo. I remember playing with Google SketchUp software.

Today I checked if it was in the Google Graveyard list, and it turns out it is not.

It was sold to Trimble Inc. in 2012.

It used to be my favorite indoor pastime during monsoon rains. https://t.co/Ua5uHZ4tAK

5 days ago

It’s never been easier to design your dream house. Draw a shape. Define your rooms. Set your constraints. @DraftedAI generates complete floor plans, elevations, and 3D home designs in seconds. Over the last month, 120,000 people generated 325,000+ home designs with https://t.co/XqC0LP5n3y.

187

4K

341

5K

734K

1

1

0

1

177

Vedhas Deshpande

5 days ago

Is there a way I can merge 2-3 separate chats into one in gemini app?

0

0

0

0

27

Vedhas Deshpande

5 days ago

@david_nix There’s technically 2x PCIe x1 expansion slot but not for GPUs. It’s something like wifi/bt nic upgrades.

0

1

0

0

66

Vedhas Deshpande

5 days ago

Based. I have been postponing building llama.cpp and been using ollama. Now I can try this quickly!

Georgi Gerganov

5 days ago

llama.cpp now has an official website: https://t.co/vztdUpdBWL Our goal is to make local AI accessible to everyone, and improving the user experience is a big part of that. On the new landing page you’ll find a single-line cross-platform installer. The installation provides a single unified `llama` entrypoint which you can use to run/serve models and interface with 3rd-party agentic applications. While oriented towards simplified user experience, the new `llama` application also provides all the advanced functionality of the existing llama.cpp tooling with which experienced users are already familiar. Also note that all GGUF models that you might have already downloaded with llama.cpp in the past will be automatically available to use without downloading again (they are stored in the common HF cache on your machine). We have many improvements in the pipeline both at the UX and at the engine level and we plan to iteratively ship new things over the coming months. One of the main focuses will be seamless integration with local-friendly 3rd-party agents (such as Pi). In the meantime, we’ll continue to listen for feedback from the community and adjust accordingly, so keep letting us know what you think and need.

95

3K

485

1K

160K

0

0

0

0

45

Vedhas Deshpande

6 days ago

@david_nix Me too, every man's pre-midlife crisis :D.

1

1

0

0

59

Vedhas Deshpande

6 days ago

@paulg Few people have this luxury.

0

0

0

0

36

Vedhas Deshpande

6 days ago

everything is cyclic. it's good that hardware is getting the spot light but when the supply, demand and capacity stabilizes the spotlight moves to next money making layer (SaaS in the last decade). Hopefully the spotlight remains on hardware, infra, energy, quantum, and materials for long. :)

0

1

0

0

48

VedhasD retweeted

7 days ago

magicsilicon's tweet photo. https://t.co/UCI4b7dau5

1

99

11

3

8K

Last Seen Users on Sotwe

Trends for you

Most Popular Users