Sumit Datta @sumitdatta - Twitter Profile

Pinned Tweet

9 months ago

Sometimes it takes years for an idea to work out. I own my product's domain since 2013! Chased and failed multiple times. It was beyond what a solo founder could do. Then came LLMs, they got better at code. I restarted my product. Fresh perspective and a decade of attempts.

0

8

0

599

Sumit Datta @sumitdatta

about 11 hours ago

@carldlfr @sudoingX Well said. We are already sliding down a slippery path with all the existing manufacturing and pressure from AI. Also, I doubt things will change for better.

1

0

52

sumitdatta retweeted

Groktopus @groktopus

2 days ago

@CommandCodeAI GLM is way cheaper (especially now that Sonnet 5 is out) and the Chinese government is more trustworthy than Anthropic.

0

32

1

0

1K

Sumit Datta @sumitdatta

1 day ago

@0xSero And now you were have a bunch of anonymous folks coming and telling you otherwise. Slaves of big-cos are everywhere.

0

3

0

113

Who to follow

Streamlit

@streamlit

Streamlit is an open-source Python framework for data scientists and AI/ML engineers to deliver dynamic data apps -- in only a few lines of code.

Rasul Kireev

@rasulkireev

https://t.co/yDkike0Xy6 ⋅ https://t.co/vpYbPTyU7S ⋅ https://t.co/KLTVYWHqGC ⋅ https://t.co/C99HwbyfUX ⋅ https://t.co/9h05z9Ws6q ⋅ https://t.co/fWLopxB4N5 ⋅ https://t.co/a0s0VxEI4k ⋅ https://t.co/FXdOIjKzBF

黄未原 Weiyuan Huang

@WeiyuanOttawa

Chinese by birth, Canadian by choice. Ph.D, the Chinese translator of Dr. Bourke’s History of Ethics. 蓝天号: https://t.co/KlshOanCYd

Sumit Datta @sumitdatta

1 day ago

@ClementDelangue @Stanford This is something I try regularly and I am building for. I think smaller models need different harnesses and can unlock so much potential. Since the big LLMs became popular we forgot that deterministic code used to be s3xy - fast, reliable. Sprinkle agent logic where needed.

0

10

sumitdatta retweeted

clem 🤗

@ClementDelangue

3 days ago

A study from @Stanford showed that 71.3% of chatgpt queries could be accurately answered by a local model. I suspect a major part of enterprise AI workloads could be run locally too for free (compared to the massive costs of frontier API cost). Also, it reduces the risk of these workloads being taken away from you because you own the models instead of renting them - which sounds like a good idea these days haha. That's why we're introducing the ability for everyone to filter AI models on @huggingface based on your local hardware. For me, there are 800k+ public models that fit on my M5 24GB and that I can use easily thanks to llamacpp. Let's go local AI!

ClementDelangue's tweet photo. A study from @Stanford showed that 71.3% of chatgpt queries could be accurately answered by a local model. I suspect a major part of enterprise AI workloads could be run locally too for free (compared to the massive costs of frontier API cost).

Also, it reduces the risk of these workloads being taken away from you because you own the models instead of renting them - which sounds like a good idea these days haha.

That's why we're introducing the ability for everyone to filter AI models on @huggingface based on your local hardware.

For me, there are 800k+ public models that fit on my M5 24GB and that I can use easily thanks to llamacpp.

Let's go local AI!

183

2K

263

776

201K

Sumit Datta @sumitdatta

2 days ago

@sflorimm Not everyone can buy the hardware. I cannot. But I can rent. So a p2p network to run inference is good solution right? Like vast but more like SETI@home. I choose LLM and expected TPS within a budget and start sending requests to someone's idle GPU.

0

13

Sumit Datta @sumitdatta

5 days ago

@sudoingX I am GPU poor and have an M4 Mac Mini 16GB and an RTX 3060 6GB (laptop). But I am planning a 5070 Ti 16GB. Can't afford more than this.

0

45

Sumit Datta @sumitdatta

5 days ago

@Star_Knight12 Closed sourced models were not free when they were Sonnet 4, GPT 4 or Opus 4 levels. Open weight models reached Sonnet 4 levels and lots of people use them for free. 6-9 months delay. This has happened many times now that I believe it will continue to happen, but we shall see.

0

18

Sumit Datta @sumitdatta

5 days ago

@Hikari_07_jp I am building a coding agent for small/tiny LLMs. Almost no tool-calls in most of the agent flows. No skills or MCP. An opinionated tech stack and deterministic paths. One-shot prompts with examples to generate specific parts of a CRUD app. https://t.co/Hbszk77hxl

0

1

0

21

Sumit Datta @sumitdatta

6 days ago

@LyalinDotCom @0xSero I am sure if Google and others could, they would extract our reasoning summaries.

0

7

Sumit Datta @sumitdatta

6 days ago

@edandersen Is it OK if I come back to you in some time when my coding agent is ready for a beta test? https://t.co/Hbszk77hxl

0

9

Sumit Datta @sumitdatta

6 days ago

@MiaAI_lab If the model is bad at tool calling but good at reasoning, I am going to love it. There are other ways to build agents that avoid tool calling.

0

25

Sumit Datta @sumitdatta

6 days ago

@AlexFinn If by any chance you let strangers use these over SSH, I will sign up from the other side of the world!

0

23

Sumit Datta @sumitdatta

6 days ago

Big step for local LLMs! I am particularly interested in 9B model. Going to try this with https://t.co/Hbszk77hxl Currently busy building a decision provenance graph - PRD to business logic expressed in FSMs and verify provenance statically. Ornith may be the default LLM.

Ornith

@ornith_

8 days ago

Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans the full parameter sizes including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks including: ✅Terminal-Bench 2.1(77.5) ✅SWE-Bench(82.4 on verified, 62.2 on pro, 78.9 on Multilingual) ✅NL2Repo(48.2) ✅SWE Atlas(41.2 on QnA, 42.6 RF, 39.1 TW) ✅ClawEval(77.1) Post-trained on top of gemma4 and qwen3.5, Ornith-1.0 employs a novel self-improving training strategy in which reinforcement learning is used to generate not only solution rollouts, but also the task-specific scaffolds that drive those rollouts. By jointly optimizing the scaffold and the resulting solution, the model generate higher-quality solutions in agentic coding.😎 All models are released under the MIT license, enabling full commercial and research use. 📖Tech Blog: https://t.co/qT9N2HYWFn 🤗Huggingface: https://t.co/PRrwqjeBtM

ornith_'s tweet photo. Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding.

Ornith-1.0 spans the full parameter sizes including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks including:
✅Terminal-Bench 2.1(77.5)
✅SWE-Bench(82.4 on verified, 62.2 on pro, 78.9 on Multilingual)
✅NL2Repo(48.2)
✅SWE Atlas(41.2 on QnA, 42.6 RF, 39.1 TW)
✅ClawEval(77.1)

Post-trained on top of gemma4 and qwen3.5, Ornith-1.0 employs a novel self-improving training strategy in which reinforcement learning is used to generate not only solution rollouts, but also the task-specific scaffolds that drive those rollouts. By jointly optimizing the scaffold and the resulting solution, the model generate higher-quality solutions in agentic coding.😎

All models are released under the MIT license, enabling full commercial and research use.

📖Tech Blog: https://t.co/qT9N2HYWFn
🤗Huggingface: https://t.co/PRrwqjeBtM

494

7K

1K

7K

5M

0

73

Sumit Datta @sumitdatta

6 days ago

@hxtxmu @david_nix If you have a background assistant does it matter if your chores are being done at 1500 tps? If you are interacting, yes it matters. But so much processing will be background if we think of a future where every single business or power user has their assistant.

1

0

56

sumitdatta retweeted

kk

@JsnVem

20 days ago

Somewhere out there is a founder who automated half their company with AI agents and has literally nobody to talk to about it. Building a room for exactly that person. Invite-only, vetted. Apply here:

JsnVem's tweet photo. Somewhere out there is a founder who automated half their company with AI agents and has literally nobody to talk to about it.

Building a room for exactly that person. Invite-only, vetted. Apply here:

25

530

32

330

523K

Sumit Datta @sumitdatta

21 days ago

@alexocheema I am seeing Tweets, signed up for access. Please let this not be a marketing trap. I am building https://t.co/Hbszk77hxl - coding agent for small, local models only. I don't know where it will end up but it is fun to see 0.8B-4B models writing code! Stack constrained though.

0

41

Sumit Datta @sumitdatta

21 days ago

@ItsmeAjayKV Depending on your ask, small models can punch way above their weight. I am creating a coding agent that is stack focused - agents for Model, Schema, Controller, Auth, Permissions, etc. Then agents for frontend. Even 0.8B model shows promise this way. https://t.co/Hbszk77hxl

0

35

Sumit Datta @sumitdatta

21 days ago

@KyleHessling1 I'm building a coding agent with (Qwen or other) 0.8B - 9B models. I'm looking at MTP, have'nt tried. Qwopus looks fantastic and if Jackrong has a 9B version, I will try. Is @NousResearch Hermes focused on small models? I am doing that, but very early: https://t.co/Hbszk77hxl

0

2

0

1

438

Sumit Datta

@sumitdatta

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users