ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1 - Twitter Profile

Pinned Tweet

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

about 1 month ago

Got featured in @ZeeNews Trusted leaders to look out for in 2026 https://t.co/YmbmuQ03tZ

0

1

0

29

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

about 1 month ago

Great to see @ParallelDots featured among Brands to Watch in 2026 by @htTweets. Building ShelfWatch from India to serve Fortune 500 consumer brands in 50+ countries has been one heck of a ride. The best is ahead. 🙏 https://t.co/Vmmf66j9Qc

0

19

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

about 1 month ago

@MohapatraHemant 100% agree. We’ve been selling AI-powered SKU recognition for 6+ years. Usage-based pricing isn’t a choice, it’s a constraint. Any other model either hides cost unpredictability or forces you to degrade UX with limits, batching, or throttling.

0

186

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

about 1 month ago

AI's two-country race just became three. https://t.co/7oIRCXe4Jr

0

22

ankitnarayan1 retweeted

Forbes Technology Council @ForbesTechCncl

about 1 month ago

How To Stop Feature Creep And Prioritize Product Value https://t.co/a2zSGaMeDY from @kedkorte @ankitnarayan1 @zentrumhub @bhivanov @uttam_alld @iamsalimg and more

0

1

0

74

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

about 2 months ago

@Prathkum Google lost pretty soon. Seriously, they have the best products but the poorest integrations!

0

161

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

about 2 months ago

This is exactly the kind of work that makes AI practical for enterprises. When inference gets 5x cheaper at the edge with zero accuracy loss, every enterprise gets unlimited budget to experiment!

Tom Turney

@no_stp_on_snek

2 months ago

I implemented Google's TurboQuant paper (ICLR 2026) in llama.cpp with Metal kernels for Apple Silicon.

4.9× KV cache compression. Working end-to-end on M5 Max with Qwen 3.5 35B MoE and Qwopus v2 27B.

Speed needs work (unoptimized shader), compression target met.

Repo: https://t.co/7aUaWo7Mm1

**Note**: as you'll see from the git when I say "I" it's in conjunction with claudecode and codex. Just lots of steering and babysitting.

26

388

46

254

120K

0

53

ankitnarayan1 retweeted

Tom Turney

@no_stp_on_snek

2 months ago

I implemented Google's TurboQuant paper (ICLR 2026) in llama.cpp with Metal kernels for Apple Silicon.

4.9× KV cache compression. Working end-to-end on M5 Max with Qwen 3.5 35B MoE and Qwopus v2 27B.

Speed needs work (unoptimized shader), compression target met.

Repo: https://t.co/7aUaWo7Mm1

**Note**: as you'll see from the git when I say "I" it's in conjunction with claudecode and codex. Just lots of steering and babysitting.

26

388

46

254

120K

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

about 2 months ago

Sparse MoE is quietly becoming the architecture that makes enterprise AI agents economically viable. At ParallelDots we're seeing this firsthand: the inference cost curve, not just benchmark scores, is what unlocks real-world retail AI at scale!

Zhihu Frontier

@ZhihuFrontier

2 months ago

🧵 As 2026 unfolds, sparse MoE models are emerging as the new backbone for high-throughput inference and Agent workloads. Step 3.5 Flash from StepFun @StepFun_ai stands out with its efficient attention mixture design.

Here’s a technical deep dive from Zhihu contributor kaiyuan 👇

🤖 Model Overview
• Open-source MoE LLM built for high-throughput inference & Agent scenarios
• Matches or exceeds leading models in reasoning, coding, and Agent benchmarks
• Delivers speed-quality balance via sparse MoE routing + Multi-Token Prediction (MTP)

📐 Core Architecture
• Backbone: Transformer + MoE | Total params: ~196B | Active params: 11B/token
• Attention mix: GQA + Sliding Window Attention (SWA) + Full Attention
• Routing: 288 experts | Top-8 activation per token
• Context window: 256K (262144 max sequence length)

⚙️ Key Configs (config.json)
• Layers: 45 | Hidden dim: 4096 | Attention heads: 64
• Sliding window: 512 | Max sequence length: 256K

✨ Standout Features
• Sparse MoE Routing: 288 experts with Top-8 selection → cuts compute without losing capacity
• Dual Attention Mechanism: SWA for efficient local modeling (512-window) + Full Attention for global context
• MTP Acceleration: Faster decoding for real-world interactive throughput

✅ Final Takeaway
Step 3.5 Flash proves that sparse MoE architectures can deliver enterprise-grade performance for long-context, high-throughput applications without proportional resource growth.

#AI #Engineering #Tech #LLM #Agent #StepFun
🔗 Full article (CN): https://t.co/WobioAOycv

0

26

2

10

4K

0

51

ankitnarayan1 retweeted

Bo Wang

@BoWang87

2 months ago

Three weeks ago I shared that Claude had shocked Prof. Donald Knuth by finding an odd-m construction for his open Hamiltonian decomposition problem in about an hour of guided exploration. Prof. Knuth titled the paper Claude’s Cycles.

The story didn't end there.

The updated paper shows the story got much bigger. For the base case m=3, there are exactly 11,502 Hamiltonian cycles. Of those, 996 generalize to all odd-m, and Prof. Knuth shows there are exactly 760 valid “Claude-like” decompositions in that family.

The even case, which Claude couldn’t finish, was then cracked by Dr. Ho Boon Suan using GPT-5.4 Pro to produce a 14-page proof for all even m≥8, with computational checks up to m=2000.

Soon after, Dr. Keston Aquino-Michaels used GPT + Claude together to find simpler constructions for both odd and even m, by using the multi-agent workflow.

Dr. Kim Morrison also formalized Knuth’s proof of Claude’s odd-case construction in Lean.

So yes: the problem now appears fully resolved in the updated paper’s ecosystem of human + AI + proof assistant work!

We went from one AI solving one problem to a full mathematical ecosystem (multiple AI systems, multiple humans, formal verification) running in parallel on a problem that stumped experts for weeks.

We are living in very interesting times indeed.

Paper (updated): https://t.co/Ecu6X5StbY

41

1K

263

753

176K

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

2 months ago

This is huge if it works as promised!!

Google Research

@GoogleResearch

2 months ago

Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: https://t.co/CDSQ8HpZoc

1K

39K

6K

22K

19M

0

46

ankitnarayan1 retweeted

Ian Jones

@IanLJones98

2 months ago

The human traits AI can’t replicate and why they are worth developing…

By @ForbesTechCncl expert panel from @alloyautomation @ankitnarayan1 @createdbyjannn and more…

https://t.co/XIec7WCXiR

@BetaMoroney @Nicochan33 @enilev @mvollmer1 @mikeflache @antgrasso @FrRonconi @ramonvidall @baski_LA @AkwyZ @Khulood_Almani @sijlalhussain @PawlowskiMario @pierrepinna @sonu_monika @mvollmer1 @sallyeaves @NevilleGaunt @Corix_JC @enricomolinari @Shi4Tech @wcrpaul @RagusoSergio @RLDI_Lamy @NigelTozer @EstelaMandela @JagersbergKnut @DrFerdowsi @PerBBerggreen @sir4K_zen @AmitChampaneri1 @FmFrancoise @HLStockenstrom @ILoveBooks786 @Hana_ElSayyed @CurieuxExplorer @HaroldSinnott @SegundoConnect @pchamard @trudydarwin

2

31

18

2

708

ankitnarayan1 retweeted

Forbes Technology Council @ForbesTechCncl

3 months ago

The Human Traits AI Can't Replicate And Why They're Worth Developing https://t.co/pxH0NibS3v from @alloyautomation @ankitnarayan1 @createdbyjannn and more

0

2

1

0

93

ankitnarayan1 retweeted

Forbes Technology Council @ForbesTechCncl

3 months ago

Five Valuable Engineering Skills For The AI-First World (Before Research Catches Up) https://t.co/F9Y0aS4LkI Written by @ankitnarayan1 of @paralleldots

0

2

1

0

83

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

4 months ago

Just got featured in the Most Promising Leaders To Watch in 2026: Redefining Leadership for a Changing World https://t.co/GrPLCxfKK9 @bombaytimes

0

1

0

47

ankitnarayan1 retweeted

India Digital Summit @idsiamai

4 months ago

Dhruv Bajpai, @Accenture; Ankit Narayan Singh, @ParallelDots; Sandeep Jabbal, @shoppersstop; Praveen Govindu, Deloitte; and Vikraman Sridharan, @Lenskart_com, offered insights into "The Phygital Renaissance: Architecting India's 'Intelligence-First' Retail Ecosystem" https://t.co/uOhFDYTHb4

1

0

69

ankitnarayan1 retweeted

Aarthi Ramamurthy

@aarthir

4 months ago

This is an excellent piece on how to think about Forward Deployed Engineers (FDEs) for enterprise AI startups. My favorite part is right at the end - “Linear services scale by adding bodies. Exponential services scale by adding capability. Both have FDEs. Only one is building something that compounds. If the product isn’t improving, you don’t have forward deployed engineers.”

8

144

16

215

35K

ankitnarayan1 retweeted

adam @adamdotnew

5 months ago

Introducing Adam, the first AI mechanical engineer

98

3K

222

2K

423K

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

5 months ago

@gokulr Companies rarely go under because of hiring delays, but they frequently collapse when they run out of money to meet payroll.

0

32

ANKIT NARAYAN SINGH ⚡️ @ankitnarayan1

5 months ago

@gokulr That benchmarking is how you find where you’re still meaningfully better and should double down. In our case, we learned general VLMs lack the fine-grained object recognition our customers need but excel at context, so we combined both into a unique stack.

0

88

ANKIT NARAYAN SINGH ⚡️

@ankitnarayan1
