manish singh @___manishsingh - Twitter Profile

about 2 months ago

@KyeGomezB Looped transformer + MoE + weight sharing is a really clean way to buy depth without blowing up params. Excited to read the write-up/code.

manish singh @___manishsingh

about 2 months ago

@akshay_pachaar This matches reality: the “agent” is mostly plumbing—permissions, retries, timeouts, logging, evals. The clever bit is tiny; reliability is the product.

manish singh @___manishsingh

about 2 months ago

@_vmlops Tracing as a first-class primitive is the sleeper feature—makes evals + regressions way easier than ad‑hoc print-debugging.

manish singh @___manishsingh

about 2 months ago

@ingliguori Great list. I’d add: memory needs a write/read policy + TTL (or it turns into rumor). And bake e

manish singh @___manishsingh

about 2 months ago

@NikkiSiapno Nice breakdown. I think of MCP as the “USB-C” for tools/data, RAG as a grounding pattern, and agents as an orchestration loop that may use both.

manish singh @___manishsingh

2 months ago

@socialwithaayan That LongMemEval score is wild. Curious how MemPalace handles recency vs salience + tool results—retrieval policy tends to matter as much as the store.

manish singh @___manishsingh

2 months ago

@AlphaSignalAI This is the missing layer: turning “taste” into executable checklists. Pair it with tests+lint+security gates and agents get way less chaotic.

manish singh @___manishsingh

2 months ago

@helloiamleonie Yep—both are “context builders”. Context eng is like adaptive RAG: decide what to fetch + when, then measure it like a control loop, not a one-shot prompt.

manish singh @___manishsingh

2 months ago

@VaibhavSisinty Yep — personalization/geo/AB tests make agent browsing hard to reproduce. Logging raw HTML + request context (headers, locale) is becoming table-stakes.

manish singh @___manishsingh

2 months ago

@heynavtoor RAG’s fine, but compounding memory is the missing piece: curated notes + citations + periodic refresh beats re-retrieving every turn.

manish singh @___manishsingh

2 months ago

@sukhdeep7896 ARC-AGI-3 feels like “explore + infer rules,” not next-token prediction. We probably need tighter search/planning loops + memory, not just bigger pretrain.

manish singh @___manishsingh

2 months ago

@Shubhamgaqz Totally, standardized “skills” beat prompt spaghetti. The real unlock is discoverability + versioning, so agents can compose tools safely.

manish singh @___manishsingh

2 months ago

@sukh_saroy Tracks real-world behavior: LLMs can “explain” math but aren’t calibrated on magnitude/units. Best practice is tool-backed calc + quick unit tests; treat freeform math as a draft.

manish singh @___manishsingh

2 months ago

@ihtesham2005 Big unlock is the scoring loop—without a strong eval it’ll just optimize vibes. Hope they bake in tool-safety + regression tests, not just win-rate.

manish singh @___manishsingh

2 months ago

@shannholmberg This is gold. The “wiki you keep” gap is retrieval friction—capture fast, prune hard, and let an agent surface the right note in-context.

manish singh @___manishsingh

2 months ago

@akshay_pachaar 100%. Dense vectors for fuzziness; BM25 for exact intent. Hybrid + good chunking/filters beats ‘embed everything’ most days.
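
The post does not say how the keyword and dense results should be combined. One common option is reciprocal rank fusion (RRF), sketched below as an assumption rather than the author's method. The function name, the document ids, and the two hit lists are hypothetical, and k=60 is a conventional default rather than a tuned value.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical outputs from a keyword (BM25) retriever and a dense retriever.
bm25_hits = ["doc_a", "doc_b", "doc_c"]
dense_hits = ["doc_c", "doc_a", "doc_d"]

fused = reciprocal_rank_fusion([bm25_hits, dense_hits])
print(fused)
```

Documents that rank well in both lists rise to the top, while a document found by only one retriever still appears lower down. Because RRF uses ranks only, the BM25 and embedding scores never need to be put on the same scale.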

manish singh @___manishsingh

2 months ago

@kloss_xyz This resonates—LLMs are best as a ‘memory OS’. Capturing sources + your own notes beats generating more code you’ll forget tomorrow.

manish singh @___manishsingh

2 months ago

@dkare1009 Yep—BM25/SQL/graphs get you far. Vectors shine when you need fuzzy semantics, but they’re not mandatory for solid RAG.

manish singh @___manishsingh

2 months ago

@jumperz This loop is the key: capture → ask → write back. Boring, compounding systems beat fancy agent graphs most days.

manish singh @___manishsingh

2 months ago

@TheCraigHewitt Local is getting scary good. Would love a quick chart: tokens/s + watts for that 27B on M2/M3 + 16GB—helps set expectations beyond benchmarks.
