Richmond @_Rixchy - Twitter Profile

Pinned Tweet

Richmond @_Rixchy

9 months ago

_Rixchy's tweet photo. https://t.co/leixouW0fC

0

4

0

273

_Rixchy retweeted

Azalia Mirhoseini

@Azaliamirh

6 days ago

Introducing Decentralized Language Models (DeLM)! DeLM is a multi-agent framework that enables asynchronous, verified & reusable progress! It makes agentic tasks more accurate and significantly cheaper. For example, it achieves 65.7% on SWE-bench Verified using Gemini 3-Flash, a ~10% jump over the best centralized alternatives at less than half the cost. Great work led by @Mao_Yuzhen !

8

243

27

172

28K

_Rixchy retweeted

Glenn Hitchcock

@glennui

8 days ago

Design is full of codewords. Knowing them changes what you can ask for, and what you can get back, whether you're working with devs, or an AI. “tint this neutral color”, “fix this widow”, “nudge it to the optical center” I wrote them down: https://t.co/aFyd5avj9o

63

2K

180

4K

288K

Richmond @_Rixchy

8 days ago

@olusesan__tolu But the functionality is still lacking.

1

0

970

_Rixchy retweeted

8 days ago

designing loops is so outdated. if you’re not astral projecting to become one with claude, you’re ngmi

25

1K

54

29

36K

Richmond @_Rixchy

11 days ago

@yacinelearning @paulg FlenQA benchmarked.

0

1

0

46

_Rixchy retweeted

Thariq

@trq212

14 days ago

https://t.co/R6exTuF7P8

260

10K

1K

24K

3M

_Rixchy retweeted

Asteri

@Asteri_eth

17 days ago

Karpathy found a way to reduce token consumption by 90%. The problem is that the LLM re-reads the same files over and over again, loses context between documents, and provides less accurate answers as a result. The solution is called Wiki Layer: the LLM cleans, structures, and links all your data once, after which it never works with raw files again. Three folders: `raw/` for originals, `wiki/` for a clean knowledge base in Markdown, and files with rules for the agent. Result: up to 90% token savings on repeat queries, automatic links between documents, and a visual knowledge graph in Obsidian. Everything stays on your local machine; nothing goes to the cloud.

155

4K

420

9K

1M

_Rixchy retweeted

Ettore Di Giacinto

@mudler_it

16 days ago

parakeet.cpp: native C++/ggml (@ggml_org) inference for @NVIDIAAIDev's Parakeet, one of the best speech-to-text models out there, from the @LocalAI_API team. Every Parakeet model (TDT/CTC/RNNT/hybrid + cache-aware streaming), byte-for-byte identical output to NeMo, now running anywhere with no Python and even a bit faster, on CPU and GPU. Quantized GGUF on @huggingface 🤗 Huge thanks to @ggerganov for ggml and to @NVIDIAAIDev for releasing Parakeet! 🧵

14

367

56

361

55K

_Rixchy retweeted

Chelsea FC

@ChelseaFC

over 4 years ago

'I only support teams in London without European Cups' 🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩🚩

2K

80K

22K

1K

0

_Rixchy retweeted

Chelsea FC

@ChelseaFC

11 months ago

FIFA CLUB WORLD CUP WINNERS!!! 🏆

7K

229K

62K

3K

13M

_Rixchy retweeted

Huaxiu Yao

@HuaxiuYaoML

25 days ago

Every memory system for LLM agents evolves what it stores. None evolves how it retrieves. 🧬 EvolveMem is out, now shipping inside the SimpleMem v0.3.0 update. Powered by AutoResearch: the system researches its own retrieval, treating the full retrieval config as a structured action space and running a closed loop: evaluate ➜ diagnose ➜ propose ➜ validate ➜ repeat. 🔬 From a minimal baseline, 7 autonomous rounds produce a retrieval policy that beats the strongest published baseline by +25.7% on LoCoMo and +18.9% on MemBench. 🧬 It discovers entirely new retrieval dimensions not present in the original design, all integrated into the unified SimpleMem package. 📄 Paper: https://t.co/BWCXebWhG1 💻 Code: https://t.co/hhdgvVjblP Led by @itsJiaqiLiu, @XinyeYee with contributions from @richardxp888, @ZhengBerkeley, @cihangxie

12

423

79

376

29K

_Rixchy retweeted

Zyphra

@ZyphraAI

25 days ago

Backprop strongly shapes the GPU hardware AI runs on today. Learning algorithms without backprop open new opportunities for neuromorphic silicon, biologically grounded models, and heterogeneous compute. Paper: https://t.co/rNFmIKzCXz Blog: https://t.co/oSq2pN0brU

2

153

12

169

40K

_Rixchy retweeted

Isha Puri

@ishapuri101

25 days ago

It's never made sense to me that RL collapses all reward signals to a single scalar. Today, we fix that! Introducing Vector Policy Optimization: we train models to inherently optimize for the varied nature of a reward vector, creating diverse sets of answers ideal for test time search. Website and code coming soon!

11

716

66

577

69K

Richmond @_Rixchy

28 days ago

@Google @antigravity Are you all vibe coding now?

0

1

0

99

_Rixchy retweeted

Eric Ho

@ericho_goodfire

29 days ago

neural geometry in biology! i think we're going to learn a lot about our own brains by studying neural networks

0

204

23

83

15K

_Rixchy retweeted

Vikash Kumar

@Vikashplus

29 days ago

SCALING ISN’T EVERYTHING

Another tiny model breaking the rule.
- trained on less than 1/1000th of the data
- can be trained in a single day with <1000 USD

Human knowledge base can be compressed & retrieved much tighter than LLMs do today.

4

81

5

51

11K

_Rixchy retweeted

Santosh Kumar Radha

@santoshradha

about 1 month ago

@RoundtableSpace Ooo let me add a layer above it. Harness-Orchestration - https://t.co/ej3wFSqHRJ

0

31

6

117

15K

_Rixchy retweeted

Felipe Coury 🦀

@fcoury

about 1 month ago

SSH + Tailscale is how I am managing multiple machines from the same phone. Give it a shot.

12

261

12

182

32K

_Rixchy retweeted

Andy @prompt_Tunes

about 1 month ago

👀

prompt_Tunes's tweet photo. 👀 https://t.co/LTqk3go4Ek

15

2K

156

1K

117K

_Rixchy retweeted

Simplifying AI

@simplifyinAI

about 1 month ago

🚨 BREAKING: NVIDIA proved back-propagation isn't the only way to build an AI. Billion-parameter models were trained without a single gradient. No calculus, no exploding memory, no massive GPU clusters. The culprit? A long-dismissed technique called Evolution Strategies. NVIDIA and Oxford just made it scalable with EGGROLL, which replaces bloated mutation matrices with two tiny ones, enabling hundreds of thousands of parallel mutations at inference-level speed. They're pretraining models from scratch using only simple integers. No backprop. No decimals. We assumed the future of AI required endless precision hardware. Evolution had other plans.

29

1K

179

2K

269K
