raullen @raullen - Twitter Profile

about 15 hours ago

Shipped Rapid-MLX v0.7.3 — bringing best performance and structured tool calling to DiffusionGemma 26B on Apple Silicon. 🛠️ Engine optimizations pushed throughput to 50 tok/s on an M3 Ultra. Zero parsing headaches. v0.7.3 perfectly handles DiffusionGemma's tool calls natively right out of the box. My new fav local LLM now! 🔗 https://t.co/bxNYbSQb8N

Google Gemma

@googlegemma

3 days ago

Meet DiffusionGemma! An experimental open model that explores a fast approach to text generation, released under an Apache 2.0 license. Moving beyond sequential, token-by-token processes to generate entire blocks of text simultaneously. Here’s what’s new with DiffusionGemma: 👇

170

5K

798

2K

905K

31

265

7

4

15K

raullen

@Raullen

1 day ago

SpaceX ($SPCX) isn't a rocket IPO—it's a $1.75T index-inclusion game masking a cash-burning AI bet. I pulled apart the numbers ahead of Friday's debut. Here is the actual signal through the noise: 1/ The Identity Crisis: Post-xAI merger, it’s a four-headed beast: Launch, Starlink, xAI, and X. The reality? Only Starlink is actually printing money. 2/ The Valuation Void: At $1.75T (~94x sales) with widening losses and zero P/E, there is no fundamental anchor. Fair-value estimates are hallucinating anywhere from $63 to $330. 3/ The Float Mechanics: With only ~4.3% of shares trading freely, month one is completely divorced from fundamentals. It’s a pure liquidity squeeze driven by forced index buying (MSCI, then Nasdaq-100 on Jul 7). 4/ The Bottom Line: You are buying a cash-generative ISP engine strapped to an infinite-spend AI rocket, flying blind on traditional metrics. Full 4-part teardown in the attached images. 👇

Raullen's tweet photo. SpaceX ($SPCX) isn't a rocket IPO—it's a $1.75T index-inclusion game masking a cash-burning AI bet.

I pulled apart the numbers ahead of Friday's debut. Here is the actual signal through the noise:

1/ The Identity Crisis: Post-xAI merger, it’s a four-headed beast: Launch, Starlink, xAI, and X. The reality? Only Starlink is actually printing money.

2/ The Valuation Void: At $1.75T (~94x sales) with widening losses and zero P/E, there is no fundamental anchor. Fair-value estimates are hallucinating anywhere from $63 to $330.

3/ The Float Mechanics: With only ~4.3% of shares trading freely, month one is completely divorced from fundamentals. It’s a pure liquidity squeeze driven by forced index buying (MSCI, then Nasdaq-100 on Jul 7).

4/ The Bottom Line: You are buying a cash-generative ISP engine strapped to an infinite-spend AI rocket, flying blind on traditional metrics.

Full 4-part teardown in the attached images. 👇

67

469

114

11

19K

raullen

@Raullen

2 days ago

Hyperscalers are pouring nearly $1 trillion into AI infrastructure. The obvious question is: 🤔 What demand actually justifies $1T of frontier compute? It’s clearly not chatbots, search, or basic copilots. Those markets are rapidly moving toward the edge and open source. Inference costs keep collapsing (~10x every 12–18 months), and “good enough” AI gets cheaper every year. The bull case for frontier compute is usually some combination of: 🧬 Drug discovery 🖥️ Chip design 🔬 Scientific/Math research 🧠 Deeper understanding of the universe (SpaceX is here?) 🤖 Physical AI However, we still have very little evidence that these workloads are generating revenue at a scale that can support a trillion-dollar infrastructure buildout. That’s a scary thought if you’re underwriting the next decade of AI capex.

42

244

69

13

14K

raullen

@Raullen

2 days ago

🤖 AI now makes 100x absurd products than 6mo ago. When the cost of experimentation collapses, the system starts exploring huge regions of the possibility space that were previously too expensive to touch. Most ideas will fail. Most products will die. But that’s exactly what happened during the Cambrian Explosion. Nature didn’t become smarter overnight—it simply became able to run far more experiments. AI is doing the same for startups and products. What’s becoming scarce is selection. The winners will be those who can spot signal in noise, identify the one great idea among a thousand mediocre ones, and iterate faster than everyone else 🧬

38

185

73

12

12K

Who to follow

IoTeX

@iotex_io

The blockchain platform for Real-World AI.

Nicholas Chan

@nicholas_yhchan

Medway Lib Dems🔶 | [email protected] Published by Nicholas Chan, Medway Liberal Democrats, 29 Dunlin Drive, Chatham ME4 3JA.

Larry Pang

@larrypang

real-world AI @iotex_io || prev @oliverwyman @mit

raullen

@Raullen

3 days ago

Apple is the true future of decentralization. The narrative that Apple is "lagging" in the AI arms race completely misses the mark. They aren't failing to compete with hyperscalers—they are executing a profoundly different playbook. 🎯 While the industry forces us to rent intelligence from centralized monoliths (trading our privacy and paying exorbitant token fees 💸), Apple is quietly building the exact opposite: 👤 People-owned AI, running entirely on people-owned devices. 🧠 Core AI (announced in WWDC'26) transforms the device in your pocket from a mere data-harvesting terminal into a sovereign engine of your own intelligence. 🤝 It’s a massive shift: Full Privacy, Low Latency, Zero Token Fees and True AI empowerment isn't a subscription to a distant server. It's local, private, and unequivocally yours am bullish $APPL (as the developer of rapid-mlx inference engine on MacOS)

Raullen's tweet photo. Apple is the true future of decentralization.

The narrative that Apple is "lagging" in the AI arms race completely misses the mark. They aren't failing to compete with hyperscalers—they are executing a profoundly different playbook. 🎯

While the industry forces us to rent intelligence from centralized monoliths (trading our privacy and paying exorbitant token fees 💸), Apple is quietly building the exact opposite:

👤 People-owned AI, running entirely on people-owned devices.
🧠 Core AI (announced in WWDC'26) transforms the device in your pocket from a mere data-harvesting terminal into a sovereign engine of your own intelligence.
🤝 It’s a massive shift: Full Privacy, Low Latency, Zero Token Fees and True AI empowerment isn't a subscription to a distant server. It's local, private, and unequivocally yours

am bullish $APPL (as the developer of rapid-mlx inference engine on MacOS)

65

351

69

12

17K

raullen

@Raullen

3 days ago

I’ve noticed something interesting. The gap between model capability and model marketing seems to be widening. As a heavy user, I haven’t seen a meaningful leap since Opus 4.6. Yet every new release is presented as if AGI has arrived and every profession is about to disappear next quarter. Maybe we’ve entered a phase where AI progress is linear, but AI storytelling is exponential. That tends to happen right before very large IPOs

36

184

33

10

9K

raullen

@Raullen

4 days ago

Anthropic’s latest breakthrough in 'efficiency': an elegant new model optimized to drain your usage limits 2x faster. Truly revolutionary upselling. 👏

Raullen's tweet photo. Anthropic’s latest breakthrough in 'efficiency': an elegant new model optimized to drain your usage limits 2x faster. Truly revolutionary upselling. 👏 https://t.co/5pQ0zVUHTK

44

212

39

11

12K

raullen

@Raullen

4 days ago

Massive performance leap in rapid-mlx v0.6.83 🚀 We fused the top-p sampler into a single lazy-graph segment, completely bypassing mlx-lm's two-compile closure chain bottleneck. The result? The entire hero table is up 12-53%, with Gemma-4-12b flying at +53%. Pure edge inference efficiency. 🔥

Raullen's tweet photo. Massive performance leap in rapid-mlx v0.6.83 🚀

We fused the top-p sampler into a single lazy-graph segment, completely bypassing mlx-lm's two-compile closure chain bottleneck.

The result? The entire hero table is up 12-53%, with Gemma-4-12b flying at +53%. Pure edge inference efficiency. 🔥

73

399

116

13

23K

Raullen retweeted

Xinxin Fan @cryptoxfan

5 days ago

Cornell's IC3 just dropped the first systematic survey on Crypto x AI. The core insight? AI is the translation layer between humans and machines. Crypto is the trust layer that proves machines actually did what they claimed. 🔗

cryptoxfan's tweet photo. Cornell's IC3 just dropped the first systematic survey on Crypto x AI. The core insight? AI is the translation layer between humans and machines. Crypto is the trust layer that proves machines actually did what they claimed. 🔗 https://t.co/AX8oHa3uoN

1

8

3

2

471

raullen

@Raullen

5 days ago

Where @iotex_io is heading? At IoTeX, we’re exploring several concrete directions: - verified real-world data from devices - GPU compute for batch / latency-insensitive AI workloads - identity for machines and AI agents - verifiable AI execution pipelines - agent payments and onchain settlement This is why EVM parity, account abstraction, rollups, and cross-chain readiness matter. Real World AI needs real infra.

IoTeX

@iotex_io

5 days ago

🚨 IoTeX Core v2.4.1 "Yap" Hardfork is HERE! 🚨 Full Ethereum Pectra compatibility has officially landed on @iotex_io! ⚡️ What’s dropping at block 48,985,561? ✅ IIP-60 (Pectra EVM): Parity with Ethereum’s latest! Unlocking Account Abstraction (EIP-7702), Rollups, and cross-chain BLS. ✅ Candidate Exit Queue: Safer, predictable validator exits to protect network stability. 💥 A massive leap forward for large-scale DePIN & Real World AI workloads! 🛠 Node Operators: This is a MANDATORY upgrade! (No config changes needed, just restart with the new image). 📘 Full release notes: https://t.co/eyqtu1N1Zt

iotex_io's tweet photo. 🚨 IoTeX Core v2.4.1 "Yap" Hardfork is HERE! 🚨
Full Ethereum Pectra compatibility has officially landed on @iotex_io! ⚡️

What’s dropping at block 48,985,561?
✅ IIP-60 (Pectra EVM): Parity with Ethereum’s latest! Unlocking Account Abstraction (EIP-7702), Rollups, and cross-chain BLS.
✅ Candidate Exit Queue: Safer, predictable validator exits to protect network stability.
💥 A massive leap forward for large-scale DePIN & Real World AI workloads!

🛠 Node Operators: This is a MANDATORY upgrade! (No config changes needed, just restart with the new image).

📘 Full release notes: https://t.co/eyqtu1N1Zt

38

188

43

4

16K

57

281

51

12

18K

Raullen retweeted

IoTeX_Daily

@iotex_daily

5 days ago

🚨 JUST IN: The IoTeX Mainnet v2.4.0 upgrade has successfully activated at block height 48,985,561 $IOTX

41

204

17

1

13K

raullen

@Raullen

7 days ago

🔴 The Pain: Running local MLX models is incredibly fast and private. But let's be real - testing tool calling via terminal is clunky, and there's zero good UI for it. 🟢 The Fix: rapid-mlx share Just ONE command gives you a polished web chat + seamless tool calling (works beautifully with gemma-4-12b-qat). We are proud to be the ONLY MLX inference engine in the community shipping this. ⚡️ 👇 Try it now: brew install raullenchai/rapid-mlx/rapid-mlx

77

434

18

16

23K

raullen

@Raullen

7 days ago

Gemma4 12B is getting a lot of attention right now, so I ran a quick 4-bit inference benchmark on Apple Silicon. I compared the token generation throughput across rapid-mlx, mlx-lm, Ollama, and LM Studio under 1, 4, and 8 concurrent users. Results are in the image below for those looking to run it locally. 📊👇

Raullen's tweet photo. Gemma4 12B is getting a lot of attention right now, so I ran a quick 4-bit inference benchmark on Apple Silicon.

I compared the token generation throughput across rapid-mlx, mlx-lm, Ollama, and LM Studio under 1, 4, and 8 concurrent users. Results are in the image below for those looking to run it locally. 📊👇

73

316

24

10

19K

raullen

@Raullen

8 days ago

@bridgemindai Try "rapid-mlx chat gemma-4-12b".

0

1

0

540

raullen

@Raullen

8 days ago

@sundarpichai @sundarpichai Try running Gemma 12B with rapid-mlx chat gemma-4-12b on a Mac. This is definitely the fastest way!

0

1

0

192

raullen

@Raullen

8 days ago

The uncomfortable truth: In the short term, AI is an absolute headwind for Crypto. But zoom out, and it’s the greatest long-term catalyst the space has ever seen👇 🩸 The Short-Term Bear Case 1️⃣ The Attention Black Hole: The AI wealth-creation myth has completely hijacked the global narrative. It is aggressively siphoning mindshare, developer talent, and liquidity. Crypto is experiencing a sustained capital drain as both retail and institutional money chase the AI gold rush. 2️⃣ The Security & Quantum Threat: AI poses a dual existential threat to decentralized networks. Not only are AI agents becoming ruthlessly efficient at finding and exploiting code vulnerabilities, but AI is also radically accelerating the timeline for quantum computing—posing a looming threat to the underlying cryptographic security and efficiency of the entire space. This drives massive on-chain capital flight. 3️⃣ An Ideological Shift: This isn't just about hacks; it's a fundamental shift in trust. Driven by the fear of decentralized vulnerabilities, users and capital are retreating to the perceived safety of centralized systems. This ideological pivot is the core reason for the short-term price suppression. 🚀 The Mid-to-Long Term Bull Case The mid-to-long term isn't just a recovery; it’s a total metamorphosis. 1️⃣ Open, decentralized platforms are the only infrastructure neutral and permissionless enough to host a civilization of billions of autonomous AI agents. These agents will perform quadrillions of experiments, crowdsource innovation, and iterate at machine speed—all on-chain, beyond the reach of a central kill-switch. 2️⃣ We aren't just building a financial system; we are building the substrate for an explosion of Global Intellectual Wealth. 3️⃣The End Game? We are so early that we can’t even catch a glimpse of it yet. We are moving past the era of "speculative toys" and into the "Synthetic Intelligence Layer" of humanity. Just my 2cents

The Kobeissi Letter

@KobeissiLetter

8 days ago

BREAKING: MicroStrategy's, $MSTR, unrealized loss on its Bitcoin holdings rises to a record -$12.7 billion. This puts the company's position down -$28 billion over the last 12 months.

KobeissiLetter's tweet photo. BREAKING: MicroStrategy's, $MSTR, unrealized loss on its Bitcoin holdings rises to a record -$12.7 billion.

This puts the company's position down -$28 billion over the last 12 months. https://t.co/Fum7pW5NUk

354

5K

555

254

692K

61

249

46

12

16K

raullen

@Raullen

9 days ago

Gemma4 12B is my fav now.

witcheer

@witcheer

10 days ago

Gemma 4 dropped a 12B. I put it on RTX 5090 against its 31B sibling. when you cut a model from 31B to 12B, what do you actually lose? ~ reasoning barely moves GSM8K (math) 97.5 > 96.4 (−1.1) ARC-C (sci reasoning) 97.6 > 94.0 (−3.6) ~ knowledge falls off a cliff MMLU (world knowledge) 87.8 > 78.9 (−8.9) HellaSwag (commonsense) 92.0 > 81.6 (−10.4) ~~~ parameters store facts, not thinking. the 19B you delete is mostly where the model kept its trivia and world-priors, cut it and recall collapses, while the reasoning machinery stays nearly whole. a 12B reasons almost like its big brother. It just knows less. 122 tok/s vs 53 (2.3x faster generation), ~10GB instead of ~24, meaning that you get 20GB+ free on a 32GB card for long context or a second model. so it depends of your workload: reasoning / math / agentic loops = the 12B is nearly free broad-knowledge Q&A with no retrieval = that's the one job worth paying for the 31B.

witcheer's tweet photo. Gemma 4 dropped a 12B.
I put it on RTX 5090 against its 31B sibling.

when you cut a model from 31B to 12B, what do you actually lose?

~ reasoning barely moves
GSM8K (math) 97.5 > 96.4 (−1.1)
ARC-C (sci reasoning) 97.6 > 94.0 (−3.6)

~ knowledge falls off a cliff
MMLU (world knowledge) 87.8 > 78.9 (−8.9)
HellaSwag (commonsense) 92.0 > 81.6 (−10.4)

~~~
parameters store facts, not thinking. the 19B you delete is mostly where the model kept its trivia and world-priors, cut it and recall collapses, while the reasoning machinery stays nearly whole.

a 12B reasons almost like its big brother. It just knows less.

122 tok/s vs 53 (2.3x faster generation), ~10GB instead of ~24, meaning that you get 20GB+ free on a 32GB card for long context or a second model.

so it depends of your workload:

reasoning / math / agentic loops = the 12B is nearly free

broad-knowledge Q&A with no retrieval = that's the one job worth paying for the 31B.

38

702

81

339

66K

39

182

12

11

12K

raullen

@Raullen

9 days ago

We need verifiable safety protocols for robots. Hardware without guardrails is a recipe for disaster.

Mario Nawfal

@MarioNawfal

9 days ago

A robot kicked a little boy in the stomach We're officially one software update away from Terminator

3K

51K

9K

7K

4M

27

149

14

10

10K

raullen

@Raullen

9 days ago

Google just dropped a beast: Gemma 4 12B. (78.8% GPQA, 256K context, multimodal, Apache 2.0) 🔥 Don't waste time on complex setups. If you're on Apple Silicon, Rapid-MLX shipped day-zero support so you can run it locally right now⚡️ Zero config, just run 💻 brew install raullenchai/rapid-mlx/rapid-mlx rapid-mlx chat gemma-4-12b

Raullen's tweet photo. Google just dropped a beast: Gemma 4 12B. (78.8% GPQA, 256K context, multimodal, Apache 2.0) 🔥

Don't waste time on complex setups. If you're on Apple Silicon, Rapid-MLX shipped day-zero support so you can run it locally right now⚡️

Zero config, just run 💻
brew install raullenchai/rapid-mlx/rapid-mlx
rapid-mlx chat gemma-4-12b

Google Gemma

@googlegemma

10 days ago

Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇

googlegemma's tweet photo. Meet Gemma 4 12B!

A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.

Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇 https://t.co/gf4FZv0WZb

404

12K

2K

5K

3M

44

164

30

12

12K

raullen

@Raullen

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users