Alfredo Espinoza @alfredo_ep - Twitter Profile

1 day ago

1-bit GLM-5.2 GGUF vs. Claude 4.8 Opus vs. GPT-5.5 We gave 3 models the same prompt and compared one-shot outputs. The 1-bit GLM-5.2 GGUF ran locally on a Mac Studio M3 Ultra with 256GB RAM at ~21.6 tok/s. Which output do you like best? GGUF: https://t.co/BMkxswdj5N

161

3K

343

2K

1M

alfredo_ep retweeted

François Chollet

@fchollet

1 day ago

With agentic coding, complexity compounds in a mechanical way: unnecessary code ends up in the codebase, moves to the context window, degrades the model's reasoning abilities, leads to more unnecessary code (often to fix issues arising from the unnecessary code). It's exponential

27

393

39

56

19K

Alfredo Espinoza @alfredo_ep

1 day ago · Chacsinkín

@Hesamation To be fair, even if orchestrated or ensembled, it remains a model. Not dirty, just clever

1

2

0

275

alfredo_ep retweeted

Z.ai @Zai_org

8 days ago

Introducing GLM-5.2: Frontier Intelligence, Open Weights - Significant improvements in coding and agentic tasks - Strong long-horizon capabilities with a 1M context window - Two levels of reasoning effort: GLM-5.2 (max) pushes the limits, while GLM-5.2 (high) strikes a strong balance between performance and token efficiency - MIT-licensed open weights - Same API pricing as GLM-5.1 Tech Blog: https://t.co/LAsxUdN0JZ Weights: https://t.co/g0A1C4UWx4 API: https://t.co/Kc3E22cbN7 Coding Plan: https://t.co/Nk8Y98HNhU Chat: https://t.co/WCqWT0qCQb

Zai_org's tweet photo. Introducing GLM-5.2: Frontier Intelligence, Open Weights

- Significant improvements in coding and agentic tasks
- Strong long-horizon capabilities with a 1M context window
- Two levels of reasoning effort: GLM-5.2 (max) pushes the limits, while GLM-5.2 (high) strikes a strong balance between performance and token efficiency
- MIT-licensed open weights
- Same API pricing as GLM-5.1

Tech Blog: https://t.co/LAsxUdN0JZ
Weights: https://t.co/g0A1C4UWx4
API: https://t.co/Kc3E22cbN7
Coding Plan: https://t.co/Nk8Y98HNhU
Chat: https://t.co/WCqWT0qCQb

674

12K

2K

4K

7M

Who to follow

Slytewurk

@Slytewurk

Honest streams, reactions, and commentary on games, shows, and culture. 🔗 https://t.co/3CFaqwtqsy

alfredo_ep retweeted

3 days ago

From one Aruco marker I got the relative positions of my wrist and global cameras and of my robot's kinematics chain. The Aruco is flat on the table so I can project the wrist camera's intrinsics on the table plane, and get an estimate of the wrist cam from global pixels only.

10

236

22

164

33K

alfredo_ep retweeted

bryan pratte

@btp4z7

3 days ago

playing with new backbones this weekend.. impressive how far DINO3+mesh queries can get you.

13

925

65

656

63K

alfredo_ep retweeted

Preferred Networks

@PreferredNetJP

3 days ago

【発表】国産フルスクラッチ開発の生成AI基盤モデルPLaMo 3.0 Primeの提供を開始しました。 1⃣API経由またはオンプレで利用可能 2⃣複雑なタスクに対応するReasoningモデル、応答速度の速いNon-reasoningモデルを提供 3⃣高い日本語性能とコストパフォーマンスを両立 4⃣コンテキスト長を64kから256kに拡張 5⃣危険情報に対する安全性向上 https://t.co/r7IeV3OdwC AIエージェントや企業の実務利用を想定した機能強化がされていますのでおぜひ試しください

15

1K

399

571

356K

alfredo_ep retweeted

𝑙𝑦𝑟𝑎

@naturehealyou

3 days ago

some people are born to like this feeling

1K

102K

18K

14K

6M

alfredo_ep retweeted

matospiso

@matospiso

5 days ago

@tenderizzation Btw this is how billions of user interactions are processed every day and millions of people see recommendations where the retrieval is done by 2 sparse matrix-vector multiplications 😀😀 https://t.co/3MS8aGuzqX

matospiso's tweet photo. @tenderizzation Btw this is how billions of user interactions are processed every day and millions of people see recommendations where the retrieval is done by 2 sparse matrix-vector multiplications 😀😀

https://t.co/3MS8aGuzqX https://t.co/Jx8826ISFf

0

7

1

0

650

alfredo_ep retweeted

tender

@tenderizzation

5 days ago

6-7 decades of computer systems research in a nutshell

33

3K

268

1K

135K

alfredo_ep retweeted

Yun-Ta Tsai

@yunta_tsai

4 days ago

Many people think any given ML project is 99% training. In reality, it’s 50% evaluation, 40% data cleaning, 8% integration, and 2% training. The first two set the noise floor for learning. No ML magic matters; the model cannot lower the noise floor, as that’s the optimal bound of Shannon encoding of your data. Thus, not a single day goes by without me thinking about ontology. Even the old labels have to be constantly reviewed.

556

11K

1K

6K

18M

alfredo_ep retweeted

Drone U

@theDroneU

5 days ago

Major drone industry shakeup as @ExynTech released @SkydioHQ level autonomy for a fraction of the price. The industry wars are here. Remember competition is great for pilots.

79

4K

367

2K

442K

alfredo_ep retweeted

codila

@0xCodila

6 days ago

Anthropic Quant Andrej Karpathy: "Most people use tools that they don't understand - the ones who strip everything down to basics - end up faster than everyone else " "the best code is the code anyone can read " he couldn't fix a bug in 2 hours, so instead of googling - he rewrote the entire system from scratch no frameworks. no dependencies. it ended up faster that's the difference between using AI and understanding it 25-min masterclass - bookmark and watch

24

2K

152

4K

375K

alfredo_ep retweeted

Ettore Di Giacinto

@mudler_it

5 days ago

Depth Anything 3 now runs as pure C++/ggml (@ggml_org) . No Python, no PyTorch, no CUDA toolkit at inference, just one self-contained GGUF. It's faster than PyTorch on CPU! and ties speed on GPU. The CPU win came from the last place..I'd have looked. Quantized GGUF on @huggingface🤗 Shout out to @ggerganov for ggml (we are building a ggml-world!❤️) and to @ByteDanceOSS and Depth Anything 3 authors @bingyikang @jhliew91 @donydchen !

12

626

84

491

34K

alfredo_ep retweeted

Andi Marafioti

@andimarafioti

6 days ago

Can a VLM see without a vision encoder? We trained one for $100, inspired by Gemma 4 12B. Latency on an M3 Pro MacBook: 112 ms -> 1.1 ms for the image path 30% lower end-to-end image+LLM The architecture is just: patchify the image -> linear projection with pos embeddings -> LLM Writeup: https://t.co/yt0IKzsF7O

18

692

103

633

59K

alfredo_ep retweeted

GIROTTO

@GIROTTO

9 days ago

O dinheiro e a engenharia fazendo o seu propósito, melhorar nem que seja um pouco a vida. 👏🏻👏🏻👏🏻👏🏻👏🏻

142

20K

362

632

2M

alfredo_ep retweeted

ℏεsam

@Hesamation

10 days ago

LOCAL LLM GUIDE (June 2026) Cheapest full build: 1× used RTX 3090 (24GB) + rest of PC ≈ $1000-1500 16GB all-rounder → Gemma 4-12B 32GB all-rounder → Qwen3.6-27B Agents & tool use → Qwen3.6-27B Deep reasoning → Nex-N2-Mini

Hesamation's tweet photo. LOCAL LLM GUIDE (June 2026)

Cheapest full build: 1× used RTX 3090 (24GB) + rest of PC ≈ $1000-1500
16GB all-rounder → Gemma 4-12B
32GB all-rounder → Qwen3.6-27B
Agents & tool use → Qwen3.6-27B
Deep reasoning → Nex-N2-Mini https://t.co/lN0nQY6Zq3

12

419

46

454

37K

alfredo_ep retweeted

HomeMadeGarbage

@H0meMadeGarbage

15 days ago

Codexで強化学習生き物つくってもうた。。

109

6K

442

2K

820K

alfredo_ep retweeted

Jack 🤖

@JacklouisP

16 days ago

Why TF does AI-optimised metal look like bone? Ask AI to optimise a bracket for strength and weight and it hands back something that looks grown, not built. The physics behind this is very cool: •What the software does. You give it a bounding box, the loads, and the anchor points. It models stress through the block as thousands of tiny springs - finite element analysis - then reinforces the cells carrying load and dials down the ones sitting idle. Over hundreds of iterations only the load-bearing material survives. It’s called topology optimisation, and the output is the most efficient distribution of material for that exact load case. •Why does force moves in arcs, not straight lines? Any sharp corner creates a stress concentration - a local spike that fails first. The path of least resistance through a loaded solid is always curved, following the principal stress trajectories. Like water finding its way downhill, force takes the smoothest route. Lay material on those arcs and you get maximum strength per gram. Which is why the result always curves. •Why does this look natural? Because evolution runs the identical loop: it deposits bone along stress lines and dissolves it where it sits idle and wastes energy. The shapes aren’t biological or mechanical. They’re optimal. Evolution converged on them because wasting energy gets you outcompeted. Different processes, same selection pressure. What’s cool is we proved that these shapes were optimal in 1904. We just couldn’t build them until 3D printing arrived.