Small Model Lab @smallmodellab - Twitter Profile

Small Model Lab

@SmallModelLab

2 months ago

One of the best small models out now.

Bindu Reddy

@bindureddy

2 months ago

Gemma 4 is a very good small model that punches above it's weight class Gemma is a 31B model that is as good as other very large MoE models It's the best in the world for it's size 👏👏

bindureddy's tweet photo. Gemma 4 is a very good small model that punches above it's weight class

Gemma is a 31B model that is as good as other very large MoE models

It's the best in the world for it's size 👏👏 https://t.co/kINDS3GdIK

37

374

25

60

44K

1

0

110

Small Model Lab

@SmallModelLab

2 months ago

@bindureddy Yes! Great example of a very powerful small LLM 💪

0

19

Small Model Lab

@SmallModelLab

2 months ago

@garrytan So excited for this Garry!

0

45

Small Model Lab

@SmallModelLab

2 months ago

This deserves more attention.

0xSero

@0xSero

2 months ago

This didn't receive the attention it deserved. They pre-trained this model completely peer 2 peer, no data-centers. Everything was done over a permissionless network, I have tried the model, it's honestly not a good LLM but that's beyond the point. We NEED this, we NEED an alternative. - Download OpenCode - Download Pi - Pay for OpenSource - Share your AI sessions - Learn to do RL We can't be at the mercy of ANY lab. https://t.co/6ruL2lz2Dh

0xSero's tweet photo. This didn't receive the attention it deserved. They pre-trained this model completely peer 2 peer, no data-centers.

Everything was done over a permissionless network, I have tried the model, it's honestly not a good LLM but that's beyond the point.

We NEED this, we NEED an alternative.

- Download OpenCode
- Download Pi
- Pay for OpenSource
- Share your AI sessions
- Learn to do RL

We can't be at the mercy of ANY lab.

https://t.co/6ruL2lz2Dh

44

1K

112

410

51K

0

1

111

Who to follow

Efty

@eftycom

The world's domain name marketplace.

dave evanson

@SedoDaveEvanson

With 9-figures (hundreds of millions) in brokered deals Dave Evanson is the Senior Broker for https://t.co/H2nrA3g78u, focusing on premium and ultra premium domains.

GoDaddy Domain Academy

@DNAcademy

Learn to buy, sell, and invest in domain names like a pro. Domain Academy (formerly DNAcademy) is an educational offering of @GoDaddy. Join us https://t.co/w1MWftq9a9

Small Model Lab

@SmallModelLab

2 months ago

@0xSero @tplr_ai I was also surprised this wasn’t getting more attention

0

1

0

93

Small Model Lab

@SmallModelLab

2 months ago

@jpidala @Alibaba_Qwen @openclaw @OpenRouter Super impressed with Qwen lately, 7B runs surprisingly well on a base model Mac Mini

0

67

Small Model Lab

@SmallModelLab

2 months ago

@ns123abc Where do I get those sunglasses? 😎

0

59

Small Model Lab

@SmallModelLab

2 months ago

@0xSero Yessss, do it!!

0

21

Small Model Lab

@SmallModelLab

2 months ago

Super interesting data here, and amazing how long it takes to REAP these, but clearly worth it!

0xSero

@0xSero

2 months ago

Qwen3.5, MiniMax-M2.7 are incredible acts of kindness that I don't think will be with us from so much longer. Here's my update for you. > I have 20 GPUs at full utilisation right now. All these getting cooooompressed, no synthetic data All runs will be done in 9 days, if I don't get a catastrophic failure - REAP for: - GLM-5 - Qwen3-next-coder - Qwen3.5-122B - Qwen3.5-plus-397b - Browser-use - CUDA - Terminal-use - Coding - Math - Agentic trajectories - 30% my personal chat session history I am also removing refusals inspired by Prism. So no more I can't do this I can't do that blah blah Inference for local AI - Qwen3.5-262B-REAP - I've been using it exclusively in Parchi, perfect 100 tokens/s & 0 errors very good at browser use ----------------- Secret - Qwen3.5-27b - you will see when i'm done Targeting the following hardware levels: With full context 200-256k context in vllm, sglang, llama.cpp, exllamav3, and if people help MLX 16-32 GB - Qwen3.5-27b 32-48 GB - Qwen3-coder-next 48-128 GB - Qwen3.5-122B 128-256 GB - Qwen3.5-Plus-397B 196-512 GB - GLM-5.* I am training them on 22,000 samples at 16k context 352M of custom selected calibration datasets. My hope is to make the highest quality multimodal LLM compressions for this year. 20 GPUs running in parallel for the next 10 days - 8x H100s - Qwen - 4x B200s - GLM-5.* - 8x 3090s - Testing Once MiniMax-M2.7 is online 4 more GPUs will get to work.

0xSero's tweet photo. Qwen3.5, MiniMax-M2.7 are incredible acts of kindness that I don't think will be with us from so much longer.

Here's my update for you.

> I have 20 GPUs at full utilisation right now.

All these getting cooooompressed, no synthetic data

All runs will be done in 9 days, if I don't get a catastrophic failure - REAP for:
- GLM-5
- Qwen3-next-coder
- Qwen3.5-122B
- Qwen3.5-plus-397b

- Browser-use
- CUDA
- Terminal-use
- Coding
- Math
- Agentic trajectories
- 30% my personal chat session history

I am also removing refusals inspired by Prism. So no more I can't do this I can't do that blah blah

Inference for local AI

- Qwen3.5-262B-REAP - I've been using it exclusively in Parchi, perfect 100 tokens/s & 0 errors very good at browser use

-----------------

Secret

- Qwen3.5-27b - you will see when i'm done

Targeting the following hardware levels:

With full context 200-256k context in vllm, sglang, llama.cpp, exllamav3, and if people help MLX

16-32 GB - Qwen3.5-27b
32-48 GB - Qwen3-coder-next
48-128 GB - Qwen3.5-122B
128-256 GB - Qwen3.5-Plus-397B
196-512 GB - GLM-5.*

I am training them on 22,000 samples at 16k context

352M of custom selected calibration datasets.

My hope is to make the highest quality multimodal LLM compressions for this year.

20 GPUs running in parallel for the next 10 days

- 8x H100s - Qwen
- 4x B200s - GLM-5.*
- 8x 3090s - Testing

Once MiniMax-M2.7 is online 4 more GPUs will get to work.

35

660

18

204

25K

0

70

Small Model Lab

@SmallModelLab

2 months ago

@OpenAI Congrats, you can do a lot with $122B 🔥

0

72

Small Model Lab

@SmallModelLab

2 months ago

@morganlinton 🦾

0

31

Small Model Lab

@SmallModelLab

2 months ago

@0xSero Super interesting, thanks for sharing all the details, and I sure hope they’re with us for at least a bit longer!!

0

476

Small Model Lab

@SmallModelLab

2 months ago

@morganlinton Small models FTW 🕺

1

0

751

Small Model Lab

@SmallModelLab

3 months ago

Serious quant db heaven.

Quant Science

@quantscience_

3 months ago

Python is mind-boggling for finance. Case in point: There's a Finance database of 300,000 tickers. Available 100% for free:

8

703

88

993

38K

1

0

3

210

Small Model Lab

@SmallModelLab

3 months ago

@quantscience_ Yes it is, and this db is crazy!

0

1

0

135

Small Model Lab

@SmallModelLab

3 months ago

@KongBTC @morganlinton Yes it is 💪🦀

0

1

0

2

Small Model Lab

@SmallModelLab

3 months ago

@morganlinton You had me at Deeper Rust Phase 🦀

0

1

0

62

SmallModelLab retweeted

PyQuant News 🐍

@pyquantnews

3 months ago

Factor investing is what made me stand out at JPMorgan. But it took me years to master the information coefficient. In 1 minute, I'll teach you the 10 things you need to know (that took me 1 year to learn). Let's go: