Latent Node @latent_node - Twitter Profile

Pinned Tweet

Latent Node

@latent_node

about 2 months ago

https://t.co/czP4jXQ9G3

4

25

3

26

22K

Latent Node

@latent_node

about 2 hours ago

@suchenzang You need to treat writing as code and do it in claude code, it actually works well, You can plan ahead and create plotline, characters, tone etc. and you can ensure the agent adheres to them.

0

1

0

265

Latent Node

@latent_node

about 2 hours ago

@eastdakota This is more common than you think, had someone offer me to fire my co founder and only then they would invest.

0

7

0

7K

Latent Node

@latent_node

about 3 hours ago

@ChanduThota if you use one of the optiq quants it is quite fast and fits in 16 gb VRAM amazing model -https://t.co/MaGZBxzgO2

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

0

1

0

8

Latent Node

@latent_node

about 3 hours ago

@nickmhc @rasbt if you use one of the optiq quants it is quite fast and fits in 16 gb VRAM amazing model - https://t.co/MaGZBxzgO2

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

0

16

Latent Node

@latent_node

about 3 hours ago

@IanBallantyne if you use one of the optiq quants it is quite fast and fits in 16 gb VRAM amazing model - https://t.co/ZJPurl2hHd

Latent Node

@latent_node

about 3 hours ago

@googlegemma if you use one of the optiq quants it is quite fast and fits in 16 gb VRAM amazing model - https://t.co/MaGZBxzgO2

0

79

0

14

Latent Node

@latent_node

about 3 hours ago

@0xOzp @LottoLabs How did you get the MTP layer did oyu train it?

1

0

10

Latent Node

@latent_node

about 3 hours ago

@googlegemma if you use one of the optiq quants it is quite fast and fits in 16 gb VRAM amazing model - https://t.co/MaGZBxzgO2

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

0

79

Latent Node

@latent_node

about 3 hours ago

@LyalinDotCom if you use one of the optiq quants it is quite fast and fits in 16 gb VRAM amazing model - https://t.co/MaGZBxzgO2

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

0

1

0

16

Latent Node

@latent_node

about 3 hours ago

@KazimAIZJU if you use one of the optiq quants it is quite fast and fits in 16 gb VRAM amazing model - https://t.co/MaGZBxzgO2

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

0

1

Latent Node

@latent_node

about 3 hours ago

@JustinLin610 if you use one of the optiq quants it is quite fast and fits in 16 gb VRAM amazing model - https://t.co/MaGZBxzgO2

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

0

82

Latent Node

@latent_node

about 3 hours ago

@PaulGugAI if you use one of the optiq quants it is quite fast as well - https://t.co/MaGZBxzgO2

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

0

4

Latent Node

@latent_node

about 3 hours ago

@matthewridenour @triswarkentin @o_lacombe @osanseviero @DynamicWebPaige That's wild it is also one of the most downlaoded optiq quants for mlx - https://t.co/MaGZBxzgO2

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

0

29

Latent Node

@latent_node

about 3 hours ago

@JulianPscheid @HedyAI_ The model is really good for local agentic workflows and lightenting fast - https://t.co/MaGZBxzgO2

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

0

7

Latent Node

@latent_node

about 3 hours ago

see - https://t.co/FDwNzWQZAF

0

1

0

12

Latent Node

@latent_node

about 3 hours ago

Gemma-4 12B, quantized with mlx-optiq, running on a Mac: 2.5x faster than bf16 (28.8 vs 11.6 tok/s on M3 Max), 2.7x smaller (8.9 GB vs 24 GB), fits in 16 GB of RAM. The sensitivity-aware 4-bit also beats naive 4-bit by +6.4 Capability Score (+13 long-context, +11.6 code, 93% GSM8K). No accuracy tax. pip install mlx-optiq.

1

3

0

268

Latent Node

@latent_node

about 3 hours ago

@PaulGugAI @Teknium @NeoAIForecast The optiq quants tend to be more accurate and faster compared to the same uniform 4 bit ones on mlx. With MTP it will be quite fast - https://t.co/UH381cyvJt

1

0

12