Alessandro @alew3 - Twitter Profile

alew3 retweeted

@Alibaba_Qwen

about 2 months ago

⚡ Meet Qwen3.6-35B-A3B：Now Open-Source！🚀🚀

A sparse MoE model, 35B total params, 3B active. Apache 2.0 license.

🔥 Agentic coding on par with models 10x its active size
📷 Strong multimodal perception and reasoning ability
🧠 Multimodal thinking + non-thinking modes

Efficient. Powerful. Versatile. Try it now👇

Blog：https://t.co/EXx5y466su
Qwen Studio：https://t.co/bg4tAU1p74
HuggingFace：https://t.co/w4pDX14DZS
ModelScope：https://t.co/SuRyLzdQiO
API（‘Qwen3.6-Flash’ on Model Studio）：Coming soon～ Stay tuned

445

11K

2K

5K

3M

alew3 retweeted

clandestine.eth 🦇🔊

@0xClandestine

about 2 months ago

Heterogeneous acceleration on Apple Silicon achieved.

ANE + GPU running in parallel.

Mirror SD with DFlash, ported to MLX, targeting ANE + GPU simultaneously.

The M-series was designed for this. We just hadn't unlocked it yet. https://t.co/raSH0CMN4V

30

309

22

207

15K

Alessandro @alew3

3 months ago

@ivanzugec @ernielm @felixrieseberg It appeared a few hours later. Thanks.

0

16

Alessandro @alew3

3 months ago

@ernielm @felixrieseberg It now appears in my mobile app as "Dispatch" under the burger menu.

0

43

Alessandro @alew3

3 months ago

@Tibbzzee @ChanaMessinger @felixrieseberg don't have it, just updated it.

0

1

0

40

Alessandro @alew3

3 months ago

@felixrieseberg I have Max 20. How do I get this working? I already updated the desktop and mobile apps.

2

0

1K

alew3 retweeted

Andrej Karpathy

@karpathy

3 months ago

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then:

- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)

The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc.

https://t.co/YCvOwwjOzF
Part code, part sci-fi, and a pinch of psychosis :)

1K

28K

4K

39K

11M
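
The loop described in the tweet above (propose a change, run a fixed-budget training run, keep the change only if validation loss drops) can be sketched roughly as follows. This is a toy illustration, not the code from the repo: `Experiment`, `run_training`, `propose`, and `research_loop` are hypothetical names, the "training run" is a made-up loss function rather than a real 5-minute run, and a seeded random perturbation stands in for the AI agent's edits and git commits.

```python
import random
from dataclasses import dataclass


@dataclass
class Experiment:
    """Hypothetical stand-in for one state of the training script."""
    lr: float = 0.1
    width: int = 64


def run_training(cfg: Experiment) -> float:
    """Toy replacement for a fixed-budget training run.

    Returns a pretend validation loss. The function is made up and has
    its minimum at lr=0.01, width=256; a real run would train a model.
    """
    return (cfg.lr - 0.01) ** 2 * 1e3 + ((cfg.width - 256) / 256) ** 2


def propose(cfg: Experiment, rng: random.Random) -> Experiment:
    """Stand-in for the agent's edit: randomly perturb one setting."""
    if rng.random() < 0.5:
        new_lr = max(1e-4, cfg.lr * rng.uniform(0.5, 1.5))
        return Experiment(lr=new_lr, width=cfg.width)
    new_width = max(8, cfg.width + rng.choice([-32, 32]))
    return Experiment(lr=cfg.lr, width=new_width)


def research_loop(steps: int, seed: int = 0):
    """Greedy keep-if-better loop over candidate settings."""
    rng = random.Random(seed)
    best = Experiment()
    best_loss = run_training(best)
    accepted = [best_loss]  # loosely analogous to the commit history
    for _ in range(steps):
        candidate = propose(best, rng)
        loss = run_training(candidate)
        if loss < best_loss:  # keep only changes that lower the loss
            best, best_loss = candidate, loss
            accepted.append(loss)
    return best, accepted


if __name__ == "__main__":
    best, accepted = research_loop(steps=200)
    print(best, len(accepted), accepted[-1])
```

Each entry in `accepted` corresponds to one kept change, so the list is strictly decreasing by construction; rejected candidates are simply discarded.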

Alessandro @alew3

5 months ago

@trq212 Just watched it, can you share the slides?

0

38

Alessandro @alew3

6 months ago

@awnihannun @angeloskath thanks!

0

26

alew3 retweeted

Rachel Thomas

@math_rachel

7 months ago

"People who go all in on AI agents now are guaranteeing their obsolescence. If you outsource all your thinking to computers, you stop upskilling, learning, and becoming more competent. AI is great at helping you learn." @jeremyphoward @NVIDIAAI https://t.co/s2ZIeHK3sq 2/

3

66

11

40

19K

Alessandro @alew3

8 months ago

@awnihannun Looking good! BTW, any reason MLX can't leverage the Neural Engine?

0

783

Alessandro @alew3

9 months ago

@awnihannun Wow! What's the syntax to get this going?

0

206

alew3 retweeted

OpenAI

@OpenAI

11 months ago

ChatGPT agent is ready to introduce itself. https://t.co/EOvjGJsf0R

378

4K

520

747

1M

Alessandro @alew3

12 months ago

@seeedstudio @huggingface any promotional coupon for this amazing kit?

1

0

15

alew3 retweeted

Charlie Marsh

@charliermarsh

about 1 year ago

You can set `UV_TORCH_BACKEND=auto` and uv will automatically install the right CUDA-enabled PyTorch for your machine, zero configuration https://t.co/fOb8Pn3qYk

71

2K

221

1K

191K

Alessandro @alew3

about 1 year ago

@Prince_Canuma @arcee_ai good luck at Apple 😜

1

0

29

Alessandro @alew3

about 1 year ago

@Prince_Canuma holy cow! that was fast!

1

0

137

Alessandro @alew3

about 1 year ago

@scottsLockedIn @maximelabonne this is to prevent other big tech from using it.

1

2

0

215

Alessandro @alew3

about 1 year ago

Meta has just released their Llama 4 models. The "Llama 4 Scout" variant features an impressive 10 million token context window. https://t.co/JIN22jqpmG #llama4 #llama https://t.co/nE0oHcB7uL

1

5

0

765
