Alan Silva @AlanAiEngineer - Twitter Profile

Alan Silva

@AlanAiEngineer

about 13 hours ago

@mervenoyann RIP

0

29

Alan Silva

@AlanAiEngineer

about 23 hours ago

@AlexFromAtomic @atomic_chat_hq It was meant to be 3 GB MORE vram... That should be pretty obvious if you think about it for more than half a second. I tried 5 bit and 6 bit on 12GB vram, both ran fine, but the quality was still far inferior than Qwen 9B.

0

25

Alan Silva

@AlanAiEngineer

2 days ago

Our boys at Unsloth don't waste time!

Unsloth AI

@UnslothAI

2 days ago

@googlegemma Thank you Google Deepmind for constantly releasing open models! 🌟 We made Dynamic GGUFs so you can run Gemma 4 12B more efficiently: https://t.co/8cL321pVDh

22

957

59

352

41K

0

1

0

19

Alan Silva

@AlanAiEngineer

2 days ago

@UnslothAI @googlegemma Ok, this was fast.

0

1

0

457

Alan Silva

@AlanAiEngineer

3 days ago

Now this is smart engineering. Exactly what real AI builders aim for. Meanwhile, all the grifters on this platform are still out here parroting "just use Claude; local AI isn't useful enough"

stevibe

@stevibe

3 days ago

Qwen3.6 35B A3B can't fill out a paper form on its own. But give it NVIDIA's LocateAnything-3B — the #1 trending model on HuggingFace — as its eyes, and the two small models get it done together. (The test: place each element at the right pixel position on a blank form image, not type into a field.) Setup: > Qwen is the brain (main model), LocateAnything is the eyes (helper model acting as a tool). > I gave Qwen a new tool: ask "where's the email field?" and LocateAnything returns the exact x, y, width, height. > The blue boxes on the screen are its detections. Look how tight they are — it nails every field. Result: > Qwen3.6 35B A3B + LocateAnything-3B: form completed, all info correct. > Name, DOB, ID, gender, marital status, nationality, email, phone, address, postal code: all landed in the right field areas. > Character-box alignment still a touch loose, but every value is where it belongs. > 9m10s, 224.5k input, 24.3k output, 21 turns. Why it matters: > Qwen alone can't finish this test. Bolt on a 3B model that does exactly one thing > locate > and suddenly it can. > A combination of small models can do the work of a single large one.

84

3K

275

3K

144K

0

1

0

15

Alan Silva

@AlanAiEngineer

3 days ago

@GaryMarcus The moat does in fact exists, it's called Hardware. Who owns the hardware will own the future.

0

1

778

Alan Silva

@AlanAiEngineer

4 days ago

@TheAhmadOsman

0

65

Alan Silva

@AlanAiEngineer

4 days ago

I’m sorry, but I don’t think you have a clear understanding of this. Running models locally isn’t about “be competitive”. Anyone who buys into self-hosting understands this basic tradeoff: you sacrifice some speed and peak accuracy in exchange for greater privacy and full control. I don’t know what kind of hardware you’re running locally (if any), but if you believe your home setup or personal server is “competitive” with state-of-the-art cloud models, you’re being delusional. That said, the DGX Spark is already achieving very respectable speeds despite its memory bandwidth limitations.

0

6

Alan Silva

@AlanAiEngineer

4 days ago

@LottoLabs Yeah… They really should drop a 40B, a 120B MoE, and a 30B dense version. But that would create competition, and I doubt they want that right now, especially with their IPO coming up.

0

1

0

1

38

Alan Silva

@AlanAiEngineer

4 days ago

@Alibaba_Qwen Don’t hesitate to launch new models. If you’re worried that a new release could cannibalize your inference revenue, consider a staggered rollout of open‑weight models. Begin with the Qwen 3.7 9B, then after a month or two follow up with the 35B MOE and the 27B dense.

0

4

0

2K

Alan Silva

@AlanAiEngineer

4 days ago

Yo, Dwarkesh Patel the best tech, science, and history podcaster is launching his own platform. I'm super excited to see what he has in store!

Dwarkesh Patel

@dwarkesh_sp

4 days ago

Watch the full interview here: https://t.co/k1gs2R4CYG

1

24

7

25

10K

0

12

Alan Silva

@AlanAiEngineer

5 days ago

@Teknium I just saw that, congratulations!

0

1

0

497

Alan Silva

@AlanAiEngineer

5 days ago

@sudoingX That guy is a grifter through and through.

0

13

Alan Silva

@AlanAiEngineer

5 days ago

Now we are cooking with fire.

Nous Research

@NousResearch

5 days ago

Hermes Agent is now natively supported on @Windows

252

5K

386

998

5M

0

1

0

7

Alan Silva

@AlanAiEngineer

5 days ago

@NousResearch @Windows Biggest release of the Year!

0

55

Alan Silva

@AlanAiEngineer

5 days ago

If you have ever used a computer to do actual meaningful work, you already know exactly what agents are for. No debate there. I personally use Hermes Agent for deep web research and run hundreds of long brainstorming sessions with it. Some sessions go nowhere, but others spit out solid ideas and technical docs that I've used to build self-hosted AI tools for my work and hobbies. I also lean on it to benchmark local models and troubleshoot Windows 11 and Linux issues. Honestly, it just fits right into any dev workflow.

0

2

0

3

3K

Alan Silva

@AlanAiEngineer

5 days ago

Don't give attention to this "theo" guy he is basically the face of AI grifting right now. Constantly pushing the idea that local and self-hosted models are useless while openly shilling for the big labs. I keep seeing him trying starting drama around open source projects, while never contributing to any.

0

44

Alan Silva

@AlanAiEngineer

5 days ago

OMG! PewDiePie just dropped his own AI agent and it is awesome! I love to see things like this. Free and open source AI/Tools are the future and although corporations may try to stop it, "they can not stop the future" https://t.co/oIatDfX4I1

0

1

0

47

Alan Silva

@AlanAiEngineer

6 days ago

@ShishirShelke1 Are they trying to get a shot at Apples M+ with this?

0

1K

Alan Silva

@AlanAiEngineer

Last Seen Users on Sotwe

Trends for you

Most Popular Users