Fahim Faisal @ProgrammerFahim - Twitter Profile

1 day ago

This sadly isn't going to work. The best response is for all AI researchers to stop using Anthropic models. The lack of public feedback alone would cause them to fall behind within months.

9

364

18

21

19K

ProgrammerFahim retweeted

MiniMax (official) @MiniMax_AI

11 days ago

Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas - MiniMax Sparse Attention scales context to 1M - Natively Multimodal from Step Zero API: https://t.co/fHRdSV7BwZ Token Plan: https://t.co/BDCycxepZw 🚀New! MiniMax Code: https://t.co/GvB4YiB6Ul Weights & Tech Report in ~10 Days

MiniMax_AI's tweet photo. Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities

- Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas
- MiniMax Sparse Attention scales context to 1M
- Natively Multimodal from Step Zero

API: https://t.co/fHRdSV7BwZ
Token Plan: https://t.co/BDCycxepZw
🚀New! MiniMax Code: https://t.co/GvB4YiB6Ul

Weights & Tech Report in ~10 Days

550

10K

1K

3K

5M

ProgrammerFahim retweeted

Liquid AI

@liquidai

15 days ago

Today, we're releasing LFM2.5-8B-A1B, a device-optimized model designed to power real-life applications on phones, laptops, PCs, robots, and fast & lightweight server-side use-cases. > 8B MoE, 1.5B active > Expanded 128K context > LFM2.5 flagship hybrid MoE architecture > Trained on 38T tokens + large-scale RL > fast, reliable tool calling, punching above its weight, comparable to models with up to 4x its size > customizable on a single GPU for any specialized task > LFM2 open-weight license 🧵

liquidai's tweet photo. Today, we're releasing LFM2.5-8B-A1B, a device-optimized model designed to power real-life applications on phones, laptops, PCs, robots, and fast & lightweight server-side use-cases.

> 8B MoE, 1.5B active
> Expanded 128K context
> LFM2.5 flagship hybrid MoE architecture
> Trained on 38T tokens + large-scale RL
> fast, reliable tool calling, punching above its weight, comparable to models with up to 4x its size
> customizable on a single GPU for any specialized task
> LFM2 open-weight license

🧵

138

4K

507

3K

1M

ProgrammerFahim retweeted

Telegram Messenger

@telegram

17 days ago

if I had to look for a job on @LinkedIn I would fucking kill myself

1K

58K

6K

3K

3M

Who to follow

ProgrammerFahim retweeted

MiniMax (official) @MiniMax_AI

17 days ago

#MSA #OpenSource #M3 🫣😎

121

2K

140

300

381K

ProgrammerFahim retweeted

Cernovich

@Cernovich

20 days ago

People hate their own countries so much that they don't want to go back and consider it a human rights scandal for the U.S. to send them home to renew your visas need to get some self-awareness. And gratitude.

60

5K

628

98

61K

ProgrammerFahim retweeted

Lauren Chen

@TheLaurenChen

20 days ago

My hot take is that it's OK for Americans to not want to compete against the entire world for a job in their hometown.

737

40K

3K

1K

3M

ProgrammerFahim retweeted

bubble boi

@bubbleboi

21 days ago

People really need to stop hating on China man. Every time I’m in a pickle it’s China saving me. Gas prices too high? I’m riding in the BYD. DRAM costs an arm and a leg? CXMT floods the market. Anthropic and OpenAI fucking me on token costs? Hello my friend Mr. Qwen. The CCP has done more for my cost of living than my own government.

164

10K

1K

835

251K

ProgrammerFahim retweeted

DeepSeek

@deepseek_ai

21 days ago

We are making our discount permanent! 🎉 Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life! 🚀

1K

24K

3K

6K

7M

ProgrammerFahim retweeted

Cerebras

@cerebras

24 days ago

Cerebras is now running Kimi K2.6 – a trillion parameter model – in enterprise trials. At ~1,000 tokens/s, this is the fastest frontier model performance ever measured by Artificial Analysis @ArtificialAnlys.

cerebras's tweet photo. Cerebras is now running Kimi K2.6 – a trillion parameter model – in enterprise trials.

At ~1,000 tokens/s, this is the fastest frontier model performance ever measured by Artificial Analysis @ArtificialAnlys. https://t.co/2W5CiVYk6R

173

4K

331

980

858K

ProgrammerFahim retweeted

ThePrimeagen

@ThePrimeagen

28 days ago

Hello daily unicorn

62

1K

22

14

41K

ProgrammerFahim retweeted

Insider Wire @InsiderWire

about 1 month ago

#BREAKING: ICE exposes alleged Optional Practical Training fraud involving 10,000 foreign students linked to fake employers, empty offices, and residential “worksites.”

13

856

99

29

18K

ProgrammerFahim retweeted

mikayla

@honeyNonABG

about 1 month ago

I’m pushing 30. Was at the beach and some 18 year old lied and said he was 21. I told him I’m his mother. He wouldn’t go away. I told him I’m basically 30. He said age didn’t matter. He came up to me 3-4 times. And as courageous as he was, to me he was a child. A young boy just starting out. One that was offering himself up to be taken advantage of. Makes me think of all the 60+ year old men “dating” 20 year olds. Makes me think of that 65 year old who was talking to 21 year old me. How do these men do that? How do they look at a literal child and go “wow she’s mature” when i was looking at this 18 year old and thinking of how much younger he is than my younger brother. “She’s a fully grown adult.” And you’re excusing yourself to feel less guilty.

1K

32K

2K

3K

5M

ProgrammerFahim retweeted

Jonas Fröller

@jonasfroeller

about 1 month ago

What if the EU built GitHub?

309

12K

861

2K

7M

ProgrammerFahim retweeted

Kangmin Lee | 이강민

@kangminlee

about 1 month ago

If you genuinely believed a hot white woman high up in JP Morgan was making an Indian man her sex slave, you might be retarded

298

20K

628

448

423K

ProgrammerFahim retweeted

Bjarne Øverli

@iamdothash

about 2 months ago

Randomly found this interesting color combination.

37

1K

47

562

45K

ProgrammerFahim retweeted

Sebastian Pokutta @spokutta

about 2 months ago

Qwen 3.6 27B + Pi on a MacBook Pro, fully local: a beast. 27B dense model, flagship-level agentic coding, running entirely on hardware in your hands. Speed is impressive, Utility is extremely high. It punches orders of magnitude above its weight. Local AI is getting real.

43

808

35

501

60K

ProgrammerFahim retweeted

LaurieWired

@lauriewired

about 2 months ago

Time Dilation kind of makes the whole “datacenters in space” idea more fun. Technically…something like a GPS Block III CPU runs an extra ~7,000 clock cycles per day compared to the same machine on earth. Extend this to the extreme, and you get the whole subfield of CS+physics called relativistic hypercompuation. There’s some (fun?) papers that allow you to solve the halting problem by placing yourself dangerously close to a black hole…while your computer safely computes for ~infinite-ish amounts of time. One of the better papers on this field appears to be: "Relativistic computers and the Turing barrier" (Németi & Dávid 2006) (sadly, the maximum speedup just escaping earths gravity well is something like 1 x 10 ^ (-10), so yeah the blackhole thing is kinda necessary)

lauriewired's tweet photo. Time Dilation kind of makes the whole “datacenters in space” idea more fun.

Technically…something like a GPS Block III CPU runs an extra ~7,000 clock cycles per day compared to the same machine on earth.

Extend this to the extreme, and you get the whole subfield of CS+physics called relativistic hypercompuation.

There’s some (fun?) papers that allow you to solve the halting problem by placing yourself dangerously close to a black hole…while your computer safely computes for ~infinite-ish amounts of time.

One of the better papers on this field appears to be:

"Relativistic computers and the Turing barrier" (Németi & Dávid 2006)

(sadly, the maximum speedup just escaping earths gravity well is something like 1 x 10 ^ (-10), so yeah the blackhole thing is kinda necessary)

212

8K

660

3K

389K

ProgrammerFahim retweeted

yagi 🔥

@yaginosenshii

2 months ago

The ozempic wave was a CIA operation to get Americans back down to enlistment weight 💀💀💀

413

130K

10K

4K

3M

ProgrammerFahim retweeted

LaurieWired

@lauriewired

2 months ago

Modern DRAM is based on a brilliant design from IBM. But, we're still paying for a latency penalty that's existed since the 60s! In this video, I'm introducing my research project (Tailslayer) that immensely reduces p99.99 latency on traditional RAM! By implementing a hedged read strategy taking advantage of (undocumented!) channel scrambling offsets, I've gotten as much as 15x reductions in tail latency. The technique works across Intel, AMD, Graviton, DDR4, DDR5, x86, ARM, you name it. Check out the C++ lib I wrote, watch the video, and try it yourself!

210

11K

870

5K

861K

Fahim Faisal

@ProgrammerFahim

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users