🔰にっくす

Verified account

@_nix_

🔰ITインフラエンジニア/週末はテスラ乗り/日産サクラ/AWSからHPCまで幅広く/会社員しながら兼自社社長で複数業務委託/フルリモート20年以上/アニメや映画が好き/株も好き/B'zファンクラブ/妻と3人の息子+娘（トイプー）/最近はAIが仕事してます/ｵﾝ･ｻﾞ･ｴｯﾁﾞｼｭｯｼﾝ

長野 of 日本

Joined October 2009

2.2K Following

2.3K Followers

27.4K Posts

Pinned Tweet

🔰にっくす

about 4 years ago

ウチの社用犬🐶

18

230

6

2

0

🔰にっくす

about 7 hours ago

もう少しプロパーの人にしっかりして欲しいかな

0

1

0

0

60

🔰にっくす

about 15 hours ago

macOSくん、急にどうした？

___nix___'s tweet photo. macOSくん、急にどうした？ https://t.co/7QI8oaLwm0

0

2

0

0

142

🔰にっくす

about 16 hours ago

これはww

金のニワトリ

about 18 hours ago

gemma-4-12b-it（UD-Q4_K_XL）をお試し明らかに出力がおかしいのでベンチマークどころではない。。。お試しされた方、他の量子化モデルでも似たような感じでしょうか？

24

286

31

89

184K

0

2

1

0

2K

Who to follow

上地申吾@AWS 設計・構築、Datadog 導入・運用支援、SRE 導入支援

エーピーコミュニケーションズ / REIONE 元メインフレーム運用監視 JAWS-UG 沖縄支部 / JDDUG運営 2023-2025 Japan AWS All Certifications Engineers 2025-2026 AWS Community Builders 情報セキュリティスペシャリスト

こばやし 🇺🇸駐在GA州🍑

Director of IT / ITインフラとサイバーセキュリティ担当 / 2度目の🇺🇸 (前回はNJ) / 妻とティーンエイジャーの娘2人と生活 / クラフトビール🍺巡り / 会社では主に日米の架け橋(板挟み)やってます👊

@koikoi179305278

済み:IPA(IP・FE・AP・SC)／技術士1次／CCNA／LPIC-3(303)／Python(CE・CDA)／1陸技／1通施／電通主(伝送・線路)／工担(総)／1種 2種電工／甲種危険／簿記3級••••••••IPA(NW・DB)、簿記2級 #### 無言フォロー失礼します ####

🔰にっくす

about 24 hours ago

何だか期待できそうなサイズ GGUFってところも嬉しい

1 day ago

Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs. Google's new model, Gemma 4 12B Unified supports image, audio and 256K context. You can run and train the model via Unsloth Studio. GGUF: https://t.co/8cL321pVDh Guide: https://t.co/odRo9WjRpA

UnslothAI's tweet photo. Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs.

Google's new model, Gemma 4 12B Unified supports image, audio and 256K context.

You can run and train the model via Unsloth Studio.

GGUF: https://t.co/8cL321pVDh
Guide: https://t.co/odRo9WjRpA https://t.co/Ax09ZTXFF3

90

3K

332

1K

265K

0

2

0

0

160

___nix___ retweeted

1 day ago

OpenAI's GPT-OSS-120B runs on a single RTX 5090. it's a 59GB model in native MXFP4. it doesn't fit in 32GB of VRAM. the move is MoE offload: keep attention on the GPU, spill the expert weights to system RAM (llama.cpp --n-cpu-moe). this way, only 5.1B of 117B params fire per token, so the CPU side stays cheap. with reasoning on, measured on my box, temperature 0, ~100 items per task (MMLU 114): - MMLU 89.5 - GSM8K 97.0 - HumanEval 98.0 pass@1 - ARC-Challenge 95.0 that's a good frontier-grade scores, on one consumer GPU. ~~~ it is quite slow tho: 47 tok/s generation. that's because the experts live in RAM, so token speed waits on the CPU, not the 5090. prefill is fine with 473 tok/s at 512 ctx. it is generation that pays the offload tax. the model is usable, not fast. but you get a real frontier model you fully own, on hardware you can buy, for the price of patience.

witcheer's tweet photo. OpenAI's GPT-OSS-120B runs on a single RTX 5090.

it's a 59GB model in native MXFP4. it doesn't fit in 32GB of VRAM.
the move is MoE offload: keep attention on the GPU, spill the expert weights to system RAM (llama.cpp --n-cpu-moe).

this way, only 5.1B of 117B params fire per token, so the CPU side stays cheap.

with reasoning on, measured on my box, temperature 0, ~100 items per task (MMLU 114):

- MMLU 89.5
- GSM8K 97.0
- HumanEval 98.0 pass@1
- ARC-Challenge 95.0

that's a good frontier-grade scores, on one consumer GPU.

~~~
it is quite slow tho: 47 tok/s generation.

that's because the experts live in RAM, so token speed waits on the CPU, not the 5090.

prefill is fine with 473 tok/s at 512 ctx. it is generation that pays the offload tax.

the model is usable, not fast. but you get a real frontier model you fully own, on hardware you can buy, for the price of patience.

12

102

13

86

12K

🔰にっくす

1 day ago

これは良さそう

2 days ago

ブログ書きました：［速報］Windows上でLinuxコンテナの作成や実行ができる「WSL containers」発表 https://t.co/4dywY8LEyi

3

803

350

319

119K

0

0

1

0

3K

🔰にっくす

1 day ago

@Sagasa8045 重要なのは道具ではありません！何をするのかです！

1

2

0

0

7

🔰にっくす

1 day ago

GitHub Runner の IP が軒並み悪さしていていて困る GuardDuty にも検出されるし

0

1

0

0

138

🔰にっくす

1 day ago

Terraform あるある ❌https://t.co/6Avpt9WlCC ❌https://t.co/k17yKYl1Z4 ⭕️https://t.co/vHC6wyoOJD_region.current.region

0

2

0

0

105

___nix___ retweeted

村本章憲 Stamp CEO

1 day ago

https://t.co/Bo6DKMCdRx ここが変わるだけで日本大きく変わりそうなのにね。

0

1

1

0

198

🔰にっくす

1 day ago

持ち株は全てマイナスなのに、日経平均がプラスである理由を誰か説明してくださいおはようございます🌞

0

4

0

0

310

🔰にっくす

2 days ago

確かに Composer は悪く無い印象 Kimi ベースでしたよね

2 days ago

周りの人達が「Cursor の Composer 2.5 くらいでよい」みたいな話をし始めている。早さ、安さ、賢さのバランスがいいってことなんだろうな。自分も使っててそう思う。

0

61

8

14

5K

0

1

0

0

170

🔰にっくす

2 days ago

ChatGPT Atlas って結局何してくれるんだっけ？

0

2

0

0

157

🔰にっくす

2 days ago

MiniMax って、結局のところ大きいの？小さいの？

MiniMax (official) @MiniMax_AI

2 days ago

Watch M3 reach the frontier 🚀

18

426

21

45

43K

0

3

0

0

165

🔰にっくす

2 days ago

AWS で H100 を使った LLM の性能試験をやりたいのだけど中々 SI の容量が割り当てられないのよね 5分毎に様々なリージョンでループさせてる

0

1

0

0

98

🔰にっくす

2 days ago

パ〇ナさんの名前が無いのはなーぜなーぜ？

朝日新聞デジタル編成席 @asahicom

3 days ago

【速報】人材派遣大手5社、全国の派遣料金でカルテルの疑い。公取委立ち入り https://t.co/dv3jwrUVfn

asahicom's tweet photo. 【速報】人材派遣大手5社、全国の派遣料金でカルテルの疑い。公取委立ち入り
https://t.co/dv3jwrUVfn https://t.co/SBqisJAUHh

136

11K

5K

2K

4M

0

2

0

0

387

🔰にっくす

2 days ago

王翦将軍はどこ？？

0

1

0

0

156

🔰にっくす

2 days ago

映画『キングダム魂の決戦』6月2日(火)ワールドプレミア｜7月17日(金)公開豪華キャスト♪

1

1

0

0

291

Last Seen Users on Sotwe

Trends for you

Most Popular Users