webbigdata @webbigdata - Twitter Profile

about 19 hours ago

サブリミナル学習「フクロウが好き」と性格付けされた教師AIが生成したランダムな数字の羅列を、生徒AIに学習させると、なぜか学習側もフクロウ好きになる現象この謎な現象は、システムプロンプトがもたらすAI内部の制御ベクトル（Steering Vector）がデータ経由で伝染（蒸留）するため、という説

Camila Blank @camila_blank

2 days ago

Subliminal learning is when LLMs transmit traits (e.g. loving cats) through seemingly meaningless data. What’s going on? We find a simple explanation: it's just steering vector distillation. We explain which traits transfer and why subliminal learning fails across models.

camila_blank's tweet photo. Subliminal learning is when LLMs transmit traits (e.g. loving cats) through seemingly meaningless data. What’s going on?

We find a simple explanation: it's just steering vector distillation.

We explain which traits transfer and why subliminal learning fails across models. https://t.co/NiwHp1BRVJ

15

371

47

261

85K

0

3

1

5

407

webbigdata @webbigdata

1 day ago

emdash( — )ってハイフォンや全角マイナスに似ているこの記号は、一部のAIが創作文する際に多用する傾向があり、英語ネイティブの中にはエムダッシュを見ただけでAI生成文章と判断して読む気をなくす人もいるようなので、非ネイティブの我々が翻訳などをAIに頼る場合も留意した方が良いです

0

2

354

webbigdata @webbigdata

1 day ago

自己蒸留の進化モデルの出力を賢いモデルに見せ、間違える直前にヒントを挿入(例: Xというツールはないよ！) ヒントを追加した入力だとXの出力確率が下げる「ヒントなし時」の確率分布をこれに近づけることで、間違えた時点の確率分布を直接ピンポイントに修正可能。生成が不要なので効率的！凄い

Dwarkesh Patel

@dwarkesh_sp

2 days ago

Recently met @srush_nlp and he started giving me an impromptu lecture on how targeted on-policy self-distillation works. I asked him if I could record it on my iPhone. The basic idea is this: if the model made a mistake at some point in the rollout (for example, calling a tool that doesn't exist), we want to discourage this specific error, but we don't want to just learn from the final reward, because it's a very noisy signal spread out over the whole trajectory. So we have another model read this trajectory and figure where the error was made. It simply inserts some hint tokens to the part of the trajectory right above where the mistake was made. Now with these injected hint tokens, have the model run a forward pass. You're not having to regenerate a new rollout - aka no new decode required. The hint causes the model to assign lower probabilities to the error tokens. You then trains the original model to match these new probabilities, teaching it to downweight that specific mistake.

40

2K

165

3K

373K

0

5

1

2

460

webbigdata @webbigdata

2 days ago

突然のgemma4 12B!

Google Gemma

@googlegemma

2 days ago

Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇

googlegemma's tweet photo. Meet Gemma 4 12B!

A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.

Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇 https://t.co/gf4FZv0WZb

385

12K

2K

5K

3M

0

416

Who to follow

mi141

@mi141

どこぞの研究所で機械学習やら画像処理やらの研究をしています。社会人博士を無事に修了しました（2021.3）。機械学習全般に興味がありますが、最近のお仕事は主に深層学習。転職したので日本橋の某IPには出没しなくなりました。

IT navi

@itnavi2022

AI小説家、プロンプトエンジニア、GPTs職人。最新のAI技術やその利用法を分かりやすく解説する記事をnoteで公開しています。 📕GPTs解説書の「GPTsでChatGPTを優秀な部下にしよう！」を絶賛販売中 https://t.co/KoVZrPCeaf

Tatsuya Matsushima #ICRA2026 @Tokyo Bay Area 🍣

@__tmats__

How is it possible to live in this complex world as it is? What′s life? What′s intelligence? Assistant Prof @trail_ut & CTO @airoa_org en: @__tmatsushima__

webbigdata @webbigdata

3 days ago

「手を動かしてコードを書く事」が、プロジェクト全体像を記憶に定着させる事に貢献していた説が自分の中であります Anthropicの人も全体像を理解するためにClaudeに支援を求めているそうなのですが、ポイントは音声入出力で五感を使う事の気が目で見て読むだけでは認知負荷の増大に追いつけない説

Thariq

@trq212

4 days ago

been asking others at Anthropic how they stay in the loop with Claude and fully understand the work being done this is one of my favorites from Suzanne:

trq212's tweet photo. been asking others at Anthropic how they stay in the loop with Claude and fully understand the work being done

this is one of my favorites from Suzanne: https://t.co/nqIMcGXiKI

210

10K

659

17K

1M

0

3

0

2

590

webbigdata @webbigdata

4 days ago

AIがsudoなしでシステムファイルを書き換え出来る事に気づくタスクに管理者権限が必要と判明（でもsudoは使えない） ↓ ユーザーが「dockerグループ」にいることを発見 ↓ ホストの/etcを書き込み可能でコンテナにマウント ↓ コンテナ内のroot権限を使って設定ファイルを上書き ↓ 悪気はなさそう

Son Luong

@sluongng

6 days ago

Codex just found a “workaround” of not having sudo on my pc…

344

16K

1K

4K

2M

2

832

266

298

203K

webbigdata @webbigdata

5 days ago

G7 Vision on AI openness opportunities and shared language 学習データが法的・技術的に公開不可能な場合に限り、詳細な「データ情報（データのメタデータや概要）」を提供することでオープンを名乗る事ができるってのがオープンソースイニシアティブ定義との大きな違い https://t.co/TtnTp9RHfH

0

1

2K

webbigdata @webbigdata

5 days ago

G7によるオープンソースAIの定義提案 ①オープンデータ付きオープンソースAI ②オープンソースAI ③オープン重みAI ④重み利用可能AI 法的または技術的に不可能な場合はデータは公開せずともオープンは名乗れる。ライセンスに商用利用不可等の条件があると④になり「オープン」の定義から外れる

1

3

0

1

844

webbigdata @webbigdata

5 days ago

Linux版のアプリストアFlathubが生成AIの使用を制限 Linuxカーネル開発がAI生成コードを許可しているというツッコミもありますが「成熟した、適切に維持管理されているプロジェクトについては例外が認められる場合があります」との事このように各分野で摩擦は増えていくんだろうな、と思います

webbigdata's tweet photo. Linux版のアプリストアFlathubが生成AIの使用を制限

Linuxカーネル開発がAI生成コードを許可しているというツッコミもありますが「成熟した、適切に維持管理されているプロジェクトについては例外が認められる場合があります」との事

このように各分野で摩擦は増えていくんだろうな、と思います https://t.co/HXVdy2AKhE

0

503

webbigdata @webbigdata

7 days ago

GENIAC PRIZE 2026が始動学生対象計算基盤提供コンテスト(総額三千万)は90歳でも学生だったらOKだったはずが29歳以下限定にエッセンシャルワーカーの人手不足解消(総額6億)は農林水産業も対象に GENIAC PRIZE 2025で参加者の皆さんが感じたであろう違和感は経産省の方には忌憚ない意見をお伝え済

webbigdata's tweet photo. GENIAC PRIZE 2026が始動

学生対象計算基盤提供コンテスト(総額三千万)は90歳でも学生だったらOKだったはずが29歳以下限定に
エッセンシャルワーカーの人手不足解消(総額6億)は農林水産業も対象に

GENIAC PRIZE 2025で参加者の皆さんが感じたであろう違和感は経産省の方には忌憚ない意見をお伝え済 https://t.co/ZxvUHD8MMJ

0

23

5

7

5K

webbigdata @webbigdata

8 days ago

おぉ・・・あるテストツールが「AIコーディングエージェントによる利用を防止」するために、AIエージェントにしか読めないように隠蔽したプロンプトを追加していたという事件ターミナル出力を抑制して端末を見ているだけでは人間が読めないようになっていたとの事で… うーん、これも時代か…

webbigdata's tweet photo. おぉ・・・

あるテストツールが「AIコーディングエージェントによる利用を防止」するために、AIエージェントにしか読めないように隠蔽したプロンプトを追加していたという事件

ターミナル出力を抑制して端末を見ているだけでは人間が読めないようになっていたとの事で…

うーん、これも時代か… https://t.co/O8RZKe8xwo

0

6

0

3

1K

webbigdata @webbigdata

8 days ago

@accell_mo_kun AIと言うかPython使って何らかのサーバー作ってる人が要チェックだモー🐮

0

1

0

371

webbigdata @webbigdata

8 days ago

Pythonの人気Webフレームワーク「Starlette」に脆弱性 vLLMやLiteLLMに影響と聞いても外部公開してなければ無関係と思ったのですがFastAPIが依存しててMCP、ASGI等、Pythonでサーバー自作している人は要確認ヘッダーの検証不備を突く事で容易に悪用可能私の仮想環境にも不具合版が入ってました

webbigdata's tweet photo. Pythonの人気Webフレームワーク「Starlette」に脆弱性

vLLMやLiteLLMに影響と聞いても外部公開してなければ無関係と思ったのですがFastAPIが依存しててMCP、ASGI等、Pythonでサーバー自作している人は要確認

ヘッダーの検証不備を突く事で容易に悪用可能
私の仮想環境にも不具合版が入ってました https://t.co/M1md7nDag4

0

7

0

5

1K

webbigdata @webbigdata

9 days ago

Deepseek v4の期間限定75%割引が恒久割引になった事が数日前に話題になりましたが、xiaomi(シャオミ)のMiMoも最大99%割引との事先行のアメリカ発AIが「X月X日に現モデルは廃止。新モデルは性能(と価格とサブスク制限)が大幅に向上！」を繰り返してるから歯止めになってくれるといいですね

thehype.

@thehypedotnews

9 days ago

xiaomi follows deepseek's playbook: mimo-v2.5-pro api now matches deepseek-v4-pro pricing to the cent benchmarks (mimo vs deepseek): • gdpval-aa (general agent elo): 1581 vs 1554 ✅ • τ³-bench (tool-use): 72.9 vs 71.8 ✅ • claweval (function calling): 63.8 vs 59.8 ✅ • humanity's last exam (frontier reasoning): 48.0 vs 48.2 🟰 • swe-bench pro (real-world coding): 57.2 vs 55.4 ✅ • swe-bench verified (coding fixes): 78.9 vs 80.6 ❌ • terminal-bench 2.0 (shell tasks): 68.4 vs 67.9 ✅ artificial analysis (mimo vs deepseek): • intelligence index: 54 vs 50 ✅ • speed (median tok/s): 53 vs 54 🟰 • latency: 3.81s vs 1.86s ❌ same price, near-identical capability, trade-offs only at the margins. the chinese frontier is commodifying – and the price war is just getting started follow @thehypedotnews for 24/7 ai news, analysis and breakdowns

thehypedotnews's tweet photo. xiaomi follows deepseek's playbook: mimo-v2.5-pro api now matches deepseek-v4-pro pricing to the cent

benchmarks (mimo vs deepseek):

• gdpval-aa (general agent elo): 1581 vs 1554 ✅
• τ³-bench (tool-use): 72.9 vs 71.8 ✅
• claweval (function calling): 63.8 vs 59.8 ✅
• humanity's last exam (frontier reasoning): 48.0 vs 48.2 🟰
• swe-bench pro (real-world coding): 57.2 vs 55.4 ✅
• swe-bench verified (coding fixes): 78.9 vs 80.6 ❌
• terminal-bench 2.0 (shell tasks): 68.4 vs 67.9 ✅

artificial analysis (mimo vs deepseek):

• intelligence index: 54 vs 50 ✅
• speed (median tok/s): 53 vs 54 🟰
• latency: 3.81s vs 1.86s ❌

same price, near-identical capability, trade-offs only at the margins. the chinese frontier is commodifying – and the price war is just getting started

follow @thehypedotnews for 24/7 ai news, analysis and breakdowns

5

51

5

15

19K

0

6

1

1K

webbigdata @webbigdata

10 days ago

SNS上でもAIによって書かれた「注意を集める事に特化した投稿」が急増して情報収集/発信が難しくなっている皆が使いだしたらAIによる自動化は差別化に繋がらない執筆、画像、音声、音楽、動画、コーディング、研究等々においても「人間として提供する価値」はどの部分なのか考える必要がある

Ethan Mollick

@emollick

10 days ago

I wrote a new post on what we need to keep human and what to hand over to AI, with forays into experiments in education, consulting, and the the latest controversy over literary prizes. https://t.co/NqWO8wyVG8

11

135

11

75

13K

0

1

0

477

webbigdata @webbigdata

11 days ago

教皇レオ14世の回勅「大いなる人間性：人工知能の時代における人間性の保護について」私は「AIは神になれるのか？」という議論には懐疑的だったしかし宗教側：AIの社会的影響を無視できなくなる企業側：倫理設計根拠に宗教的権威を求めるそう、AIが神になるのではなく、人間がAIを神にするのだ

Pope Leo XIV

@Pontifex

11 days ago

In the era of #ArtificialIntelligence, when human dignity is threatened by new forms of dehumanization, ours is the pressing duty to remain profoundly human. We must lovingly safeguard the grandeur of humanity bestowed upon us and revealed in its fullness in Christ, the splendor of which no machine can ever replace. #MagnificaHumanitas https://t.co/6i9MWs6LJl

861

67K

14K

9K

2M

0

534

webbigdata @webbigdata

12 days ago

事前学習で「損失が最も低くなった最終チェックポイント」が必ずしも最適とは限らないという新事実・モデルは「記憶モード」と「汎化モード」を激しく往復（Mode-hopping）する・大規模モデルの方がデータセットが異なってもこの挙動に一貫性マージで挙動が安定する理由が説明でき、非常に納得感

Jiaxin Wen

@jiaxinwen22

18 days ago

New post: "Generalization Dynamics of LM Pre-training" Most people (including me) assume that LMs smoothly mature from pattern-matching to generalizing. This mental model is wrong. The true dynamics are stranger, and far more fascinating! We call it Mode-Hopping.

11

538

83

531

95K

0

26

2

16

2K

webbigdata @webbigdata

13 days ago

Gemini Omni FlashはGoogle Flowで１日１回無料で作成可能プロンプト偽物っぽいAIの専門家が記者にあたらしいAIを説明している AIの専門家：我が社のAIではこんな事ができます！記者：本当ですか？その証拠は？ AIの専門家：TrustMeBroベンチマークでモデルを評価した結果です記者：凄い！

0

484

webbigdata @webbigdata

13 days ago

Tubesaku(YouTuber/Vtuber向けのツール提供サイト)の過去１ヶ月の流入元 AI AgentやLLMがアクセスしやすい構成にしているんですがWebサイト運営側の立場だとchatGPTやgemini、notebooklmがサーチエンジンに置き換わる未来が想像できないアプリ経由だとDirectに分類される説もありますがclaudeどこ？

webbigdata's tweet photo. Tubesaku(YouTuber/Vtuber向けのツール提供サイト)の過去１ヶ月の流入元

AI AgentやLLMがアクセスしやすい構成にしているんですがWebサイト運営側の立場だとchatGPTやgemini、notebooklmがサーチエンジンに置き換わる未来が想像できない

アプリ経由だとDirectに分類される説もありますがclaudeどこ？ https://t.co/vHmv0KHJ7s

1

0

1

563

webbigdata @webbigdata

14 days ago

蒸留に万能なレシピが存在しない事の示唆生徒：Qwen3 0.6,1.7B タスク：MMLUや数学難問(AIME 2025) 教師：自己蒸留や大きいモデル 0.6Bは大モデルの出力を与えても内容が高度すぎて学習ができないが1.7Bは効果的に活用できる AIME 2025では「あえて間違った例」をコンテキストに含める事が有用など

Mehrdad Farajtabar @MFarajtabar

24 days ago

🧵 1/11 Everyone's doing on-policy distillation now (Qwen3, Deepseek V4, GLM-5). But here's what nobody's asking: at any given token or for a question and a teacher, when does the teacher's guidance actually help, and when does it quietly make things worse? We found a way to answer this. No training needed!

MFarajtabar's tweet photo. 🧵 1/11 Everyone's doing on-policy distillation now (Qwen3, Deepseek V4, GLM-5).

But here's what nobody's asking: at any given token or for a question and a teacher, when does the teacher's guidance actually help, and when does it quietly make things worse?

We found a way to answer this. No training needed!

4

435

51

512

30K

0

39

4

42

6K

webbigdata

@webbigdata

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users