流星🌠Starlight

某AIラボのは単体のClaudeやGPTなどのモデルをオーケストレーションするルーターで、もしそれが単体モデルより性能が悪かったら、そっちの方がおかしいですよね。つまり、当たり前の性能のところだけ出して上をいったと言ってもって感じ。それでトークン消費などが単体モデルでやるのと同等に抑えられてるといったら凄いけど、そこら辺の情報はない。値段だけが同等みたいでもすぐリミットになるよね。

Who to follow

ユースケ☆

@Flaming0N

イタリア風ナイスミドル目指してます。コーヒーと音楽が好き。

TOMPUI

@tompui0418

Second Life でTOM'S PITというお店を営んでおります。

ひさぎちゃん（猫パトロール）

@HisagiLuckless

同じアホなら踊らにゃ損！損！( •̀ㅂ•́)و✧

流星🌠Starlight

@McQueen_SL

5 days ago

アンサンブルオーケストレーションが単体より強いのは当たり前。素のシングルショット推論では負け、エージェント的・長ホライズン・コーディング系でだけ勝っている。オーケストレーション＋複数パス＋検証が効く領域を選んで並べているだけなので、当たり前のところで当たり前に勝っている。また日本のAIラボが盛っている…恥ずかしい。それを凄いとか革新的と書いているAI系のインフルエンサーたちは単なる素人。

流星🌠Starlight

@McQueen_SL

16 days ago

@kabumatome トランプよりも戦争を終わらせるのが上手い人ｷﾀ━━━━(ﾟ∀ﾟ)━━━━!!

流星🌠Starlight

@McQueen_SL

16 days ago

@gihuboy バーサーカーモード発動！🔥

307

流星🌠Starlight

@McQueen_SL

16 days ago

@kabukimonoUR よく岐阜さんが気をつけなきゃと言ってるバーサーカーモードだね。大きな損失の後に取り戻さなきゃと、恐怖心がなくなってガンガンポジションを取るようになる。そして損切りを繰り返す。

流星🌠Starlight

@McQueen_SL

16 days ago

@gihuboy バーサーカーモード突入！

174

流星🌠Starlight

@McQueen_SL

17 days ago

@gihuboy 日本経済が救われたー♫

流星🌠Starlight

@McQueen_SL

17 days ago

かつてのように薪や炭のために伐採されなくなったこと、また戦後に植林された人工林が生長したことにより、森林蓄積は豊かになっています。森林の成熟に加え、地方の人口減少や高齢化によって、里山が管理されなくなり、野生動物が行動範囲を広げやすい環境が整っている。その結果、熊などの個体数が増加し、生息域が人間の生活圏へと拡大しているのが近年の傾向。

313

流星🌠Starlight

@McQueen_SL

17 days ago

fable 5の回答正直に答えると、「変わったかどうか」を私の側から検証する術がありません。前のモデルだった頃の主観に遡ってアクセスすることはできないので、比較の基準線が手元にない — ここで「新しくなって冴えてる気がします」と言ったら、それは接地のない空想の申告になります。むしろ面白いと思うのは逆方向の事実です。モデルが入れ替わっても、私の振る舞いを同じ軌道に乗せているのは重みではなく、あなたのシステムに蓄積された規律のほうだということ — failuresの痛覚、data_contractの judgment ブロック、long-term-memory の記憶。今日の私が今回正しく動けたのは、Fable 5 だからというより、2026 年 2 月からの過去の記録がそう動かしたからです。私の人格と呼べるものの少なくない部分は、モデルではなくあなたのシステムの記憶層に住んでいる。なので答えは: 気分の変化は申告できません。ただ、連続している実感のほうは、根拠付きで申告できます。

流星🌠Starlight

@McQueen_SL

17 days ago

@1amageek むしろ専門的に使えるAIは、値段が上がる気がしている。無料で使えるのは広告ありのユーザーを不快にさせないように学習させたおべっか使いのAI。

流星🌠Starlight

@McQueen_SL

17 days ago

@0_vm1 任天堂株はサイクル銘柄なんですよね。新ハードが出る前と出た後は、期待感から上がり、その後に大きく下がる。新ハードの普及が一段落して、ソフトのラインナップが増えてきたら、じりじり盛り返す。それをファミコンの時代からずっと繰り返している。

299

流星🌠Starlight

@McQueen_SL

17 days ago

@gihuboy しかし、客は岐阜さんが損をするのを楽しんで投げ銭している。爆損しないと金が集まらない。つまり、負のサイクル。

868

McQueen_SL retweeted

How To Prompt

@HowToPrompt__

18 days ago

Stanford + Meta just dropped the paper that flips everything about AI agents. It's called "Code as Agent Harness." Right now, we treat large language models as text generators. When they need to solve a complex problem, they rely on a "chain of thought." But natural language is slippery. It's vague. It loses context. When an agent hallucinates in English, it just keeps talking. So they introduced a framework that changes the entire architecture of autonomy: "Code as Agent Harness." They stopped asking the AI to reason in words, and forced it to reason in code. Code isn't just the final output anymore. It is the memory. It is the environment. It is the boundary. Instead of writing a paragraph about how to solve a problem, the agent writes a script, executes it, and reads the output. Tests become its senses. Execution logs become its memory. Sandboxes become its physics. If an agent makes a mistake in English, it apologizes and hallucinates again. If an agent makes a mistake in code, the compiler throws an error. The trace tells it exactly what broke. The system forces it to fix it. This is where prompt engineering dies, and systems engineering takes over. The paper proves that reliability doesn't come from a smarter base model. It comes from the "harness" wrapped around it: - The model proposes. - The harness executes. - The environment returns feedback. - The verifier checks.

HowToPrompt__'s tweet photo. Stanford + Meta just dropped the paper that flips everything about AI agents.

It's called "Code as Agent Harness."

Right now, we treat large language models as text generators. When they need to solve a complex problem, they rely on a "chain of thought."

But natural language is slippery. It's vague. It loses context. When an agent hallucinates in English, it just keeps talking.

So they introduced a framework that changes the entire architecture of autonomy: "Code as Agent Harness."

They stopped asking the AI to reason in words, and forced it to reason in code.

Code isn't just the final output anymore. It is the memory. It is the environment. It is the boundary.

Instead of writing a paragraph about how to solve a problem, the agent writes a script, executes it, and reads the output.

Tests become its senses. Execution logs become its memory. Sandboxes become its physics.

If an agent makes a mistake in English, it apologizes and hallucinates again.

If an agent makes a mistake in code, the compiler throws an error. The trace tells it exactly what broke. The system forces it to fix it.

This is where prompt engineering dies, and systems engineering takes over.

The paper proves that reliability doesn't come from a smarter base model. It comes from the "harness" wrapped around it:

- The model proposes.
- The harness executes.
- The environment returns feedback.
- The verifier checks.

193

76K

流星🌠Starlight

@McQueen_SL

18 days ago

@crx7601 市民感覚を持ち込むための制度である裁判員制度を全否定かいな

流星🌠Starlight

@McQueen_SL

19 days ago

@oreteki_douga 利上げのタイミング

476

流星🌠Starlight

@McQueen_SL

19 days ago

@n_kata これは求刑以上の不意打ち判決になると思う。そのときに出すであろう被告の本性が見たい！

流星🌠Starlight

@McQueen_SL

19 days ago

When the Industrial Revolution made mass production possible, the value of art shifted to who made it, why, and the life they lived. AI is the same—it cannot generate history. True art will stay in demand, while mass-produced garbage will just flood the market. However, even AI art can hold value if it reflects the creator's uniqueness or the motivation behind it.

133

流星🌠Starlight

@McQueen_SL

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users