沖津@ITエンジニア

いや、全然そんなことないな今触って��た感じではたぶんSonnet 5はちゃんとFable 5の軽量版になってて、使い心地がOpus 4.8より場面もあるはず(もちろん、タスクによってはOpus 4.8のほうがいい場面もあるだろうが) エフォートの指定に異様に敏感で、Maxにするとすごい長考をするのもFableぽい

Chubby♨️

@kimmonismus

2 days ago

Here is my first assessment of Sonnet 5: Sonnet 5 is better than Sonnet 4.6. Who would have thought? But jokes aside: Unfortunately, it is weaker than Opus 4.8 across all evals. Why they nevertheless labeled the latest Sonnet 5 iteration with a “5”, even though “4.8” would have been more fitting, is beyond me. Normally, major version jumps in particular signal a significant leap in capability. Be that as it may: Sonnet 5 is good, but worse than expected. Pricing has not changed; it is on the same level as its predecessor. Opus is still more expensive, but at the same time it also remains better. Overall, the release irritates me and leaves more questions than it answers. I cannot help but see Sonnet 5 as a release that stands in the context of Fable 5. There was no mention of Fable 5 at all, which surprises me a lot. I really would have expected us to get news about it at the same time. But nothing. Instead, we get an update to a new model series (“5”), but one that is not significant compared with the models we already have. As a result, there is a lingering aftertaste that Sonnet was released as something in between, perhaps also simply to release something at all and to stay part of the conversation, including in a positive sense. Why no Opus 5, when we know that Fable 5 already exists as a model that performs significantly better than 4.8, and when we can assume both that a better Opus exists internally and that it would not be difficult to update Opus to the new generation? Why “only” Sonnet 5? Because restraint is currently required. The major releases are currently being delayed across the board; they are still in discussions with regulators about how the truly powerful frontier releases can be carried out at all and under what conditions. In my view, the Sonnet 5 release has to be seen against this background. And as a result, at least for me, it was disappointing overall.

kimmonismus's tweet photo. Here is my first assessment of Sonnet 5:

Sonnet 5 is better than Sonnet 4.6. Who would have thought? But jokes aside: Unfortunately, it is weaker than Opus 4.8 across all evals. Why they nevertheless labeled the latest Sonnet 5 iteration with a “5”, even though “4.8” would have been more fitting, is beyond me. Normally, major version jumps in particular signal a significant leap in capability. Be that as it may: Sonnet 5 is good, but worse than expected.

Pricing has not changed; it is on the same level as its predecessor. Opus is still more expensive, but at the same time it also remains better. Overall, the release irritates me and leaves more questions than it answers.

I cannot help but see Sonnet 5 as a release that stands in the context of Fable 5. There was no mention of Fable 5 at all, which surprises me a lot. I really would have expected us to get news about it at the same time. But nothing. Instead, we get an update to a new model series (“5”), but one that is not significant compared with the models we already have.
As a result, there is a lingering aftertaste that Sonnet was released as something in between, perhaps also simply to release something at all and to stay part of the conversation, including in a positive sense. Why no Opus 5, when we know that Fable 5 already exists as a model that performs significantly better than 4.8, and when we can assume both that a better Opus exists internally and that it would not be difficult to update Opus to the new generation? Why “only” Sonnet 5?

Because restraint is currently required. The major releases are currently being delayed across the board; they are still in discussions with regulators about how the truly powerful frontier releases can be carried out at all and under what conditions. In my view, the Sonnet 5 release has to be seen against this background. And as a result, at least for me, it was disappointing overall.

794

108

294K

121

沖津@ITエンジニア @Oki0837

2 days ago

⚪︎⚪︎が登場する、という未来形の話じゃなくて⚪︎⚪︎が登場した、という過去形の話を聞きたいないい加減な予想を無責任に垂れ流す方が金になるんだろうけど

沖津@ITエンジニア @Oki0837

3 days ago

AWS SummitにいけなかったのでDevOps Agentのセッションだけ見ているのですが、なかなかつかみどころがないというか… 使って覚えるしかないのか��あ

沖津@ITエンジニア @Oki0837

5 days ago

仮にこのようなことを本当にするつもりなら、アメリカはアメリカ外の知識をAIモデルから除去すべきでしょうねデータを無断で収奪します、その成果は勝手に使います、��いう振る舞いは泥棒と変わらない AnthropicもAlibabaのDistillingを責められる立場にない

Chubby♨️

@kimmonismus

6 days ago

Honestly, I no longer believe that people outside the U.S. will still have access to frontier models, and even there, access will be limited. We are now witnessing the end of public access to frontier intelligence. It is a very sad and serious turn of events.

234

122

233

226K

沖津@ITエンジニア @Oki0837

6 days ago

RT 某社のJavaフレームワークを思い出すネーミングだ… こういうところ、別にAnthropicに前ならえしなくてもいいと思うんだけどな

Oki0837 retweeted

OpenAI

@OpenAI

6 days ago

Introducing a limited preview of GPT-5.6 Sol, our next generation frontier model, as well as GPT-5.6 Terra, a balanced model for efficient, everyday work, and GPT-5.6 Luna, a fast and affordable model for high-volume work. https://t.co/OoM83SyISN

40K

17M

沖津@ITエンジニア @Oki0837

6 days ago

これはちょっと同情というか、誰も「私の自宅のネットワーク構成は高度で価値があります」とは書いてないので、受け取り側に悪意があり過ぎかなと(悪意のあるエンジニアと少なからずいるので、それに対する耐性のあるポストをしたほうがいいという話ならば同意)

コンフィ

@confi2112

6 days ago

構成図はあくまで一例として出したつもりです今までの現場とか実績まで出すと特定が怖いので_(:3 」∠)_

14K

869

沖津@ITエンジニア @Oki0837

6 days ago

そういえば、中国がAIモデルで先行している例としてすでにSeeDanceがあるんだったなこのままAnthropicとかOpenAIが足止めされてくれるなら、LLMに関しても中国が先行しはじめるかも？

沖津@ITエンジニア @Oki0837

8 days ago

htmxは前職でNext.jsからの移行で採用したんですが、悪くなかったですね（スキトラが大変という問題はさておき）ほんとならVueに慣れてる人にお任せした方がよさそうなふんいき

Kuman

@KUMAN_R

9 days ago

お、htmx が採用されている。実際 htmx で事足りるケースは多いだろうな。 ▼ 価格.comをAI駆動で全面��新する https://t.co/2A8I6Ju8ru

197

128

25K

295

沖津@ITエンジニア @Oki0837

10 days ago

AIの応答を模倣してくれるモックが欲しいと思う今日この頃それはAIにしか務まらないからまったく無意味な要求なのはわかってるんだけど AIが処理に絡むアプリのテストのしづらさがだいぶ辛いなんかベストプラクティスはあるのかなぁ

沖津@ITエンジニア @Oki0837

11 days ago

COUNTに変更して…とか言ってたけど、特定のパスしかWAFでALLOWする必要がない(つまり他のパスは普通ならCOUNTにもBLOCKにもかからない)から絞り込みが必要なわけで。こういう話が通じないのはもう諦めるしかないんだろうなと

沖津@ITエンジニア @Oki0837

11 days ago

直近であったのが、ドメイン指定でALLOWしているWAFのルールをパス指定にしてセキュリティを高めるべきだという話で、複数のパスのうち条件に引っかかっているパスを見つけるのは簡単だが、条件にかかっていないパスはどうやって見つけるのかという話が一生伝わらなかったんだよな

沖津@ITエンジニア @Oki0837

11 days ago

これは本当に大事な話で、AWSの人=AWSに詳しい、みたいな直感的な想定を持っていると足元を掬われる彼らの提案はベストプラクティスに沿っているものの、細かい仕様同士の整合や論理的破綻まで見てくれるわけではないので、自分で苦しむしかない

takefumi@fixing @__takefumi__

12 days ago

AWSの方とお話しする機会があるのですが、大半が私よりも知識が少ないケースがあります。それはまだいいのですが、ビジネスを実現する方法として『AWS使うならこういう方法があるよ』と提案をお��さんにはするので、お客さんと知識が低いAWS担当者が会話してるとなんともいえない気持ちになります。

208

429K

420

沖津@ITエンジニア @Oki0837

11 days ago

RT 組織内で複数のプロダクトを管理したい��合、どういうポジションの人がそれらを管理する想定なんだろうプロダクト管理者間で常に合議する？（なんかあんまり合理性を感じない）

Oki0837 retweeted

杉本啓

@sugimoto_kei

12 days ago

正直って、EMとかテックリードとか、いらんと思っている。プロダクトを作るひとがいればいいのであって、なんでそんなに役割分担するのかわからん。自分のしたい範囲に閉じ込もることの合理化に見える。それに、ドメインもわからん上、技術もわからんで、プロダクトを作れるわけがなかろ。それはそのひとがプロダクト開発者としてジュニアだというだけだ。少しずつ学ばなきゃいけないわけだ。現状を受容してしまって組織論で対応するのはスジがわるい。 ��論は許す。

423

164

184K

沖津@ITエンジニア @Oki0837

11 days ago

私は全然Gemini使ってるからGeminiはゴミ、みたいな言い方されるのはちょっと納得いかないな ClaudeやGPTのトークンで処理するにはもったいない雑談とかはGeminiでやってるし業務用途では使えないのが問題なだけで、日常使いにはそんな悪くない…はず Youtubeの要約も得意

沖津@ITエンジニア

@Oki0837

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users