Starlaxy @StarIaxy - Twitter Profile

@threeaus

Anthropic's philosopher @AmandaAskell recently gave an interview in which she shared a method she uses to explore fields she is curious about.

The prompt goes roughly like this:

I'd like you to pick a roughly graduate-level concept from the field of "xx". Then I'd like you to convey that concept in full, but indirectly, by writing a fable. Ideally the reader only gradually realizes what the concept actually is close to the end. After the story, add an explanation that spells out the concept you were really describing.
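The prompt described above has a single blank (the field), so it can be kept as a small reusable template. The sketch below is only one possible way to do that: the function name `build_fable_prompt` is hypothetical, and the English wording is a paraphrase of the tweet's summary, not Amanda Askell's exact prompt.

```python
def build_fable_prompt(field: str) -> str:
    """Fill a field name into a paraphrase of the fable prompt from the tweet.

    The wording is an assumption: the tweet itself only gives a rough summary.
    """
    field = field.strip()
    if not field:
        raise ValueError("field must be a non-empty string")
    return (
        f"I'd like you to pick a roughly graduate-level concept from the field of {field}. "
        "Then convey that concept in full, but indirectly, by writing a fable. "
        "Ideally the reader only gradually realizes what the concept is near the end. "
        "After the story, add an explanation that states the concept plainly."
    )


# Example: build the prompt for one field and print it for pasting into a chat.
prompt = build_fable_prompt("information theory")
print(prompt)
```

The resulting string can be pasted into any chat interface; nothing here depends on a particular model or API.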


StarIaxy retweeted

小島秀夫

@Kojima_Hideo

2 months ago


StarIaxy retweeted

@BoWang87

4 months ago

ByteDance just published something I've been waiting for someone to build: CUDA Agent!

It trained a model that writes fast CUDA kernels. Not just correct ones: actually optimized ones.

It beats torch.compile by 2× on simple/medium kernels, ~92% on complex ones, and even outperforms Claude Opus 4.5 and Gemini 3 Pro by ~40% on the hardest setting.

The key idea is simple but kind of brilliant:

CUDA performance isn’t about correctness, it’s about hardware. Warps, memory bandwidth, bank conflicts: the stuff you only see in a profiler.

So instead of rewarding “did it compile?”, they reward actual GPU speed. Real profiling numbers. RL trained directly on performance.

That’s a big shift.

Paper: https://t.co/EYx7QKosgk
Project: https://t.co/pTCfzQIBes
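The tweet says the reward is measured GPU speed, not "did it compile?", but it does not give the formula. The sketch below is therefore a guess at what such a reward could look like, not the paper's method. It assumes a kernel with wrong output earns nothing, and a correct one earns its speedup over a baseline, using the median of repeated timings. The name `speed_reward` and all numbers are hypothetical.

```python
from statistics import median


def speed_reward(is_correct: bool, baseline_ms: list[float], candidate_ms: list[float]) -> float:
    """Hypothetical reward: 0 for wrong output, else median speedup over the baseline."""
    if not is_correct:
        # A fast but wrong kernel should not be rewarded at all.
        return 0.0
    base = median(baseline_ms)
    cand = median(candidate_ms)
    if base <= 0 or cand <= 0:
        raise ValueError("timings must be positive")
    # Values above 1.0 mean the candidate is faster than the baseline.
    return base / cand


# Made-up profiler timings in milliseconds (baseline vs. generated kernel).
reward = speed_reward(True, [2.1, 2.0, 1.9], [1.0, 1.1, 0.9])
print(reward)  # 2.0
```

Using the median rather than a single run is a design choice to damp timing noise; a real setup would take these numbers from a GPU profiler.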


StarIaxy retweeted

DAIR.AI

@dair_ai

4 months ago

Agent memory benchmarks are misleading.

Scoring well on memory recall doesn't mean an agent can actually use that memory to take correct actions across sessions.

Models that achieve near-saturated performance on existing long-context memory benchmarks like LoCoMo perform poorly when tested in real agentic scenarios.

This new research introduces MemoryArena, a benchmark designed to evaluate agent memory across interdependent multi-session tasks.

Unlike existing benchmarks that test memorization separately from action or focus on single sessions, MemoryArena uses human-crafted agentic tasks where agents must learn from prior interactions and apply that knowledge to solve subsequent challenges.

Why it matters: as agents handle longer, multi-session workflows, memory isn't just about retrieval. It's about applying the right context at the right time to make good decisions.

Paper: https://t.co/PQpmsZVCvr

Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c


StarIaxy retweeted

ロロたんぬ Λ PGTチャンネルの人 @Lolo_Tannu

5 months ago

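What I say in the video is pretty thin, but I'm leaving here the summary that flashed on screen for a moment. I actually wanted to dig into each of these points one by one, but since they aren't the main topic I cut them down drastically.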


StarIaxy retweeted

トレカエース植田店（ブックエース植田店） @ueda037

5 months ago

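Thank you for taking part in today's official Pokémon Card Gym Battle (Standard)! Participants: 6. Winner: 「べっし」. Deck name: 「ソウブレイズ」 (Ceruledge). Comment: "Digging is the best." Congratulations!! 🎉🎉 #トレカエース植田 #ポケカ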


StarIaxy retweeted

CARD BOX 柳正堂書店イオンタウン山梨中央店 @CARDBOX_RY_aeon

5 months ago

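🎊 This event's winner 🎊 Trainers League open tournament. Participants: 16. Winner 🏆: ミツヤ. Deck name: 「探検家のソウブレイズ」 (Explorer's Ceruledge). Winner's comment: "For the new Ceruledge, I recommend 探検家の先導 (Explorer's Guidance)." Congratulations 🎉 #ポケカ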


StarIaxy retweeted

DAIR.AI

@dair_ai

5 months ago

Many are trying to code with agents to boost velocity.

But at what cost?

The default assumption is that AI coding tools are additive: IDE assistants help, and autonomous agents help more. Stack them together, get more productivity.

But nobody had measured whether this is actually true in production repositories.

This new research presents the first large-scale causal study of autonomous coding agent adoption in open-source projects, analyzing repository-level outcomes across development velocity and software quality.

The methodology: staggered difference-in-differences with matched controls using the AIDev dataset.

Repositories are split into two groups: agent-first (AF), where agents are the first AI tool adopted, and IDE-first (IF), where repositories already used AI IDEs like Copilot or Cursor before adopting agents.

AF repositories see massive front-loaded gains: +36% commits and +77% lines added on average. At adoption month, the spike hits +111% commits and +216% lines added. These gains persist.

But IF repositories see almost nothing: +4% commits and +1% lines added. The short-lived bump at adoption quickly fades, and by month 6, lines added turn negative (-45%).

The quality findings are worse. Regardless of prior AI exposure, agent adoption increases static-analysis warnings by ~18% and cognitive complexity by ~35%. These effects are persistent. AF repositories reach +49% complexity by month 5. IF repositories hit +44-51% and stay there.

Autonomous agents introduce complexity debt even when velocity advantages fade. Teams already using AI IDEs face coordination and integration bottlenecks that limit throughput, but still accumulate the maintainability risks.

Coding agents are powerful but risky accelerators. Substantial velocity gains materialize only when agents are a project's first AI tool. Prior AI IDE exposure moderates the benefits but not the quality risks. Selective deployment and strong quality safeguards are essential.

Paper: https://t.co/6lVAuUPxvh

Learn to build with AI agents in our academy: https://t.co/zQXQt0PMbG
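For readers unfamiliar with difference-in-differences, the core arithmetic is small enough to sketch. The example below is the basic two-group, two-period estimator, not the staggered design with matched controls that the study uses. The function name `did_estimate` and the monthly commit counts are made up for illustration.

```python
from statistics import mean


def did_estimate(treated_pre, treated_post, control_pre, control_post):
    """Basic 2x2 difference-in-differences on group means.

    Effect = (change in the treated group) - (change in the control group).
    """
    treated_change = mean(treated_post) - mean(treated_pre)
    control_change = mean(control_post) - mean(control_pre)
    return treated_change - control_change


# Hypothetical monthly commit counts before and after agent adoption.
treated_pre, treated_post = [10, 12, 14], [18, 20, 22]   # repositories that adopted an agent
control_pre, control_post = [9, 11, 13], [11, 13, 15]    # matched repositories that did not

effect = did_estimate(treated_pre, treated_post, control_pre, control_post)
print(effect)  # 6
```

Subtracting the control group's change removes whatever trend both groups share, which is why the estimate here is 6 extra commits per month and not the raw increase of 8.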


5 months ago

@CONAN_tcg

5 months ago

＼ Conan Card X campaign now running‼️ ／

📌 The PR card 「赤井秀一」 (Shuichi Akai, PR233) will be given to 10 winners drawn by lottery 🎁

🔎 How to enter
1⃣ Follow @CONAN_tcg
2⃣ Quote-post this post with the hashtag 【#コナンカード2月は赤井秀一】 📲❗️

🔻 Campaign details
https://t.co/6Pl7CMqHFb

#名探偵コナン https://t.co/rJQPLZukb1


StarIaxy retweeted

Krullzor @krullzorz

6 months ago

6-0 50 man locals with PY Rosinante :3 Imu 🎲 ✅ Moria 🎲✅ ST29 Luffy 🎲✅ Boa ✅ Boa ✅ Imu ✅ Finally i can play this deck in IRL 😭😭


StarIaxy retweeted

うた。 @uta_1412_tcg

6 months ago

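Detective Masters (探偵マスターズ), mono-Blue
Round 1: Red, going first, ×
Round 2: Green, going second, ⚪︎
Round 3: Red, going second, ⚪︎
Round 4: Green, going first, ⚪︎
Round 5: White, going second, ⚪︎
Round 6: Green, going first, ⚪︎
Round 7: Green, going second, ⚪︎
6-1, 5th place, 1st among Blue 🥇
Thanks to everyone who practiced and tuned decks with me!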


StarIaxy retweeted

ぽて @pote__conantcg

6 months ago

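Detective Masters 2026 (探偵マスターズ2026)
Deck used: mono-Blue Detective Boys (少年探偵団)
Green aggro, going first ⭕️
Green aggro, going first ⭕️
Green aggro, going first ⭕️
Black "Black Impact", going second ⭕️
White Kid, going first ⭕️
Yellow Nagano, going second ⭕️
Green aggro, going second ⭕️
Block H 🥇, Blue partner 🥇
I did it ✌️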


StarIaxy retweeted

ヒカソ @hikasogg

6 months ago

At Detective Masters (探偵マスターズ) I played mono-Black and finished 3rd in Block F!
超越-san was in the same block, and we finished 2nd and 3rd ☺️
I'm really happy that the serious work I've put into the Conan card game has finally paid off.
Thanks also to マス, 荒川, and ディアス-san for tuning decks with me!
I'll write up the deck build on note later. https://t.co/I9Fz1MPjm0


StarIaxy retweeted

@toshiki @toshi3143

6 months ago

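Now that I've made it home safely, once more for the record... I went 5-2 in Block A (1st among Red) 🙌 Next year I'll work to go 6-1 or better 💪
Game 1: Red 揺心, going first 🙆‍♂️
Game 2: Black ブライン, going first 🙅‍♂️
Game 3: Yellow 閉秘, going first 🙆‍♂️
Game 4: Blue SOS, going first 🙆‍♂️
Game 5: mono-White, going second 🙆‍♂️
Game 6: Black ブライン, going first 🙅‍♂️
Game 7: Black ブライン, going second 🙆‍♂️
#探偵マスターズ2026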