Boyannn @ChanKaser - Twitter Profile

ChanKaser retweeted

实践哥MinLi

@MinLiBuilds

11 days ago

https://t.co/gksXMUvqr8

7

308

59

729

217K

ChanKaser retweeted

Suni

@suni_code

10 days ago

Found the Best Resource to learn Harness Engineering. 😭 https://t.co/3eOEmMlfbv

34

2K

307

4K

129K

ChanKaser retweeted

Berryxia.AI

@berryxia

2 months ago

Stanford 这堂 2 小时 AI 系统构建课，直接把 Claude 所有教程和 Prompting Thread 秒了！强烈推荐： “比你刷过的所有 Claude 教程都实用 10 倍” 里面讲的不是 prompt 技巧，而是 Stanford 真正教工程师如何从零构建可靠 AI 系统的完整方法论。周末就刷这一个，绝对是你这周最有生产力的事！我已经将其翻译为中英文双语视频直接戳这里👇

56

7K

1K

14K

594K

ChanKaser retweeted

Amit Shekhar

@amitiitbhu

2 months ago

https://t.co/f8Lxn4gBD8

5

633

113

829

78K

Who to follow

ChanKaser retweeted

3 months ago

A few clarifications on my China post, which I think has been misread as more bearish than intended. China has built the best open-source models in the world at a fraction of the CapEx and with far less chip access than Western labs. That is genuinely extraordinary. The talent level is the highest I've encountered anywhere. I'm confident there will be game-changing entrepreneurs to come out of this ecosystem. We've already committed to one Chinese software-focused fund and are in late stages with two more. We're not bearish — we're trying to invest carefully in a market with real froth. The post was meant as a nuanced take from someone actively looking to deploy capital there, not a dismissal.

12

156

16

59

35K

ChanKaser retweeted

ModelScope

@ModelScope2022

2 months ago

Tencent HY just dropped OmniWeaving: omni-level video gen with reasoning, built on HunyuanVideo-1.5.🚀 🚀 ✅ T2V, I2V, key-frame interpolation, video editing, multi-subject composition (up to 4 reference images, free-form text-image-video inputs) 🎯 Thinking mode: MLLM reasons over user intent before generating ⚡ Hidden States DeepStacking: multi-layer MLLM features (inspired by Qwen3-VL) for richer semantic control 📄 IntelligentVBench: new benchmark for unified video generation, released alongside. SoTA among open-source unified models. 💻 https://t.co/4ZMpdvREFt 📄 https://t.co/MDJ13nFR7T 🎮 Qualitative Examples 👉 https://t.co/GuJwGu7o3R

ModelScope2022's tweet photo. Tencent HY just dropped OmniWeaving: omni-level video gen with reasoning, built on HunyuanVideo-1.5.🚀 🚀

✅ T2V, I2V, key-frame interpolation, video editing, multi-subject composition (up to 4 reference images, free-form text-image-video inputs)
🎯 Thinking mode: MLLM reasons over user intent before generating
⚡ Hidden States DeepStacking: multi-layer MLLM features (inspired by Qwen3-VL) for richer semantic control
📄 IntelligentVBench: new benchmark for unified video generation, released alongside. SoTA among open-source unified models.

💻 https://t.co/4ZMpdvREFt
📄 https://t.co/MDJ13nFR7T
🎮 Qualitative Examples 👉 https://t.co/GuJwGu7o3R

1

72

11

41

6K

ChanKaser retweeted

Ahmad

@TheAhmadOsman

2 months ago

https://t.co/sF6qq5uIXK

43

2K

237

3K

270K

ChanKaser retweeted

Haocheng Xi

@HaochengXiUCB

3 months ago

Really exciting to see KV-cache compression getting attention. A similar bottleneck shows up beyond LLMs: for world models and autoregressive long-video generation, KV cache can quickly dominate memory and limit long-horizon consistency. Our recent work, Quant VideoGen, explores training-free 2-bit KV-cache quantization for video diffusion models, achieving up to 7.0× KV memory reduction with <4% latency overhead. Link: https://t.co/SH6FXXTGxL

HaochengXiUCB's tweet photo. Really exciting to see KV-cache compression getting attention.

A similar bottleneck shows up beyond LLMs: for world models and autoregressive long-video generation, KV cache can quickly dominate memory and limit long-horizon consistency.

Our recent work, Quant VideoGen, explores training-free 2-bit KV-cache quantization for video diffusion models, achieving up to 7.0× KV memory reduction with <4% latency overhead.

Link: https://t.co/SH6FXXTGxL

16

481

67

257

54K

ChanKaser retweeted

Thariq

@trq212

3 months ago

I put a lot of heart into my technical writing, I hope it's useful to you all. 📌 Here's a pinned thread of everything I've written. (much of this will be posted on the Claude blog soon as well)

252

8K

831

15K

1M

ChanKaser retweeted

Dhravya Shah

@DhravyaShah

3 months ago

https://t.co/PII44vkWP7

261

4K

413

8K

3M

ChanKaser retweeted

宝玉

@dotey

3 months ago

小技巧 ~/.claude/settings.json 里面添加 { "attribution": { "commit": "", "pr": "" } } 就可以默认不加 co-author https://t.co/08j5zFEOVL

35

1K

93

1K

184K

ChanKaser retweeted

日常焦虑帝

@gpuhell

3 months ago

苏神博客上的配图比论文里的清楚多了... 连线有颜色区分，一看就知道，论文里的单色混在一起。 https://t.co/AWdhWlKRcA

1

140

21

112

17K

ChanKaser retweeted

Thariq

@trq212

3 months ago

We just released Claude Code channels, which allows you to control your Claude Code session through select MCPs, starting with Telegram and Discord. Use this to message Claude Code directly from your phone.

2K

26K

2K

18K

8M

Boyannn @ChanKaser

3 months ago

@aliez_ren 可以试试, https://t.co/OL2EnKNKZK 对nvfp4支持应该是最好的

1

0

413

ChanKaser retweeted

Ai2 @allen_ai

3 months ago

What's in the release: 🔹 Pretraining & fine-tuning scripts (SFT + long-context SFT) 🔹 Multi-node distributed training 🔹 Data download, preprocessing, & visualization utilities 🔹 Single-task & multi-eval scripts with caching Built for reproducibility & new experiments.

3

339

49

428

125K

ChanKaser retweeted

Thariq

@trq212

4 months ago

a few Friday afternoon ships to end the week: the AskUserQuestion tool can now show markdown snippets to display diagrams, code examples, etc.

181

5K

162

1K

487K

ChanKaser retweeted

The AI Timeline

@TheAITimeline

4 months ago

🚨This week's top AI/ML research papers: - GLM-5 - Experiential Reinforcement Learning - Image Generation with a Sphere Encoder - World Action Models are Zero-shot Policies - Unified Latents - Fast KV Compaction via Attention Matching - Adam Improves Muon - LUCID - The Molecular Structure of Thought - Arcee Trinity Large Technical Report read this in thread mode for the best experience

4

240

27

173

17K

ChanKaser retweeted

Alex Wa @_djdumpling

4 months ago

new blog! What methodologies do labs use to train frontier models? The blog distills 7 open-weight model reports from frontier labs, covering architecture, stability, optimizers, data curation, pre/mid/post-training + RL, and behaviors/safety https://t.co/88heRH4TcO

_djdumpling's tweet photo. new blog! What methodologies do labs use to train frontier models?

The blog distills 7 open-weight model reports from frontier labs, covering architecture, stability, optimizers, data curation, pre/mid/post-training + RL, and behaviors/safety

https://t.co/88heRH4TcO https://t.co/faaWrLQr4g

34

2K

290

3K

290K

ChanKaser retweeted

Deedy

@deedydas

4 months ago

The Ultimate List of Artificial Intelligence "Neolabs". A Neolab is a pre-revenue scale startup working on long-term AI breakthroughs. Here's all 50 of them.

deedydas's tweet photo. The Ultimate List of Artificial Intelligence "Neolabs".

A Neolab is a pre-revenue scale startup working on long-term AI breakthroughs.

Here's all 50 of them. https://t.co/IhNdkxTE67

93

2K

139

2K

268K

ChanKaser retweeted

Pedro

@sillydarket

4 months ago

https://t.co/epmvq8bsBQ

72

871

91

3K

336K

Boyannn

@ChanKaser

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users