Sam Davis @samgd - Twitter Profile

samgd retweeted

8 days ago

Introducing the newest Coral board, for efficient, on-device AI! Check out the demos in the video: - On-board speech translation - Natural language controlling hardware - Vision & sound generating music

180

8K

746

4K

1M

samgd retweeted

Greg Brockman

@gdb

8 days ago

Codex for transcribing and answering questions about a meeting in real time:

85

978

60

614

152K

samgd retweeted

ClaudeDevs

@ClaudeDevs

7 days ago

Opus 4.8 is live in Claude Code today. A few things worth knowing: 🧵

378

10K

864

2K

1M

samgd retweeted

hardmaru

@hardmaru

8 days ago

For over a decade, we’ve accepted that end-to-end backprop is the only way to train deep networks. But holding the entire network in memory all at once is why AI training is hitting a resource wall. We found a new way to break the network into blocks and train them independently. The trick? Treating the network’s forward pass like a diffusion model denoising a signal. This reinterpretation slashes the memory needed to train deep models. In our #ICLR2026 paper (https://t.co/PK5h0mqQSo), we matched end-to-end performance across ViTs, DiTs, and LLMs. We did this while training just one isolated block at a time.

148

6K

650

4K

728K

Who to follow

Fiveat

@CriptoFiveat

💻Te enseño a programar #Solidity y #Hashgraph con su #SDK en YouTube. Minero de $BTC

Suzanna Sia

@suzyahyah

10 years of ML, 30 years of existential crisis 1st Author @ NeurIPS,AAAI,EMNLPx2,NAACL,EACL. ML/AI-ed in Defence,Tech,Manufacturing,Finance

nftflair.eth 🍌🧪👾

@nftflair

I trade crypto except memecoins. $CDX $AXB $BRAIN

Sam Davis @samgd

14 days ago

"works but is fugly" - thank you for this option, Claude

0

15

samgd retweeted

OpenAI

@OpenAI

28 days ago

Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents. Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold. Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.

694

15K

1K

5K

4M

samgd retweeted

OpenRouter

@OpenRouter

28 days ago

1/ Audio is now first-class on OpenRouter. Two new endpoints live today: 📢 /api/v1/audio/speech — text-to-speech (TTS) 🎤 /api/v1/audio/transcriptions — speech-to-text (SST) Same routing, billing, and keys you already use for text, image, and video.

OpenRouter's tweet photo. 1/ Audio is now first-class on OpenRouter.

Two new endpoints live today:

📢 /api/v1/audio/speech — text-to-speech (TTS)
🎤 /api/v1/audio/transcriptions — speech-to-text (SST)

Same routing, billing, and keys you already use for text, image, and video. https://t.co/6uHeEUuDl5

12

414

34

171

29K

samgd retweeted

steven

@Tu7uruu

29 days ago

Big announcement for speech AI Benchmarks get gamed. So we added a repellent. The Open ASR Leaderboard now includes private evaluation data from Appen and DataoceanAI, making speech recognition benchmarks more robust against test-set contamination and “benchmaxxing.” Better signal. Less overfitting. More real-world ASR.

Tu7uruu's tweet photo. Big announcement for speech AI

Benchmarks get gamed. So we added a repellent.

The Open ASR Leaderboard now includes private evaluation data from Appen and DataoceanAI, making speech recognition benchmarks more robust against test-set contamination and “benchmaxxing.”

Better signal. Less overfitting. More real-world ASR.

7

114

17

60

12K

samgd retweeted

Sam Altman

@sama

about 1 month ago

pretty excited for voice models to get great its interesting to watch how people are already starting to change the way they interface with AI

927

6K

239

418

659K

Sam Davis @samgd

about 1 month ago

@acarroll_ATG Great to see AI systems being used to make meaningful improvements! Do you plan on writing up more of the details? What worked/failed etc? Just fyi in the post: "but did not fully leverage all available signals. not fully leverage all available signal."

0

23

samgd retweeted

Will Bui

@will_ea

about 1 month ago

27x faster Attention Residuals!!! 🚀 We implemented Block AttnRes as a pip-installable package. !pip install flash-attn-res No annoying kernel nonsense. No compile/autograd plumbing. Call it like a regular PyTorch op. It just works. Methodology: 🔹 fused triton kernels 🔹 batched attention over residual blocks 🔹 online-softmax merge 🔹 flash attention-style split-KV reduction Thanks @LLMenjoyer and @cartesia for the support and guidance✌️

will_ea's tweet photo. 27x faster Attention Residuals!!! 🚀

We implemented Block AttnRes as a pip-installable package.

!pip install flash-attn-res

No annoying kernel nonsense.
No compile/autograd plumbing.
Call it like a regular PyTorch op.

It just works.

Methodology:
🔹 fused triton kernels
🔹 batched attention over residual blocks
🔹 online-softmax merge
🔹 flash attention-style split-KV reduction

Thanks @LLMenjoyer and @cartesia for the support and guidance✌️

22

772

83

570

75K

Sam Davis @samgd

about 1 month ago

🫡

samgd's tweet photo. 🫡 https://t.co/4L4DQuA27U

0

25

samgd retweeted

OpenAI

@OpenAI

about 1 month ago

We’re talking about Goblins. https://t.co/dqmcLGCW71

531

8K

846

2K

2M

samgd retweeted

DeepSeek

@deepseek_ai

about 1 month ago

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: https://t.co/drlDrxkYtp 🤗 Open Weights: https://t.co/T13Y8i7SDM 1/n

deepseek_ai's tweet photo. 🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.

🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.

Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today!

📄 Tech Report: https://t.co/drlDrxkYtp
🤗 Open Weights: https://t.co/T13Y8i7SDM

1/n

2K

45K

8K

10K

10M

samgd retweeted

Sam Altman

@sama

about 1 month ago

1. We believe in iterative deployment; although GPT-5.5 is already a smart model, we expect rapid improvements. Iterative deployment is a big part of our safety strategy; we believe the world will be best equipped to win at the team sport of AI resilience this way. 2. We believe in democratization. We want people to be able to use lots of AI; we aim to have the most efficient models, the most efficient inference stack, and the most compute. We want our users to have access to the best technology and for everyone to have equal opportunity. We have been tracking cybersecurity as a preparedness category for a long time, and have built mitigations we believe in that enable us to make capable models broadly available. 3. We love you and we want you to win. We want to be a platform for every company, scientist, entrepreneur, and person. (My whole career has largely been about the magic of startups, and I think we are about to see that magic at hyperscale.)

732

9K

503

814

550K

samgd retweeted

ClaudeDevs

@ClaudeDevs

about 1 month ago

Over the past month, some of you reported Claude Code's quality had slipped. We investigated, and published a post-mortem on the three issues we found. All are fixed in v2.1.116+ and we’ve reset usage limits for all subscribers.

2K

40K

3K

6K

6M

samgd retweeted

Daniel Wortel-London @dlondonwortel

about 2 months ago

It’s not just new, it’s newspeak

264

21K

2K

4M

samgd retweeted

Boris Cherny

@bcherny

about 2 months ago

Opus 4.7 uses more thinking tokens, so we've increased rate limits for all subscribers to make up for it. Enjoy!

1K

22K

925

1K

1M

samgd retweeted

Alexandr Wang

@alexandr_wang

about 2 months ago

1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵