Marc @dopsnacky - Twitter Profile

about 16 hours ago

Wan Streamer is a real-time interactive model that listens, sees, speaks, and replies with synchronized video. It runs at 25 fps with ~200 ms latency, making audio-video agents feel closer to live conversation. https://t.co/MDqfe2bwe2

5

125

18

101

5K

dopsnacky retweeted

Alexander Goslin

@xandurglar

about 17 hours ago

Introducing InfiniteDiffusion, my independent paper accepted to #SIGGRAPH2026! I have one RTX 3090 Ti. No funding, advisors, or team. By day I'm a new grad SWE at Walmart. The paper has two main contributions: - InfiniteDiffusion: a new approach to infinite generation with diffusion models. - Terrain Diffusion: the world’s first learned procedural terrain generator. Here’s why this matters, and how they are connected. 🧵

125

4K

419

3K

494K

dopsnacky retweeted

Reid Hannaford

@reidhannaford

3 days ago

I'm blown away. This AI filmmaking workflow for precise camera control, multiple characters, and dialogue is insane: 1. Generate a start frame in Midjourney 2. Match the poses in Blender, animate the camera 3. Feed both to Seedance I didn't think this would work. Two consistent characters, solid performances, the move tracked perfectly through the entire beat. Even the soup looks great.

108

3K

364

3K

194K

dopsnacky retweeted

Meng To

@MengTo

22 days ago

I recorded a 22-min tutorial on how to avoid AI slop for your landing pages

31

2K

85

2K

126K

Who to follow

Fujiki

@FujikiAyato

色々な沼(YOI、タイバニ、SD、金カム等々)にどぼどぼハマりまくって、全身泥まみれの昭和産(20↑どころか○暦近い)ババア垢。なお出会い系・資産運用系垢は即ブロしますので、そこんとこよろしく。

here for the AI Community | Retweets & Likes are memos for me

dopsnacky retweeted

Shruti

@ShruPosts

2 days ago

built this using antigravity + chatgpt + figma in ~90 mins 👀 (idea → moodboard → images → design → live)

151

4K

155

4K

371K

dopsnacky retweeted

Meng To

@MengTo

18 days ago

Insane to see the iOS simulator in Codex. iOS dev is about to get way easier with this plugin.

49

2K

86

2K

149K

dopsnacky retweeted

Alibaba Cloud

@alibaba_cloud

4 days ago

🚀 Introducing HappyHorse 1.1 — now officially live on Alibaba Cloud Model Studio! All HappyHorse 1.1 capabilities are available via API, providing enterprise customers and developers with a complete integration solution. This release delivers production-ready video synthesis systematically optimized across core content generation scenarios. 🔥 Launch Promotion: Enjoy a 40% OFF sitewide discount for the first 2 weeks! Optimize your integration costs today.

35

581

87

352

137K

dopsnacky retweeted

Cline

@cline

3 days ago

We've kept hearing how GLM-5.2 beats Opus 4.8, and are skeptical of benchmarks - so we tested them on a real bug from the Cline repo. While both models fixed the issue, GLM was the winner in terms of cost and code quality: - GLM used twice as many tokens (GLM 1.1m vs Opus 660K) but cost half as much (GLM $0.41 vs Opus $0.81) - Opus finished quicker - 1.6 min and 12 tool calls vs GLM 4.7 min and 28 tool calls - GLM cleaned up dead code and verified the build compiled before completing. Opus didn't - it left type errors that passed tests but broke the production build. Both runs used the same Cline harness prompting and tools, so it seems GLM is RL trained to spend more tokens verifying its work before completing. Impressive work by the @Zai_org team!

cline's tweet photo. We've kept hearing how GLM-5.2 beats Opus 4.8, and are skeptical of benchmarks - so we tested them on a real bug from the Cline repo. While both models fixed the issue, GLM was the winner in terms of cost and code quality:

- GLM used twice as many tokens (GLM 1.1m vs Opus 660K) but cost half as much (GLM $0.41 vs Opus $0.81)

- Opus finished quicker - 1.6 min and 12 tool calls vs GLM 4.7 min and 28 tool calls

- GLM cleaned up dead code and verified the build compiled before completing. Opus didn't - it left type errors that passed tests but broke the production build.

Both runs used the same Cline harness prompting and tools, so it seems GLM is RL trained to spend more tokens verifying its work before completing. Impressive work by the @Zai_org team!

221

8K

612

2K

873K

dopsnacky retweeted

Sakana AI

@SakanaAILabs

4 days ago

Introducing Sakana Fugu: A full multi-agent orchestration system accessible via a single model API. Our ‘Fugu Ultra’ model matches the performance of Fable and Mythos, delivering frontier capability without the risk of export controls. Try it: https://t.co/hhO6qTawgb 🐡

1K

38K

6K

31K

26M

dopsnacky retweeted

Kimi.ai @Kimi_Moonshot

14 days ago

🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced! 🔷 Improved coding & agent performance over K2.6: +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite. 🔷 Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6. 🔷 Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates. ⚡️ 6x High-Speed Mode coming soon! 🔌 Available today via Kimi API and Kimi Code. 🔗 Kimi Code: https://t.co/uvoSJKyGCY 🔗 API: https://t.co/EOZkbOwCN4

Kimi_Moonshot's tweet photo. 🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced!

🔷 Improved coding & agent performance over K2.6: +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite.
🔷 Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6.
🔷 Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates.

⚡️ 6x High-Speed Mode coming soon!
🔌 Available today via Kimi API and Kimi Code.

🔗 Kimi Code: https://t.co/uvoSJKyGCY
🔗 API: https://t.co/EOZkbOwCN4

644

14K

2K

3K

3M

dopsnacky retweeted

Google Research

@GoogleResearch

14 days ago

🚀 Introducing Gemini-SQL2, our breakthrough text-to-SQL capability powered by Gemini 3.1 Pro! We've achieved state-of-the-art results on the highly competitive BIRD benchmark, translating natural language into execution-ready SQL queries. 🧵👇

GoogleResearch's tweet photo. 🚀 Introducing Gemini-SQL2, our breakthrough text-to-SQL capability powered by Gemini 3.1 Pro! We've achieved state-of-the-art results on the highly competitive BIRD benchmark, translating natural language into execution-ready SQL queries. 🧵👇 https://t.co/HfO2ZW2pih

132

7K

631

3K

681K

dopsnacky retweeted

Google DeepMind @GoogleDeepMind

15 days ago

We’re teaming up @Palmeiras, the first football club to meaningfully build upon TacticAI: our AI system that can help simulate field scenarios and predict open play dynamics up to 8 seconds in advance. ⚽

117

3K

378

2K

1M

dopsnacky retweeted

OAK

@_OAK200

21 days ago

Here’s a system prompt you can use inside a ChatGPT or Claude project. The main idea is simple: You feed it a basic idea, a rough image prompt, a scene description, or even an uploaded image, and it enhance it and return with 10 cinematic prompts, each exploring a different composition, camera angle. For example, I used: “a gladiator riding a horse on a mountainside” And these are the results! It's a great way to explore different visual languages, discover interesting compositions. System prompt below. 👇

_OAK200's tweet photo. Here’s a system prompt you can use inside a ChatGPT or Claude project.

The main idea is simple:

You feed it a basic idea, a rough image prompt, a scene description, or even an uploaded image, and it enhance it and return with 10 cinematic prompts, each exploring a different composition, camera angle.

For example, I used:
“a gladiator riding a horse on a mountainside”

And these are the results!
It's a great way to explore different visual languages, discover interesting compositions.

System prompt below. 👇

52

821

59

1K

122K

dopsnacky retweeted

Reve @reve

23 days ago

Today, we’re launching Reve 2.0, the best 4K image model in the world. We invented a new way to generate and edit any image using precise layouts. For the first time, it’s possible to create images you can touch.

273

5K

491

5K

12M

dopsnacky retweeted

Google Gemma

@googlegemma

23 days ago

Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇

googlegemma's tweet photo. Meet Gemma 4 12B!

A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.

Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇 https://t.co/gf4FZv0WZb

402

12K

2K

5K

3M

dopsnacky retweeted

OpenAI

@OpenAI

24 days ago

Building apps has never been easier. With Sites, Codex can turn your work, ideas, and plans into an interactive website or app your team can explore, use, and share with a URL. Rolling out to Business and Enterprise plans, before expanding more broadly.

971

20K

2K

10K

10M

dopsnacky retweeted

Microsoft AI

@MicrosoftAI

24 days ago

Seven new models launching at Build: let’s go! Reasoning. Code. Image. Transcribe. Voice. Built from scratch on a clean data lineage, designed for efficiency, working seamlessly as a family of models Thread 🧵 #MSBuild

MicrosoftAI's tweet photo. Seven new models launching at Build: let’s go!
Reasoning. Code. Image. Transcribe. Voice.

Built from scratch on a clean data lineage, designed for efficiency, working seamlessly as a family of models

Thread 🧵
#MSBuild https://t.co/g3WQIcIQ24

139

3K

525

1K

393K

dopsnacky retweeted

Heavy Pulp

@heavypulp

about 2 months ago

The Call - A Heavy Pulp Original Made with Grok Imagine

88

824

97

231

281K

dopsnacky retweeted

Keshigeyan Chandrasegaran

@keshigeyan

28 days ago

1/ Introducing GPIC: a Giant Permissive Image Corpus and benchmark for visual generation! 🚀100M VLM-captioned image-text pairs for training 📊1M image-text pairs for benchmarking 🖼️~28 trillion pixels 🤗Centrally Hosted ✅Fully permissive for research + commercial use Dataset, benchmark and models🧵👇 Co-led with @KyleSargentAI

keshigeyan's tweet photo. 1/ Introducing GPIC: a Giant Permissive Image Corpus and benchmark for visual generation!

🚀100M VLM-captioned image-text pairs for training
📊1M image-text pairs for benchmarking
🖼️~28 trillion pixels
🤗Centrally Hosted
✅Fully permissive for research + commercial use

Dataset, benchmark and models🧵👇

Co-led with @KyleSargentAI

15

372

84

231

147K

dopsnacky retweeted

Y Combinator

@ycombinator

28 days ago

It’s never been easier to design your dream house. Draw a shape. Define your rooms. Set your constraints. @DraftedAI generates complete floor plans, elevations, and 3D home designs in seconds. Over the last month, 120,000 people generated 325,000+ home designs with https://t.co/XqC0LP5n3y.

189

4K

338

5K

745K

Marc

@dopsnacky

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users