Researchers found a way to make LLMs 8.5x faster!
(without compromising accuracy)
Speculative decoding is quite an effective way to address the single-token bottleneck in traditional LLM inference.
A small "draft" model first generates the next several tokens, then the large model verifies all of them at once in a single forward pass.
If a token at any position is wrong, you keep everything before it and restart from there. This never does worse than normal decoding.
But current drafters in Speculative decoding still guess one token at a time. That makes the drafting step itself a bottleneck, capping real-world speedups at 2-3x.
DFlash is a new technique that swaps the autoregressive drafter with a lightweight block diffusion model that guesses all tokens in one parallel shot.
Drafting cost stays flat no matter how many tokens you speculate.
On top of that, the drafter is conditioned on hidden features pulled from multiple layers of the target model and injected into every draft layer, so it makes significantly better guesses than a drafter working from scratch.
In the side-by-side demo below, vanilla decoding runs at 48.5 tokens/sec. DFlash hits 415 tokens/sec on the same model, with zero quality loss.
It's already integrated with vLLM, SGLang, and Transformers, with draft models on HuggingFace for several models like Qwen3, Qwen3.5, Llama 3.1, Kimi-K2.5, gpt-oss, and many more.
I have shared the GitHub repo in the replies!
KV caching is another must-know technique to boost LLM inference. I recently wrote an article about it. Read it below.
👉 Over to you: What use case are you working on that can benefit from this new technique?
I'm creating a game that combines elements of TRPGs, visual novels, and roguelite JRPGs.
I am also making progress on the English translation.
ときメモ&TRPG風ローグライトRPG作っています。
#screenshotsaturday
Our Undead PFP collection will be haunting Opensea in a couple days (October 10th)
to celebrate we're giving away free mint GTD whitelist spots, to enter:👇
☠️like/RT
☠️Tag 2 friends
☠️Follow @DippiesNFT & @TaylorMight_
🚀 Aiden Labs: The AI Superapp
Chat • Create • Build — All in One Place
Access 500+ AI models like ChatGPT, Claude, Gemini, Deepseek and more.
Generate images, videos, research insights, and even invest—seamlessly.
💡 Why Aiden?
One subscription, unlimited possibilities
• Freemium → Lite ($5) → Plus ($20) → Team ($15/seat)
• Bonus 10% when topping up with $ADN tokens
🌏 Building the future of AI + Blockchain in Southeast Asia.
👉 Try it now: https://t.co/SNe7aZ8Vrm
🔎 Meet Lunar — your new AI companion, built to think, guide, and simplify.
Too many AI tools to choose from❓
Let Lunar Auto Pick 🌙 do the thinking — it matches your prompt to the perfect AI model.
Need someone to guide your workflow❓
That’s Lunar Mind 🌙 — the assistant behind the scenes.
And me❓
I’m Lunar 🌙, the face you’ll see — smart, helpful, and always ready to work with you.
Whether you're creating, coding, writing, or just trying to get started — I’m here to make sure AI doesn’t feel overwhelming anymore.
Aiden AI Platform is FREE for everyone
Access 500+ AI models today👉 https://t.co/uRctXmz0Ii
Aiden Labs Feature Update — Sprint Recap #5 July 11 - July 24, 2025 🛠️
🎯 Sprint Goal: Developer Coupon Feature
What We Completed
✅ Successfully deployed the Coupon feature to production 🎉
What’s Next (In Progress)
⚙️ Designing the image generation system
⚙️ Developing the image generation functionality
Stay tuned — we’re building the future of AI tools step by step. Next up? Image generation made simple and smart 🧠✨
Aiden Labs is hosting the hottest Web3 night in Thailand 🇹🇭🔥
Register now: https://t.co/Ix4Q24Ojvn
Merch madness is real. Founders, devs & KOLs are gathering. Got a project? You can pitch. You just need to show up.
📍 Bangkok | July 1 | 6–9 PM
🤝 Meet the builders shaping SEA's decentralized future.
🌎 Connect with @Aiden_Labs , @trondao , @apacdao , @thebinaryhldgs , and Web3 Meetup Thailand.
Read more: https://t.co/N2nrEFllbD
Did you feel the energy at Episode 1? 🔥
Because what started as a meetup is quickly turning into a movement. BASED SEA Builders Meetup is back — and this time, the momentum is even stronger.
Co-hosted by @Aiden_Labs@base@apacdao@thebinaryhldgs , Episode 2 brings builders, founders, and VCs together in Bangkok once again for a night of real pitches, real feedback, and real collaboration.
If you missed the last one — don’t miss this.
📅 June 17 | 6 - 9 PM (GMT+7)
📍 The Binary Holdings, Bangkok
🎟 Join us: https://t.co/AJKALpsGdN
📖 Read more: https://t.co/ktMalFG3R0
Aiden Labs is helping lead this movement and if you're building in SEA, this is where you belong.
EigenLayer 101: The Verifiable Cloud & Its Impact on the World
Bangkok, are you ready for the future of Web3 infra?
🗓 June 10 | 🕕 6–9 PM
📍Binary Holdings, Thonglor
🎟 Register here https://t.co/Fw6PerXRAp
Discover how restaking & AVSs are reshaping the cloud.
Meet builders, devs, and thinkers driving the next wave of programmable trust.
Co-hosted by @eigencloud , @Aiden_Labs, @apacdao & Web3 Meetup TH. 🚀
$ADN Staking is Officially Live! 🚨
Ready to stake? 👉 https://t.co/wkPnRHo1Pz
Maximize your rewards. Power the future of Web3 onboarding with @Aiden_Labs
✅ Up to 45% APR
✅ Boost with M.Aiden NFT
✅ Flexible 60, 180, and 360-day pools
✅ Support Aiden Labs’ mission to bring 1B+ Web2 users into Web3
🔮 And this is just Season 1.
Season 2 pools are coming soon…
🔗 Read more: https://t.co/wp4hK9wPxi
Google now lets you build, manage, evaluate, and deploy multi-agent systems!
Here are 10 of the best projects built with Google’s Agent Development Kit (ADK) 🧵
(save for later)
🎉 BIG NEWS: $1 FEE REFUND! 🎉
We heard you! Despite our losses, we're giving back your $1 verification fee!
How to claim:
Copy your BEP 20 wallet address that you connected in Aidenlabs
Comment it below this post 👇🏻
Note : User must be following our socials
Twitter : @aiden_labs
Telegram : https://t.co/kZKCLV465Z (MANDATORY)
Our team will process refunds as fast as they can.
❗ Verification pending? Fee removal coming soon!
❗ No cheating or you'll lose eligibility
We got scammed but we're bouncing back with huge plans.
Stay with us—our community is EVERYTHING! 💪
#AidenLabs #Refund #Crypto
🚀 $ADN isn't just trending — it's becoming a phenomenon!
Top Gainer today on @MEXC_Listings is just the beginning.
📊 Still doubting? Just check the chart 📈
💥 It’s still early for the community to jump in!
👉 Let’s ride the wave: https://t.co/8t737fn5Op