Alquilando la rtx 5090 en runpod la ecuacion te da menos de USD 0.01 el millon de tokens
BTW, Google te da 14k requests gratis por dia de todos los gemma4, pero es para hacer un costo real
Si quisieras hacerte tu propio plan "claude max" nivel sonnet 4.5 al precio que vale ese computo hoy, te daria un costo de unos USD 2/mes con toda la furia, sin depender de una empresa china ni nada de eso, algo que hace solo 3 meses lo pagaban 200usd a una empresa que quema 30B/yr para dar ese servicio a ese precio, ese es el nivel de locura en el que estamos
No tiene sentido que algo sea tan ineficiente, quiero decir, si, hay nichos en los que justificará, pero son los menos, tarde o temprano la necesidad de eficiencia en costos va a acomodar todo esto que hoy esta roto
From initial testing inference engines I’ve learned:
Diminishing returns, if you have qwen 27b running at 28+ t/s on a 3090 that’s probably about as good as it will get (without mtp etc.).
Unsloth is so good gguf is probably more worth it over a couple t/s more.
Unless you need concurrency vLLM/SGlang probably aren’t worth the hassle.
Still going to finish sglang testing just initial vibes.
🧠For Qwen3-Next’s Day 0 support in SGLang, one tricky part was enabling spec decoding with the Hybrid Linear Model—since SSM & conv caches only store the last position (unlike KV cache).
🚀After tons of effort with @qingquan_song, we achieved >2× speedup!
Benchmarks below
I can't go back to the regular YouTube UI after this 😅
Obsidian Reader now makes the transcript interactive so you can scrub, highlight, auto-scroll. It feels so nice.
This video took me 30 minutes to make
It has 3.7M views, 427.7k likes, 152.2k saves & 1.6k comments on TikTok
It makes my app $2,000+ every month still after 6 months
I spent $0 BTW
Stop overcomplicating it, you can literally steal this format if you want.
$10k/month from single ai character...
here's how our creators are making full-time money with brand campaigns:
> create your ai character
> grow the page to 100+ followers
> go to @affiliatenw
> join any public campaign
> post tiktoks
> earn $1–$3 per 1,000 views
> automate everything with claude
> create similar accounts
> scale to $10k–$40k a month
this article has all the sauce, good read:
Go to Reddit → r/AskReddit
Find old people sharing life advice.
Turn the stories into scripts with Claude.
Add voice using ElevenLabs.
Animate the video with TubeGen.
Edit everything in Canva.
Post the video on YouTube.
Get views.
Earn money every month. 💰📈
I have made $200,000 from a single Youtube channel, with a secret method called modelling.
Here's a full guide on modelling and how to do it with an example niche 👇
A Chinese student got $4M for an AI that simulates how thousands of people react to any event
Problem: it was Chinese-only and required cloud APIs.
I made it fully local + English
Here's what MiroFish-Offline does and why it matters 🧵
Today, we’re introducing Pomelli’s latest feature update, ‘Photoshoot’
With Photoshoot, you can start from a single image of your product and easily create high quality, customized product shots to elevate your marketing.
Available free of charge in the US, Canada, Australia & New Zealand! Get started with Pomelli today at https://t.co/SbeT00ToNx