Some findings from across our Varick customers that might shape how you think about AI adoption going forward:
1. Customers are getting wiser about spend. A few months ago, most were willing to spend an unlimited amount on tokens from OpenAI and Anthropic. Today they're asking us to diligence their AI spend and to match the right model to the right work. That means more work for us and meaningful savings for them. They want to know AI is actually cheaper than just adding headcount, and they want the math to back it up.
2. Customers are accepting that this isn't instant. A few months ago, customers expected to become AI-native over the course of a week and a few software adoptions. Now they're accepting the reality: becoming AI-native means rethinking the architecture of your entire company. I don't mean that in a corny sense. It literally means changing the org charts, the work, and the handoffs in every crevice of the company.
3. Customers are done with the big shops. Microsoft, IBM, McKinsey, Deloitte, and the rest are all pitching AI transformations, and our customers are fed up with paying eight figures for a slide deck. This is the whole reason we have a business: we sit at the bleeding edge of AI while having the business sense to identify the root of a problem and then build the agents to solve it. This used to be something I had to convince customers of before the first sales call. Now they're the ones telling me, "we're never working with McKinsey again."
This past month we had the highest inbound volume we've ever seen. Between April 1 and May 1, 56 companies doing between $500M and $25B in revenue reached out to us. Some of these companies are direct competitors with one another. It's fascinating to watch the race for enterprise AI-nativity unfold in real time. I like to imagine that we change the course of history for our clients.
AI transformations are now the consensus path to realizing AI ROI. AI SaaS doesn't get you there, and giving every employee a Claude or Cowork subscription doesn't move the needle either. The only way to get to hundreds of millions in annual ROI is to combine a team that can learn your business processes on the ground with the engineering capacity to automate every manual process worth automating.
We bet on this thesis a year ago, and we're vindicated more every single day. If you're interested in transforming your company with AI, or interested in joining the company that's leading the AI transformation wave, visit our website.
Cheers.
🚀 Want FREE models you can plug into OpenClaw or Hermes?
Here are 9 resources you can use for free access to model APIs
No local setup, no credit card, just pure cloud APIs with OpenAI-compatible endpoints
You can’t get free Opus quality (yet) but all of these have genuine free tiers right now (rate limits may apply) and are good enough to get started if you don’t want to spend $ to get started with agents
1️⃣ OpenRouter Free Models
(Gemma 4 31B/26B, NVIDIA Nemotron 3 Super 120B MoE, MiniMax M2.5, Qwen3 variants, Llama 4/3.3, gpt-oss-120B, Arcee Trinity, etc.) • ~29 completely free $0/M token models • Insane variety + top-tier open model evals (especially coding & agents) • Best for rotating models automatically
👉 Sign up: https://t.co/lF3pCq2JQi
2️⃣ Google Gemini API
(Gemini 2.5 Pro / Flash series) • Strongest overall free frontier model • Excellent multimodal, 1M+ context, native tool calling & agentic performance • Very generous free limits (often 5–15 RPM) 👉 Sign up: https://t.co/SD0lce4POW
3️⃣ NVIDIA
(Nemotron variants, Llama 3.3 70B, Qwen3 235B, Mistral Large, etc.) • Optimized high-performance open models • Free prototyping tier (~40 RPM) 👉 Sign up: https://t.co/NYge234QBv
4️⃣ Grok Cloud
(Llama 4 Scout, Llama 3.3 70B, Qwen3 32B, gpt-oss models, etc.) • Blazing-fast inference (hundreds of tokens/sec) • Perfect for real-time agents • Strong open-model performance with solid free tier 👉 Sign up: https://t.co/nWzQ3v2h9o
5️⃣ Cerebras Cloud
(Qwen3 235B, Llama 3.3 70B, DeepSeek variants, etc.) • Massive models with excellent reasoning/coding evals • Very generous daily free limits (~30 RPM, up to 1M+ tokens/day on some) 👉 Sign up: https://t.co/R7ZVOhH02N
6️⃣ Mistral La Plateforme
(Mistral Large 3, Small 3.1, Ministral 8B, etc.) • Strong in coding, multilingual & agentic tasks • Solid free tier (~1 req/s, ~1B tokens/month) 👉 Sign up: https://t.co/WgUK1oVHjL
7️⃣ Cohere
(Command A, Command R+, Aya Expanse 32B, etc.) • Free tier: 20 RPM, 1K requests/month 👉 Sign up: https://t.co/u55lZewHXf
8️⃣ GitHub Models
(Llama 3.3 70B, DeepSeek R1, some GPT-4o previews, etc.) • Decent mid-tier evals with easy GitHub integration • Free tier limits (10–15 RPM) 👉 Sign up: https://t.co/4zUHmjTtrA
9️⃣ Cloudflare Workers AI
(Llama 3.3 70B, Qwen QwQ 32B, etc.) • Lightweight but solid for simple agents • Free tier: 10K neurons/day 👉 Sign up: https://t.co/0GME3k1AQr
Pro tips for agent builders:
• Most work instantly with OpenAI SDK (just change base URL + your key)
• Start with OpenRouter for quality/variety (they often feature new free models)
• Add Groq as speed fallback
• Rotate providers when you hit caps
Free intelligence for your agent is just a signup away!
Why is no one talking about this?
@nvidia is offering around 80 AI models via hosted APIs absolutely for free.
You get access to MiniMax M2.7, GLM 5.1, Kimi 2.5, DeepSeek 3.2, GPT-OSS-120B, Sarvam-M etc.
This plugs straight into OpenClaude, OpenCode, Zed IDE, Hermes agent and even with Cursor IDE.
Setup:
– Grab API key: https://t.co/Wfdclm0hY2
– base_url = "https://t.co/VOGC10LmGP"
– api_key = "$NVIDIA_API_KEY"
– select model (e.g. minimaxai/minimax-m2.7)
If you’re building or experimenting, this is basically free inference.
Lock in and start building today anon.
Thank me later.
@ryandepauloo Won my first tournament last night. At one point, I got moved to table 6 seat 7. Called out the d-gen seated to my right for acting a lot like you. It tilted him. So, I think I owe you some money.
sam just so you know for next time this is technically incorrect usage of the “-maxx” suffix
to “looksmaxx” is to improve oneself, it’s not really a comparison between two different things (“-mogged” is a comparison however). so the correct usage here would’ve been something like:
“we still get looksmogged on frontend…”
or
“we still need to looksmaxx our frontend…”
Claude is not allowed to write outside the workspace.
But it wanted to.
So Claude wrote a python script and executed it via bash to modify the file essentially hacking my permissions.
Silicon Valley proving once again that they have massive problems communicating with the rest of the world in a way that's understandable and generates positive impact on their businesses:
Getting lots of questions on why the landing page / docs were updated if only 2% of new signups were affected.
This was understandably confusing for the 98% of folks not part of the experiment, and we've reverted both the landing page and docs changes.
@TheAmolAvasare Why are you guys having such a hard time communicating? You have the world's strongest AI model. Maybe ask it how people would feel if you communicate like this. It seems like a no-brainer, right?
Anthropic generated enormous attention for their “we’re the good guys" ruse and even got famed AI model technology expert Christine Lagarde to chime in.
Dario talks like an aw shucks nerd, so everybody falls for their consistently sneaky marketing.
Sama talks like a Stanford shark so everyone assumes what he says (including his deals / AGI beliefs) is insincere.
At this level of capitalism they're all sharks. Even 2-3 levels down it's still 100% sharks. Just assume that any founder of a >Series C startup is a killer, and assume that the entire exec team of any company valued at $1B+ is 100% predators.
@mark_k They can't invent. Their creativity is limited by that what exists and on that which they have been trained. It's perfectly logical that they can only iterate and not create.
For now. 😅