@Altimor Pulled the trigger today and switched 100% of Lindy traffic to @HypernymAI , churning from vanilla models. Saves us millions of $ and we're actually seeing an *increase* in context performance on many core use cases. Transformative for the business.
The common belief today is that once models exceed 256k tokens, context performance falls off a cliff.
At Hypernym, we’ve internally proven this widely accepted “fact” is false.
It’s not a technical limitation — it’s simply the result of an industry moving too fast.
No matter which model we apply our infrastructure to, LITM drastically improves — among many other things.
More to come. 🚀
@josephweinberg@elonmusk Wait until they see Modulum, the next evolution of inference and long context persistent memory enhancements. It would put @spacex on steroids.
The explosion of agentic AI and compute shortages are pushing up prices: Average LLM token costs are now $2.12/mil tokens,+12% this week alone and +65% since end of Feb.
> youʼre OpenAI
> hire a small army of ex-Meta ad and monetization people
> a Slack channel just for ex-Facebook staff
> brings in the full “targeted ads” playbook
> launch a browser
> users install it, and OpenAI collects personalized, granular data at scale
> it’s a browser-shaped surveillance device
> it’s a mapping machine of your workflows
> itʼs a reverse-engineering tool for the internetʼs data pipelines, deployed at scale via their users
> launch Sora 2
> a TikTok‑style social network
> infinite AI-generated video feed
> you create or remix clips, upload your face, become the cameo star
> every scroll, like, remix is another data point, another ad signal
> their model learns exactly what hooks you and dials up the dopamine
> you’re not just watching, you’re training their algorithm for better ad targeting
> viral videos driven by your input + their algorithm = your attention refined into $$$
> “your feedback helps us improve the experience” (yeah, for advertisers)
> launch “Pulse”
> reads your chats while you sleep
> remembers you wanna visit Bora Bora
> knows your kid is 6 months old and
> “thinks” of your baby milestones
> suggests developmental toys next
> “it's for your convenience”
> actually laying the groundwork for targeted ads using memory
> internal memo: some people already think ChatGPT shows ads
> OpenAI staff: “might as well then”
> congrats, you’re back in the Facebook era
> except this time, you’re training the algo yourself
> Buy a GPU
> run your LLMs locally
> reject adware LLMs before it’s too late