๐ Look who's climbing! Trieve @trieveai just crossed 2,500 GitHub stars! Props to the team ๐ซ
AI-powered search and RAG platform for faster, more relevant results
https://t.co/4QJbMUabsa
so Trieve was really a bad idea because it's mostly true that xgboost is all you need
big enterprises who spend a lot on search are primarily spending money on staff and data pipelines to tune xgboost for ranking a top-k set of results that comes back from their search engine db
sparse/dense/multi vector(s) scores are ultimately just data points which go into the xgboost model that decides on ranking (this is called "learning to rank" if you want to google)
it is important to have as many data points as you can get. most companies will add things like # of sales, likes, etc.
tooling for this isn't great, the elastic/solr plugins have outdated documentation and data engineering with LLMs is still way too intensive
startups don't use xgboost for their "optimized retrieval pipelines" and that's a travesty
someone out there needs to work on making it better/easier
We acquired Trieve! ๐
@mintlify now powers over 23M search and AI queries per month. To deliver the AI-native docs experience developers deserve, we needed world-class search and retrieval infrastructure.
Trieve's founders @skeptrune and @cdxker built exactly that - and now they're joining our team to scale it even further.
The future of developer experience is conversational, instant, and accurate. Excited to build it together with the Trieve team
Welcome to the family @skeptrune@cdxker!!
Excited for world-class search and retrieval infra for AI native docs experience ๐ Can't wait to see what we ship together @mintlify@trieveai
We built Uzi as a tool for our own use and are sharing it with everyone!
Is only one instance of a codegen agent "good enough" anymore?
Check out the repo here: https://t.co/FldRxDEwrL
Uzi is basically your command center for parallel AI coding agents, leveraging Git worktrees to keep them all working without tripping over each other
Some are finding success running agents in parallel, but thereโs no easy way to do it, let alone manage it at enterprise scale
Most RAG systems are dumb.
LLMs are smart enough to search and will re-query until they find the right thing. Give them search tools and let them figure it out. We built this agentic approach into a single API route.
Just `use_agentic_search: true` in your API call ๐
Cheating at Search with LLMs
We've been doing this thing for a while at Trieve that we've been calling "cheating at Search with LLMs" and I thought it'd be cool to talk about it.
The Problem: Smart Product Comparisons
For our Gen AI sales associate Shopify app, we wanted to make it possible to do cool things like generate a comparison table for any two products. Take this example from the brand LifeStraw, which sells filterable straws. If a customer asks to "compare the Sip against the LifeStraw" (two different products in their portfolio), we need to quickly look inside their catalog to determine which two products to fetch.
The challenge? No traditional keyword, semantic, or hybrid search would ever be intelligent enough without an LLM to understand the exact two products being discussed.
Our Solution: Let the LLM Do the Hard Work
So we cheat. Here's how it works:
1. First, we do a standard search with the user's query and get the top 20 results. Each group represents a product, and each chunk within that group is a variant of that product (like different colors or pack sizes).
2. Then we use a tool called "determine relevance" that asks the LLM to rank each product as high, medium, or low relevance to the query. We pass each product's JSON, HTML, description text, and title to the LLM.
3. The LLM examines each product and makes the call. For example, it might mark the LifeStraw Sip Cotton Candy variant as "high" relevance, and the regular Life Straw as "high" relevance, while everything else gets "medium" or "low."
4. We then use these relevance rankings to display only the most relevant products to the user.
Making It Fast
Despite making 20+ LLM calls in the background, the experience feels instantaneous to the user thanks to semantic caching on all the tool calls. If I run the same comparison again, it's blazing fast.
Going Even Further
We extend this approach to other aspects of search:
- Price Filters: We have a tool call that extracts min and max price parameter
- Category Determination: For stores with predefined categories, we use LLMs to determine the right category
- Format Selection: We use tool calls to decide whether to generate text or images
- Context Retention: If a user follows up with "tell me more about the Life Straw's filtration," we don't need to search again - we just use the same products from before
Why This Matters
It literally feels like cheating, which is incredible. In the early days, we spent a ton of time building super high-relevance search pipelines. But with modern LLMs, that's unnecessary. You can just fetch 20 things, give the LLM the query and each fetched item, and ask it which ones are relevant.
Absolute madness. Intelligence as a commodity.
๐ Just published: Trieve @trieveai โ AI-powered search and RAG platform for faster, more relevant results
Trieve offers an all-in-one solution for search, recommendations, and RAG with automatic continuous improvement based on user feedback.
https://t.co/4QJbMUabsa
vibe coded an ai agent that comes up with new relationship questions and uses @trieveai to make sure the questions don't sound too similar to ones people have already seen in our app
infinite content glitch hahaha
(kidding we will still need to manually vet these but it makes life a lot easier)
@appwrite@supabase@pocketbase@AirbyteHQ@coolifyio AI & MACHINE LEARNING:
@LLMWare - LLM-driven solution platform
@TrieveAI - Specialized AI infrastructure for retrieval tasks
These aren't just alternatives, they're often MORE powerful than their paid counterparts!
Build Blazingly Fast Typo Detection using Trieve
Trieve is an open source, all-in-one infrastructure for search, recommendations, RAG, and analytics offered via API. Our latest version is 100 times faster, 300ฮผs for correctly spelled queries and ~5ms / word for misspellings. We explain how it was accomplished in this post. #trieve #RAG #Rust
https://t.co/XtkEifBYqk