Just pushed an early version of 'Advanced Search'
It combines vector similarity search with AI reasoning to rank the most relevant podcast clips first, not just phrase and keyword matches. Simply describe the clip!
Smarter results, better context, faster discovery 🔥
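For anyone curious what "vector similarity + AI reasoning" can look like, here's a rough two-stage sketch, not the actual implementation. It assumes each clip already has an embedding, shortlists by cosine similarity, then reorders the shortlist with a second scorer. The names `advanced_search`, `rerank`, `k`, and `top` are made up for illustration, and `rerank` is just a stand-in for whatever model does the reasoning step.

```python
import math
from typing import Callable, Dict, List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def advanced_search(
    query_vec: Sequence[float],
    clips: List[Dict],
    rerank: Callable[[str], float],  # hypothetical stand-in for the AI reasoning step
    k: int = 20,
    top: int = 5,
) -> List[Dict]:
    """Stage 1: shortlist the k nearest clips. Stage 2: reorder them with `rerank`."""
    shortlist = sorted(
        clips, key=lambda c: cosine(query_vec, c["vec"]), reverse=True
    )[:k]
    return sorted(shortlist, key=lambda c: rerank(c["text"]), reverse=True)[:top]
```

The point of splitting it this way: the cheap vector pass narrows a huge catalog down to a handful of candidates, so the slower, smarter scorer only ever runs on those.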
Apparently Patreon thumbnail URLs expire…that’s ok tho, I wrote a lil script that pulled the active URL for each one and downloaded the current catalog of thumbnails so they load locally
Moving forward, the thumbnails will download whenever the system pulls in new episodes
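A minimal sketch of the "download once, load locally" idea, assuming you already have a fresh thumbnail URL for each episode (how you get that URL is not shown). `cache_thumbnail`, `fetch_bytes`, the `.jpg` extension, and the folder layout are all hypothetical choices for illustration, not the real script.

```python
from pathlib import Path
from typing import Callable
from urllib.request import urlopen


def fetch_bytes(url: str) -> bytes:
    """Download the raw bytes at `url`."""
    with urlopen(url, timeout=30) as resp:
        return resp.read()


def cache_thumbnail(
    episode_id: str,
    url: str,
    dest_dir: Path,
    fetch: Callable[[str], bytes] = fetch_bytes,
) -> Path:
    """Save the thumbnail locally once; later calls reuse the file on disk."""
    dest_dir.mkdir(parents=True, exist_ok=True)
    path = dest_dir / f"{episode_id}.jpg"  # assumed extension
    if not path.exists():
        # Only hit the network while the URL is still live.
        path.write_bytes(fetch(url))
    return path
```

Calling this from the same job that ingests new episodes means an expired URL stops mattering once the file is on disk.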
I don’t think people realize the scale of data behind the JBP Search Engine.
Every single episode since Episode 1 has been transcribed and converted into embeddings for semantic search.
This isn’t keyword matching. It understands the context of your query and returns the exact moment you’re looking for…not just a list of episodes where something was mentioned.
1,415 episodes. ~3 hours each. Over 600GB of locally stored audio + additional metadata powering search quality.
Could you do it? Maybe, but just know, I’ve personally spent thousands of dollars in compute to make this possible.
There’s no way around it when dealing with this much data.
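To give a feel for how "the exact moment" could work, here's a rough sketch under assumptions: transcripts come as segments with a start time in seconds, they get grouped into fixed windows before embedding, and each window keeps its start time so a hit maps back to a timestamp. The 30-second window, the segment format, and the function names are guesses for illustration only. The last lines just do the math on the numbers above.

```python
from typing import Dict, List


def chunk_segments(segments: List[Dict], window: float = 30.0) -> List[Dict]:
    """Group {'start': seconds, 'text': str} segments into windows of roughly
    `window` seconds, keeping the start time of each window."""
    chunks: List[Dict] = []
    for seg in segments:
        if not chunks or seg["start"] - chunks[-1]["start"] >= window:
            chunks.append({"start": seg["start"], "text": seg["text"]})
        else:
            chunks[-1]["text"] += " " + seg["text"]
    return chunks


def timestamp(seconds: float) -> str:
    """Format seconds as H:MM:SS so a search hit can link to the moment."""
    s = int(seconds)
    return f"{s // 3600}:{s % 3600 // 60:02d}:{s % 60:02d}"


# Back-of-envelope scale from the post: 1,415 episodes at ~3 hours each.
hours = 1415 * 3                  # 4,245 hours of audio
est_chunks = hours * 3600 // 30   # 509,400 chunks at the assumed 30 s window
```

So even with a fairly coarse window you're looking at roughly half a million chunks to embed and index, which is where the compute bill comes from.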
@thestoptv Don't forget the Kill Tony specials alley-ooped into Kevin Hart's Netflix original show 'Funny AF', the exact same thing as Kill Tony lol, American Idol for comedians
Heads up, there's no easy way to get all the Patreon thumbnails if you aren't the creator
You'll have to build a script to pull them from their backend
Tip: there isn't a public API for this either, so get creative...just tryna help the folks rebuilding this in 20 minutes 😉
Flip is great on the pod for when you're on your phone while watching, because you know he's gonna ask "wait, what are we talking about?" halfway through the convo
It's honestly really helpful 😂