๐ฅ The Weaviate team is on absolute fire this month, shipping one major update after another!
๐ฃ๏ธ Today, I am incredibly excited to roll out a feature requested by MANY of our community members
๐ We have just launched a ๐๐ฅ๐๐ ๐๐ข๐ฅ๐๐ฉ๐๐ฅ ๐ง๐๐๐ฅ on Weaviate Cloud!
You can use AgentIR embeddings in the Weaviate Database with the `text2vec_huggingface` module!
๐ค๐
And Happy Birthday to the lead creator and maintainer of Weaviate Modules, @antas_marcin! ๐
The era of juggling 5 different embedding models is over.
Google just unified text, images, video, audio, and PDFs into one vector space.
๐ข๐ป๐ฒ ๐บ๐ผ๐ฑ๏ฟฝ๏ฟฝ๐น, ๐บ๐๐น๐๐ถ๐ฝ๐น๐ฒ ๐บ๐ผ๐ฑ๐ฎ๐น๐ถ๐๐ถ๐ฒ๐: Text, images, video, audio, and PDFs all mapped into a single unified vector space. No more juggling different embedding models or complex preprocessing pipelines.
๐๐๐ถ๐น๐ ๐ผ๐ป ๐๐ฒ๐บ๐ถ๐ป๐ถ ๐ฎ๐ฟ๐ฐ๐ต๐ถ๐๐ฒ๐ฐ๐๐๐ฟ๐ฒ with support for 100+ languages and some impressive specs:
โข 8192 max input tokens
โข Flexible output dimensions (128-3072)
โข Top 5 performance on MTEB Multilingual leaderboard
โข SOTA among proprietary models across most modalities
๐ช๐ต๐ ๐๐ต๐ถ๐ ๐บ๐ฎ๐๐๐ฒ๐ฟ๐ ๐ณ๐ผ๐ฟ ๐๐ผ๐๐ฟ ๐ฅ๐๐ ๐ฎ๐ฝ๐ฝ๐น๐ถ๐ฐ๐ฎ๐๐ถ๐ผ๐ป๐:
By natively handling interleaved data without intermediate processing steps, Gemini Embedding 2 simplifies complex pipelines. You can now build semantic search and recommendation systems that seamlessly work across text documents, images, videos, and audio files.
The model is available now via Gemini API and Vertex AI, and works with Weaviate's existing text2vec-google integration ๐
Check out these recipes to get started ๐
Semantic search/RAG over video: https://t.co/IzU7XZ34N7
Semantic search/RAG over audio: https://t.co/q7WNOncgx4
Multimodal PDF RAG: https://t.co/hPhbcNwk4D
Small Change, Big Impact (Day 3/5): 12x Reduction of Inter-zonal traffic โก๏ธ๐
Another day, another significant win. Today it's not a performance update, but a traffic optimization.
Learn how we reduced traffic by 12x on a large customer's cluster with 1,700+ updates per second. ๐งต
Small Change, Big Impact (Day 2/5): Faster re-scoring for compressed HNSW (PQ/SQ/RQ/BQ) โก๏ธ๐
Any time you have compression, you have rescoring. Today's hidden improvement is a 25%+ speed-up from faster rescoring through better re-use of resources. Details in ๐งต
Small Change, Big Impact: Day 1/5: PQ Speed-up ๐โ๏ธ
Product Quantization just got ~60% faster on average between v1.34.7 and v1.34.8.
How? Why? It uses an optmization technique that's probably as old as coding itself. More in ๐งตโฌ๏ธ
If you want to have good multi lingual text and image search then you should use OpenCLIP models, for only image search I would go with ModernVBERT-embed.
I have used this dataset: https://t.co/DzrGXovC2p
and made the whole project open source, so if you want to run it locally the checkout this repo: https://t.co/utrWvYdzFe
Want to compare different CLIP models head-to-head?
I've built a simple web app where you can test 3 search types (similarity search, image search, text-to-image search) across 4 open-source CLIP models:
- facebook/metaclip-2-worldwide-b32-384
- ModernVBERT/modernvbert-embed
- OpenCLIP xlm-roberta-base-ViT-B-32 pretrained: laion5b_s13b_b90k
- google/siglip2-so400m-patch16-512
Images are stored in @weaviate_io and inference runs on an NVIDIA Jetson AGX Orin.
Try it out and evaluate them yourself: https://t.co/lnSVP1Hx8I
#weaviate #nvidia #agxorin #clip
Just released: Multi2Vec CLIP inference container 1.5.0ย ๐
This release contains:
- Support for facebook MetaClip2 models
- Support forย ModernVBERT/modernvbert-embedย model
- Added support for running inference container onย NVIDIA Jetson devices
Check out the docs to spin it up: https://t.co/08dr4zTaIt
Just released: Multi2Vec CLIP inference container 1.5.0ย ๐
This release contains:
- Support for facebook MetaClip2 models
- Support forย ModernVBERT/modernvbert-embedย model
- Added support for running inference container onย NVIDIA Jetson devices
Check out the docs to spin it up: https://t.co/08dr4zTaIt