Marcin Antas @antas_marcin - Twitter Profile

Marcin Antas

@antas_marcin

7 days ago

@GPW_Trader2022 250

2

1

0

285

antas_marcin retweeted

Bob van Luijt @bobvanluijt

8 days ago

🔥 The Weaviate team is on absolute fire this month, shipping one major update after another! 🗣️ Today, I am incredibly excited to roll out a feature requested by MANY of our community members 🚀 We have just launched a 𝗙𝗥𝗘𝗘 𝗙𝗢𝗥𝗘𝗩𝗘𝗥 𝗧𝗜𝗘𝗥 on Weaviate Cloud!

1

10

5

3

764

Marcin Antas

@antas_marcin

10 days ago

@helloiamleonie @liquidai @maximelabonne @paulabartabajo_ wow! Congratulations!!!

0

12

antas_marcin retweeted

Connor Shorten

@CShorten30

about 1 month ago

You can use AgentIR embeddings in the Weaviate Database with the `text2vec_huggingface` module! 🤗💚 And Happy Birthday to the lead creator and maintainer of Weaviate Modules, @antas_marcin! 🎂

CShorten30's tweet photo. You can use AgentIR embeddings in the Weaviate Database with the `text2vec_huggingface` module!
🤗💚

And Happy Birthday to the lead creator and maintainer of Weaviate Modules, @antas_marcin! 🎂

2

17

10

7

1K

Who to follow

Erika Shorten

@eshorten300

Partnerships @weaviate_io | Diary about agents, LLM frameworks, and vector databases 🤪

Zain

@ZainHasan6

I build and teach AI • AI/ML @togethercompute • EngSci ℕΨ/PhD @UofT • Previously: vector DBs, data scientist, lecturer & health tech founder • 🇺🇸🇨🇦🇵🇰

Etienne Dilocker

@etiennedi

Co-Founder & CTO @weaviate_io

Marcin Antas

@antas_marcin

about 1 month ago

@CShorten30 Awesome!! and thank you for birthday wishes @CShorten30 !!!

0

2

0

31

Marcin Antas

@antas_marcin

about 2 months ago

@KryptoDzikPL Super seria postów. Bardzo ciekawa. Mam pytanie co sądzi Pan o nadchodzącym IPO Cerebras?

0

199

Marcin Antas

@antas_marcin

3 months ago

@CShorten30 @MarcusForPeace @GoogleResearch NIce!!!

1

0

20

Marcin Antas

@antas_marcin

3 months ago

@KinasRemek @nvidia Mega! Gratulacje!!!

0

66

Marcin Antas

@antas_marcin

3 months ago

Weaviate on NVIDIA GTC 2026 slides!

Bob van Luijt @bobvanluijt

3 months ago

👀 And what did my eyes see during Jensen's #GTC keynote...?

2

30

7

1

6K

1

5

1

0

827

antas_marcin retweeted

Weaviate AI Database

@weaviate_io

3 months ago

The era of juggling 5 different embedding models is over. Google just unified text, images, video, audio, and PDFs into one vector space. 𝗢𝗻𝗲 𝗺𝗼𝗱��𝗹, 𝗺𝘂𝗹𝘁𝗶𝗽𝗹𝗲 𝗺𝗼𝗱𝗮𝗹𝗶𝘁𝗶𝗲𝘀: Text, images, video, audio, and PDFs all mapped into a single unified vector space. No more juggling different embedding models or complex preprocessing pipelines. 𝗕𝘂𝗶𝗹𝘁 𝗼𝗻 𝗚𝗲𝗺𝗶𝗻𝗶 𝗮𝗿𝗰𝗵𝗶𝘁𝗲𝗰𝘁𝘂𝗿𝗲 with support for 100+ languages and some impressive specs: • 8192 max input tokens • Flexible output dimensions (128-3072) • Top 5 performance on MTEB Multilingual leaderboard • SOTA among proprietary models across most modalities 𝗪𝗵𝘆 𝘁𝗵𝗶𝘀 𝗺𝗮𝘁𝘁𝗲𝗿𝘀 𝗳𝗼𝗿 𝘆𝗼𝘂𝗿 𝗥𝗔𝗚 𝗮𝗽𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀: By natively handling interleaved data without intermediate processing steps, Gemini Embedding 2 simplifies complex pipelines. You can now build semantic search and recommendation systems that seamlessly work across text documents, images, videos, and audio files. The model is available now via Gemini API and Vertex AI, and works with Weaviate's existing text2vec-google integration 💚 Check out these recipes to get started 👇 Semantic search/RAG over video: https://t.co/IzU7XZ34N7 Semantic search/RAG over audio: https://t.co/q7WNOncgx4 Multimodal PDF RAG: https://t.co/hPhbcNwk4D

weaviate_io's tweet photo. The era of juggling 5 different embedding models is over.

Google just unified text, images, video, audio, and PDFs into one vector space.

𝗢𝗻𝗲 𝗺𝗼𝗱��𝗹, 𝗺𝘂𝗹𝘁𝗶𝗽𝗹𝗲 𝗺𝗼𝗱𝗮𝗹𝗶𝘁𝗶𝗲𝘀: Text, images, video, audio, and PDFs all mapped into a single unified vector space. No more juggling different embedding models or complex preprocessing pipelines.

𝗕𝘂𝗶𝗹𝘁 𝗼𝗻 𝗚𝗲𝗺𝗶𝗻𝗶 𝗮𝗿𝗰𝗵𝗶𝘁𝗲𝗰𝘁𝘂𝗿𝗲 with support for 100+ languages and some impressive specs:
• 8192 max input tokens
• Flexible output dimensions (128-3072)
• Top 5 performance on MTEB Multilingual leaderboard
• SOTA among proprietary models across most modalities

𝗪𝗵𝘆 𝘁𝗵𝗶𝘀 𝗺𝗮𝘁𝘁𝗲𝗿𝘀 𝗳𝗼𝗿 𝘆𝗼𝘂𝗿 𝗥𝗔𝗚 𝗮𝗽𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀:
By natively handling interleaved data without intermediate processing steps, Gemini Embedding 2 simplifies complex pipelines. You can now build semantic search and recommendation systems that seamlessly work across text documents, images, videos, and audio files.

The model is available now via Gemini API and Vertex AI, and works with Weaviate's existing text2vec-google integration 💚

Check out these recipes to get started 👇

Semantic search/RAG over video: https://t.co/IzU7XZ34N7

Semantic search/RAG over audio: https://t.co/q7WNOncgx4

Multimodal PDF RAG: https://t.co/hPhbcNwk4D

3

81

19

50

4K

antas_marcin retweeted

Etienne Dilocker

@etiennedi

5 months ago

Small Change, Big Impact (Day 3/5): 12x Reduction of Inter-zonal traffic ⚡️📉 Another day, another significant win. Today it's not a performance update, but a traffic optimization. Learn how we reduced traffic by 12x on a large customer's cluster with 1,700+ updates per second. 🧵

etiennedi's tweet photo. Small Change, Big Impact (Day 3/5): 12x Reduction of Inter-zonal traffic ⚡️📉

Another day, another significant win. Today it's not a performance update, but a traffic optimization.

Learn how we reduced traffic by 12x on a large customer's cluster with 1,700+ updates per second. 🧵

1

8

4

1

473

antas_marcin retweeted

Etienne Dilocker

@etiennedi

5 months ago

Small Change, Big Impact (Day 2/5): Faster re-scoring for compressed HNSW (PQ/SQ/RQ/BQ) ⚡️📈 Any time you have compression, you have rescoring. Today's hidden improvement is a 25%+ speed-up from faster rescoring through better re-use of resources. Details in 🧵

etiennedi's tweet photo. Small Change, Big Impact (Day 2/5): Faster re-scoring for compressed HNSW (PQ/SQ/RQ/BQ) ⚡️📈

Any time you have compression, you have rescoring. Today's hidden improvement is a 25%+ speed-up from faster rescoring through better re-use of resources. Details in 🧵

2

15

7

1

1K

antas_marcin retweeted

Etienne Dilocker

@etiennedi

5 months ago

Small Change, Big Impact: Day 1/5: PQ Speed-up 📈⚙️ Product Quantization just got ~60% faster on average between v1.34.7 and v1.34.8. How? Why? It uses an optmization technique that's probably as old as coding itself. More in 🧵⬇️

etiennedi's tweet photo. Small Change, Big Impact: Day 1/5: PQ Speed-up 📈⚙️

Product Quantization just got ~60% faster on average between v1.34.7 and v1.34.8.

How? Why? It uses an optmization technique that's probably as old as coding itself. More in 🧵⬇️

2

20

7

1

1K

antas_marcin retweeted

Bob van Luijt @bobvanluijt

5 months ago

💪 Nice! Awesome work, Weaviate team! 👉 https://t.co/gYlVVxdXjT

0

10

5

0

865

Marcin Antas

@antas_marcin

6 months ago

If you want to have good multi lingual text and image search then you should use OpenCLIP models, for only image search I would go with ModernVBERT-embed. I have used this dataset: https://t.co/DzrGXovC2p and made the whole project open source, so if you want to run it locally the checkout this repo: https://t.co/utrWvYdzFe

0

1

0

1

20

Marcin Antas

@antas_marcin

6 months ago

Want to compare different CLIP models head-to-head? I've built a simple web app where you can test 3 search types (similarity search, image search, text-to-image search) across 4 open-source CLIP models: - facebook/metaclip-2-worldwide-b32-384 - ModernVBERT/modernvbert-embed - OpenCLIP xlm-roberta-base-ViT-B-32 pretrained: laion5b_s13b_b90k - google/siglip2-so400m-patch16-512 Images are stored in @weaviate_io and inference runs on an NVIDIA Jetson AGX Orin. Try it out and evaluate them yourself: https://t.co/lnSVP1Hx8I #weaviate #nvidia #agxorin #clip

antas_marcin's tweet photo. Want to compare different CLIP models head-to-head?

I've built a simple web app where you can test 3 search types (similarity search, image search, text-to-image search) across 4 open-source CLIP models:

- facebook/metaclip-2-worldwide-b32-384
- ModernVBERT/modernvbert-embed
- OpenCLIP xlm-roberta-base-ViT-B-32 pretrained: laion5b_s13b_b90k
- google/siglip2-so400m-patch16-512

Images are stored in @weaviate_io and inference runs on an NVIDIA Jetson AGX Orin.

Try it out and evaluate them yourself: https://t.co/lnSVP1Hx8I

#weaviate #nvidia #agxorin #clip

1

4

1

0

375

Marcin Antas

@antas_marcin

6 months ago

@KinasRemek Zapowiada się mega ciekawie!

0

1

0

81

Marcin Antas

@antas_marcin

6 months ago

@RBrzoska @MistralAI @bielikllm to by było coś niesamowitego!

0

244

Marcin Antas

@antas_marcin

6 months ago

With our newest CLIP inference container we added support for NVIDIA Jetson devices (JetPack 6) here's link how to use it: https://t.co/cUB0lB4z9m

Weaviate AI Database

@weaviate_io

6 months ago

Just released: Multi2Vec CLIP inference container 1.5.0 🎉 This release contains: - Support for facebook MetaClip2 models - Support for ModernVBERT/modernvbert-embed model - Added support for running inference container on NVIDIA Jetson devices Check out the docs to spin it up: https://t.co/08dr4zTaIt

weaviate_io's tweet photo. Just released: Multi2Vec CLIP inference container 1.5.0 🎉

This release contains:
- Support for facebook MetaClip2 models
- Support for ModernVBERT/modernvbert-embed model
- Added support for running inference container on NVIDIA Jetson devices

Check out the docs to spin it up: https://t.co/08dr4zTaIt

1

20

5

4

1K

0

5

0

113

antas_marcin retweeted

Weaviate AI Database

@weaviate_io

6 months ago

Just released: Multi2Vec CLIP inference container 1.5.0 🎉 This release contains: - Support for facebook MetaClip2 models - Support for ModernVBERT/modernvbert-embed model - Added support for running inference container on NVIDIA Jetson devices Check out the docs to spin it up: https://t.co/08dr4zTaIt

1

20

5

4

1K

Marcin Antas

@antas_marcin

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users