Sergio Moya

almost 2 years ago

Are you interested in distributed systems? Grab your 🍿 because you might enjoy diving into an interesting discussion about distributed locks and the Redlock pattern used in Redis. Conversations took place a few years ago but they are still relevant. https://t.co/3cptUiIa9p

937

smoyac retweeted

about 1 year ago

Read more about the new pgai Vectorizer: https://t.co/N8m1jVpO6t Check out the pgai Github to get started: https://t.co/iDvO7OUDTd

272

smoyac retweeted

about 1 year ago

SIMPLIFY DOCUMENT EMBEDDINGS WITH PGAI VECTORIZER: Postgres + Amazon S3 Tired of complex AI pipelines for document embeddings? We've expanded pgai Vectorizer to automatically create searchable vector embeddings in Postgres from documents stored in S3 while keeping the original files in place. The document embedding challenge Before pgai Vectorizer, developers building RAG applications had to manage complex ETL pipelines, multiple systems, data synchronization services, queuing systems, and monitoring tools just to keep document embeddings up-to-date. This complexity creates a brittle infrastructure that inevitably leads to stale embeddings and wasted engineering hours. Automatic document vectorization Pgai Vectorizer now provides a streamlined approach where you can reference documents in S3 via URLs stored in a database table. The vectorizer handles the complete workflow—downloading documents, parsing them to extract content, chunking text appropriately, and generating embeddings for vector search. pgai Vectorizer enables developers to: ✅ Get started more easily with automatic embedding creation via a simple SQL command. ✅ Spend less time wrangling data infrastructure with automatic updating and synchronization. ✅ Continuously improve your AI systems by testing different embedding models or chunking strategies with a single line of SQL. We support a wide variety of file formats including PDF, DOCX, TXT, XLSX, PPTX, images, HTML, and more. Why did we build this? At @timescaledb, we believe Postgres is the ideal home for both your structured data and vector embeddings. By keeping embeddings automatically synchronized to source documents in S3, pgai Vectorizer ensures your Postgres database remains the single source of truth for your AI applications. Want to unlock insights from mountains of your documents without the infrastructure headache? Try pgai Vectorizer today. [1] Learn more: https://t.co/XmDc0zar4b [2] Pgai Github: https://t.co/hut1MxuwPZ Share this post with your AI and data teams to let them know about pgai Vectorizer document embedding support.

avthar's tweet photo. SIMPLIFY DOCUMENT EMBEDDINGS WITH PGAI VECTORIZER: Postgres + Amazon S3

Tired of complex AI pipelines for document embeddings? We've expanded pgai Vectorizer to automatically create searchable vector embeddings in Postgres from documents stored in S3 while keeping the original files in place.

The document embedding challenge
Before pgai Vectorizer, developers building RAG applications had to manage complex ETL pipelines, multiple systems, data synchronization services, queuing systems, and monitoring tools just to keep document embeddings up-to-date. This complexity creates a brittle infrastructure that inevitably leads to stale embeddings and wasted engineering hours.

Automatic document vectorization
Pgai Vectorizer now provides a streamlined approach where you can reference documents in S3 via URLs stored in a database table. The vectorizer handles the complete workflow—downloading documents, parsing them to extract content, chunking text appropriately, and generating embeddings for vector search.

pgai Vectorizer enables developers to:
✅ Get started more easily with automatic embedding creation via a simple SQL command.
✅ Spend less time wrangling data infrastructure with automatic updating and synchronization.
✅ Continuously improve your AI systems by testing different embedding models or chunking strategies with a single line of SQL.

We support a wide variety of file formats including PDF, DOCX, TXT, XLSX, PPTX, images, HTML, and more.

Why did we build this?
At @timescaledb, we believe Postgres is the ideal home for both your structured data and vector embeddings. By keeping embeddings automatically synchronized to source documents in S3, pgai Vectorizer ensures your Postgres database remains the single source of truth for your AI applications.

Want to unlock insights from mountains of your documents without the infrastructure headache? Try pgai Vectorizer today.

[1] Learn more: https://t.co/XmDc0zar4b
[2] Pgai Github: https://t.co/hut1MxuwPZ

Share this post with your AI and data teams to let them know about pgai Vectorizer document embedding support.

smoyac retweeted

Programador web. Me encantan los videojuegos, vamos, un fReAk en toda regla (XD)

about 1 year ago

PGVECTOR VS QDRANT: You don’t need a specialized vector database for large scale. Postgres is all you need. 🐘There's a common misconception that Postgres and pgvector can't scale for vectors. That’s why we @TimescaleDB built pgvectorscale, an open-source PostgreSQL extension that supercharges pgvector with greater performance for large scale vector workloads. And we put it to the test against Qdrant, the leading specialized vector database, on an ANN benchmark of 50M embeddings. The results surprised even us... 📈 How did Postgres perform vs Qdrant? Postgres with pgvector and pgvectorscale outperforms Qdrant on throughput by 11.4x and delivers sub-75ms p99 query latencies at 99% recall. Qdrant does deliver better query latencies (between 39-48% better than Postgres), showing less variance between percentiles, which is to be expected from a purpose built vectordb. 🤔 Why does this matter? Postgres is the database that many developers already know, use, and trust. And this benchmark shows that it can not only keep up, but even outperform specialized vector databases on high-scale workloads. While there are niche use cases that do benefit from a dedicated vector database like Qdrant, pgvector and pgvectorscale enables 99% of developers to start and scale confidently with just PostgreSQL. 🧑‍💻 Sounds exciting! How can I get started? Pgvectorscale is open-source under the PostgreSQL license, and free to use on any PostgreSQL database. It’s available on any database service on Timescale Cloud, and you can find installation instructions on the pgvectorscale GitHub repository (see end of post). 👏 Big shoutout to the pgvectorscale team @cevianNY , TJ Green, and Smitty van Bodegom who worked on the technical innovations behind pgvectorscale like StreamingDiskANN indexes, Statistical Binary Quantization, and this benchmark study itself. Ya’ll are the real MVPs. 📚 Learn more [1] Full Pgvector vs Qdrant benchmark blog post: https://t.co/xxh0nLBUdl [2] pgvectorscale GitHub repo: https://t.co/xcpA65dWOQ Share this post with your team to let them know about pgvectorscale vs Qdrant and comment with your reactions and questions!

avthar's tweet photo. PGVECTOR VS QDRANT: You don’t need a specialized vector database for large scale. Postgres is all you need.

🐘There's a common misconception that Postgres and pgvector can't scale for vectors. That’s why we @TimescaleDB built pgvectorscale, an open-source PostgreSQL extension that supercharges pgvector with greater performance for large scale vector workloads.

And we put it to the test against Qdrant, the leading specialized vector database, on an ANN benchmark of 50M embeddings. The results surprised even us...

📈 How did Postgres perform vs Qdrant?
Postgres with pgvector and pgvectorscale outperforms Qdrant on throughput by 11.4x and delivers sub-75ms p99 query latencies at 99% recall.

Qdrant does deliver better query latencies (between 39-48% better than Postgres), showing less variance between percentiles, which is to be expected from a purpose built vectordb.

🤔 Why does this matter?
Postgres is the database that many developers already know, use, and trust. And this benchmark shows that it can not only keep up, but even outperform specialized vector databases on high-scale workloads.

While there are niche use cases that do benefit from a dedicated vector database like Qdrant, pgvector and pgvectorscale enables 99% of developers to start and scale confidently with just PostgreSQL.

🧑‍💻 Sounds exciting! How can I get started?
Pgvectorscale is open-source under the PostgreSQL license, and free to use on any PostgreSQL database. It’s available on any database service on Timescale Cloud, and you can find installation instructions on the pgvectorscale GitHub repository (see end of post).

👏 Big shoutout to the pgvectorscale team @cevianNY , TJ Green, and Smitty van Bodegom who worked on the technical innovations behind pgvectorscale like StreamingDiskANN indexes, Statistical Binary Quantization, and this benchmark study itself. Ya’ll are the real MVPs.

📚 Learn more
[1] Full Pgvector vs Qdrant benchmark blog post: https://t.co/xxh0nLBUdl
[2] pgvectorscale GitHub repo: https://t.co/xcpA65dWOQ

Share this post with your team to let them know about pgvectorscale vs Qdrant and comment with your reactions and questions!

Who to follow

Aitor López

@aitorlopez84

Albert Casademont

@acasademont

Software engineer & web performance enthusiast. Engineering at @aetioninc. Proud @ulabox and @wallbox_es alumni

Ronny López

@ronnylt

Eventually opinionated. Opinions are my own, but I change them very often.

smoyac retweeted

OpenAI Developers

@OpenAIDevs

over 1 year ago

ChatGPT for macOS can now edit code directly in IDEs. Available to Plus, Pro, and Team users.

356

567

over 1 year ago

A printable #postgres psql commands cheatsheet I made from the official documentation - https://t.co/NpW0f1dPUq

smoyac retweeted

over 1 year ago

Stop Over-Engineering AI Apps 🙅 AI frameworks promise simplicity but often add needless complexity. LangChain, for example, buries developers in abstraction—many start with it and then rip it out. The reality? AI apps are still just apps. Most AI ops (embeddings, retrieval, LLM calls) are just basic data types. Why not keep it simple? ✅ PostgreSQL + pgvector + pgai → Vector search, no extra infra ✅ LiteLLM → One API, all LLMs ✅ SQLAlchemy + pgai → AI-powered search in your Python app Build with boring, composable tech, not hype. Check it out: https://t.co/npJqqGG3ev #AI #PostgreSQL #pgvector #LLMs #KeepItSimple

TimescaleDB's tweet photo. Stop Over-Engineering AI Apps 🙅

AI frameworks promise simplicity but often add needless complexity. LangChain, for example, buries developers in abstraction—many start with it and then rip it out.

The reality? AI apps are still just apps. Most AI ops (embeddings, retrieval, LLM calls) are just basic data types. Why not keep it simple?

✅ PostgreSQL + pgvector + pgai → Vector search, no extra infra
✅ LiteLLM → One API, all LLMs
✅ SQLAlchemy + pgai → AI-powered search in your Python app
Build with boring, composable tech, not hype.

Check it out: https://t.co/npJqqGG3ev

#AI #PostgreSQL #pgvector #LLMs #KeepItSimple

757

smoyac retweeted

over 1 year ago

🎉pgai Vectorizer gets LiteLLM & SQLAlchemy support 🎉 It’s launch week at Timescale, and we have two big updates for developers working with vector search in PostgreSQL: 🔹 LiteLLM integration 👉 Use multiple embedding model providers (@cohere, @huggingface, @MistralAI, @Azure OpenAI, @awscloud Bedrock, @Google Vertex AI) through a single interface. Available for self-hosted users, coming soon to Timescale Cloud. 🔹 SQLAlchemy support 👉 Work with vector embeddings in Python using familiar SQLAlchemy patterns and tools. Less friction, more flexibility. Give it a try, and let us know what you think. Repo and more details below 🔗👇 https://t.co/Ui08YQgYen #postgresql #ai #vectorsearch #orm #sqlalchemy #litellm

TimescaleDB's tweet photo. 🎉pgai Vectorizer gets LiteLLM & SQLAlchemy support 🎉

It’s launch week at Timescale, and we have two big updates for developers working with vector search in PostgreSQL:

🔹 LiteLLM integration 👉 Use multiple embedding model providers (@cohere, @huggingface, @MistralAI, @Azure OpenAI, @awscloud Bedrock, @Google Vertex AI) through a single interface. Available for self-hosted users, coming soon to Timescale Cloud.

🔹 SQLAlchemy support 👉 Work with vector embeddings in Python using familiar SQLAlchemy patterns and tools.

Less friction, more flexibility. Give it a try, and let us know what you think. Repo and more details below 🔗👇
https://t.co/Ui08YQgYen

#postgresql #ai #vectorsearch #orm #sqlalchemy #litellm

smoyac retweeted

over 1 year ago

Proud to announce that pgai and @TimescaleDB are sponsoring The 2025 @aiDotEngineer Summit NYC! AI Engineer Summit the #1 technical conference in the world for AI engineers and leadership. The theme of this Summit is Agents at Work, with a special focus on in-production stories. We're super proud to be supporting this event, and I personally couldn't be more excited for the high-quality sessions and hallway discussions that will take place! Are you attending AI Engineer Summit NYC? And building building RAG or Agents with PostgreSQL and Pgvector? If so, come say hi! Myself and the pgai team will be at the Summit and eager to share stories and demo our latest releases (and sneak peaks and upcoming features).

avthar's tweet photo. Proud to announce that pgai and @TimescaleDB are sponsoring The 2025 @aiDotEngineer Summit NYC!

AI Engineer Summit the #1 technical conference in the world for AI engineers and leadership. The theme of this Summit is Agents at Work, with a special focus on in-production stories.

We're super proud to be supporting this event, and I personally couldn't be more excited for the high-quality sessions and hallway discussions that will take place!

Are you attending AI Engineer Summit NYC? And building building RAG or Agents with PostgreSQL and Pgvector? If so, come say hi! Myself and the pgai team will be at the Summit and eager to share stories and demo our latest releases (and sneak peaks and upcoming features).

over 1 year ago

https://t.co/fPTf48QvC5

smoyac retweeted

David Boyne

@boyney123

over 1 year ago

I've been working on a dashboard for @event_catalog , users will be able to deploy there catalogs, use studio to visual design them, get notifications when schemas change and much more.... I really like how the dashboard is currently looking ❤️

boyney123's tweet photo. I've been working on a dashboard for @event_catalog , users will be able to deploy there catalogs, use studio to visual design them, get notifications when schemas change and much more....

I really like how the dashboard is currently looking ❤️ https://t.co/e3jbMCR3fg

Alena Nikolaeva 🦋 @alenanik11

over 1 year ago

Fotasa. Pues eso, hoy, a las 18:00 CET!

Codely ﹤🍍﹥

@CodelyTV

over 1 year ago

Hay muchas formas de integrar la IA en tu aplicación, pero el mayor problema es el mantenimiento al largo tiempo. Para ello, llegó pgai, con un paradigma que acerca la generación de embeddings a los índices de bases de datos. Hoy, vamos a estar en directo con 2 personas que trabajan en esta herramienta dentro de @TimescaleDB: Sergio y Adolfo. 2 personas de habla hispana que nos van a contar cómo montarlo, cómo funciona por dentro y cómo hace esto para escalar tanto. Además, tendremos un debate sobre por qué se ha montado sobre la base de datos que se ha montado muy interesante. 🕕 Hoy, a las 18:00 CET 🔗 https://t.co/RZUnB23zPn 💬 ¡Responderemos a tus dudas en el chat!

CodelyTV's tweet photo. Hay muchas formas de integrar la IA en tu aplicación, pero el mayor problema es el mantenimiento al largo tiempo.

Para ello, llegó pgai, con un paradigma que acerca la generación de embeddings a los índices de bases de datos.

Hoy, vamos a estar en directo con 2 personas que trabajan en esta herramienta dentro de @TimescaleDB: Sergio y Adolfo.

2 personas de habla hispana que nos van a contar cómo montarlo, cómo funciona por dentro y cómo hace esto para escalar tanto.

Además, tendremos un debate sobre por qué se ha montado sobre la base de datos que se ha montado muy interesante.

🕕 Hoy, a las 18:00 CET
🔗 https://t.co/RZUnB23zPn
💬 ¡Responderemos a tus dudas en el chat!

167

smoyac retweeted

Edward Kmett

@kmett

over 1 year ago

Python is such a great user experience for AI. I can really see how it took over the world. You get the joy of installing and reinstalling the same packages with minor variations in cuda support. Then you get twenty minutes of being told argument after argument is deprecated and that your code will break in 1-2 versions, half of which are being invoked transitively by third-party code you are invoking. disable_exllama? No !use_exllama. Forgot to disable_exllama er.. not use_exllama? No weights for you. Once you suffer through that workflow, you can start iterating at a rate of half an hour to a day for each run of converting a model to another format, only to get a runtime error at the end that, say, 'quantize' method doesn't exist any more, thanks to transformers taking on that part, but then the 'save_quantized' method that is in all the tutorials doesn't exist anymore, so that part is also bunk. Now you're 8 hours into a 20 minute project, but you didn't have to deal with any type errors, so I guess you've got that going for you.

382

26K

smoyac retweeted

over 1 year ago

Any status message longer than 100 characters can be overwhelming in my experience and with testing with screen reader users. If we need to inform the user with a longer text, then we should really re-think our user interface. ❗ https://t.co/1MfpmqEqCk

641

smoyac retweeted

N҉I҉K҉O҉L҉A҉Y҉ @NikolayOCDC

over 1 year ago

Violento es morir en la lista de espera

99K

over 1 year ago

https://t.co/2a6z3VatYu

101

smoyac retweeted

Matvey Arye 🇺🇦

@cevianNY

over 1 year ago

pgai now supports a super-easy way to ingest @huggingface datasets directly into @PostgreSQL.

over 1 year ago

Boom!

Voyage AI by MongoDB

@VoyageAI

over 1 year ago

Excited to announce our partnership with @TimescaleDB! More accurate and efficient RAG from PostgreSQL powered by pgai + Voyage AI 🧵🧵🧵

VoyageAI's tweet photo. Excited to announce our partnership with @TimescaleDB! More accurate and efficient RAG from PostgreSQL powered by pgai + Voyage AI 🧵🧵🧵 https://t.co/Uos6ZcYGi1

105

smoyac retweeted

over 1 year ago

💪 Reclaiming Control: The Open-Source AI Stack 🤖 Proprietary LLMs like OpenAI and Anthropic are powerful, but they come with trade-offs: high costs, vendor lock-in, and privacy risks. Open-source AI changes this, giving developers control over their data, costs, and deployment—without sacrificing performance. The ecosystem is maturing fast, with tools that make it easy to build smarter AI apps without reliance on closed systems. Here’s our “Easy Mode” stack for building smarter AI apps. 😉👇 #PostgreSQL #OpenSource #AI #LLMs #Timescale #pgai

TimescaleDB's tweet photo. 💪 Reclaiming Control: The Open-Source AI Stack 🤖

Proprietary LLMs like OpenAI and Anthropic are powerful, but they come with trade-offs: high costs, vendor lock-in, and privacy risks. Open-source AI changes this, giving developers control over their data, costs, and deployment—without sacrificing performance.

The ecosystem is maturing fast, with tools that make it easy to build smarter AI apps without reliance on closed systems.

Here’s our “Easy Mode” stack for building smarter AI apps. 😉👇

#PostgreSQL #OpenSource #AI #LLMs #Timescale #pgai

179

150

30K

smoyac retweeted

over 1 year ago

🦙 pgai Vectorizer + @ollama: Open-Source AI Made Simple 🦙 pgai Vectorizer now supports open-source embedding models via @ollama, alongside @OpenAI and @VoyageAI models, making it easier for developers to build AI applications with open-source tools. What’s New? 1️⃣ Ollama Integration: Use open-source models like Nomic (nomic-embed-text), Sentence Transformers, and BAAI’s bge models, directly with PostgreSQL. 2️⃣Seamless Embeddings: Auto-generate and sync embeddings from your PostgreSQL data—no extra tools or pipelines needed. 3️⃣Full Open-Source AI Stack: Combine PostgreSQL (vector database) with open-source models to build a complete, open-source AI pipeline. Why It Matters? No pipelines, no extra tools—just seamless embedding generation with PostgreSQL and open-source models, all in one place. Build smarter and faster with a fully open-source AI stack. Try it out, and let us know what you’re building! 🔗LINK BELOW 👇 #PostgreSQL #OpenSource #AI #LLMs #Timescale #pgai

TimescaleDB's tweet photo. 🦙 pgai Vectorizer + @ollama: Open-Source AI Made Simple 🦙

pgai Vectorizer now supports open-source embedding models via @ollama, alongside @OpenAI and @VoyageAI models, making it easier for developers to build AI applications with open-source tools.

What’s New?
1️⃣ Ollama Integration: Use open-source models like Nomic (nomic-embed-text), Sentence Transformers, and BAAI’s bge models, directly with PostgreSQL.
2️⃣Seamless Embeddings: Auto-generate and sync embeddings from your PostgreSQL data—no extra tools or pipelines needed.
3️⃣Full Open-Source AI Stack: Combine PostgreSQL (vector database) with open-source models to build a complete, open-source AI pipeline.

Why It Matters? No pipelines, no extra tools—just seamless embedding generation with PostgreSQL and open-source models, all in one place. Build smarter and faster with a fully open-source AI stack.

Try it out, and let us know what you’re building!

🔗LINK BELOW 👇

#PostgreSQL #OpenSource #AI #LLMs #Timescale #pgai

164

14K