Tech Evangelist, Developer Advocate, Tech Writer, & Marketing Consultant || Owner @ Talk to Me About Tech ๐ค || Follow for tips using tech for success โจ
For a look into what I'm up to these days (all things #community and #PostgreSQL, as always - and now throwing a #rabbitrescue into the mix!), check out the latest from Doug Ortiz of TechBits - what a fun #podcast episode this was to record!
https://t.co/TmJsID4msL
๐ฅ ๐๐๐ฒ๐ฟ๐ ๐๐ฒ๐๐ฒ๐น๐ผ๐ฝ๐ฒ๐ฟ ๐ถ๐ ๐ป๐ผ๐ ๐ฎ๐ป ๐๐ ๐๐ฒ๐๐ฒ๐น๐ผ๐ฝ๐ฒ๐ฟ: ๐๐ป๐๐ฟ๐ผ๐ฑ๐๐ฐ๐ถ๐ป๐ด ๐ฝ๐ด๐ฎ๐ถ ๐ฉ๐ฒ๐ฐ๐๐ผ๐ฟ๐ถ๐๐ฒ๐ฟ ๐ฅ
๐ฝ๐ด๐ฎ๐ถ ๐ฉ๐ฒ๐ฐ๐๐ผ๐ฟ๐ถ๐๐ฒ๐ฟ is a developer tool that automatically creates and syncs embeddings right in your PostgreSQL database.
In other words: ๐ฝ๐ด๐ฎ๐ถ ๐ฉ๐ฒ๐ฐ๐๐ผ๐ฟ๐ถ๐๐ฒ๐ฟ ๐บ๐ฎ๐ธ๐ฒ๐ ๐ฒ๐บ๐ฏ๐ฒ๐ฑ๐ฑ๐ถ๐ป๐ด ๐บ๐ฎ๐ป๐ฎ๐ด๐ฒ๐บ๐ฒ๐ป๐ ๐ฎ๐ ๐ฒ๐ฎ๐๐ ๐ฎ๐ ๐ฐ๐ฟ๐ฒ๐ฎ๐๐ถ๐ป๐ด ๐ฎ๐ป ๐ถ๐ป๐ฑ๐ฒ๐ ๐ถ๐ป ๐ฃ๐ผ๐๐๐ด๐ฟ๐ฒ๐ฆ๐ค๐.
Available 100% open source (PostgreSQL license) and via our Cloud offering.
With pgai, we are building the first and only developer suite ๐ช๐ฏ๐ด๐ช๐ฅ๐ฆ ๐ต๐ฉ๐ฆ ๐ฅ๐ข๐ต๐ข๐ฃ๐ข๐ด๐ฆ for building AI applications.
Why are we doing this? To make every engineer an AI engineer, by embracing and extending PostgreSQL, the most loved database by developers.
Serve every developer, power the future of computing, and advance the human frontier.
Let's go!! ๐ค ๐ ๐
VECTOR DATABASES ARE THE WRONG ABSTRACTION. Hereโs a better way: introducing pgai Vectorizer, a new open-source PostgreSQL tool that automatically creates and syncs embeddings with source data, just like a database index.
โ Why vector databases fail
Vector databases treat embeddings as independent data, divorced from the source data from which embeddings are created, rather than what they truly are: derived data.
This pitfall means that many AI projects that start out as simple vector search implementations inevitably evolve into a complex orchestra of monitoring, synchronization, and firefighting.
๐ Keeping embeddings in-sync is hard
In an attempt to avoid stale embeddings, engineering teams have to build and maintain a maze of ETL pipelines, juggle multiple databases (vector DB, metadata store, lexical search), and manage complex queuing systems for updates.
Add monitoring for data drift, alert systems for stale results, and validation checks across systems - and you have a brittle infrastructure that inevitably breaks down, leading to stale embeddings and wasted engineering hours.
What if you could just use Postgres instead?
โ Pgai Vectorizer: Vector embeddings as database indexes
Pgai Vectorizer treats embeddings like database indexes. It automatically creates, updates, and maintains embeddings as your data changes. Just like an index, the database handles all the complexity: syncing, versioning, and cleanup happen automatically.
This means no manual tracking, zero maintenance burden, and the freedom to rapidly experiment with different embedding models and chunking strategies without building new pipelines.
๐คWhy did we build pgai Vectorizer?
Our team at @timescaledb built pgai Vectorizer because many developers regard PostgreSQL as the โSwiss army knifeโ of databases, as it can handle everything from vectors and text data to JSON documents.
We think an โeverything databaseโ like PostgreSQL is the solution to eliminate the nightmare of managing multiple databases, making it the ideal home for vectorizers and the foundation for AI applications.
โ๏ธHow does pgai Vectorizer work?
Check out the code snippet below โย it takes just 6 lines of SQL to put your embedding creation pipeline on autopilot with pgai Vectorizer!
Under the hood, pgai Vectorizer checks for modifications to the source table (inserts, updates, and deletes) and asynchronously creates and updates vector embeddings in an external worker.
๐งโ๐ป Sounds exciting! How can I get started?
Pgai Vectorizer is open-source under the PostgreSQL license and available for free to use on any PostgreSQL database. You can find installation instructions on the pgai GitHub repository (see end of post). Itโs also available as a managed service in Timescaleโs PostgreSQL cloud platform.
๐Learn more
[1] Pgai github repo: https://t.co/hut1MxuwPZ
[1] Technical explainer post: https://t.co/A9hOz482Rg
Share this post with your followers to let them know about pgai Vectorizer and comment your reactions and questions.
๐ Vector Databases Are the Wrong Abstraction. Hereโs Why.
They treat embeddings as standalone data, disconnected from their source, leading to outdated embeddings, constant sync issues, and endless maintenance.
Thatโs why we built pgai Vectorizer for Postgresโso every engineer can build AI applications without the headache of managing embedding pipelines. Creating an embedding pipeline is as easy as building an index. It stays synced with your data automaticallyโno extra tools, no stale embeddings.
๐ Whether youโre a busy AI engineer or just getting started, check out the blog to see how pgai Vectorizer lets you focus on building killer AI apps. ๐๐
#Postgres #pgaiVectorizer #Postgres #Data #AI #SQL #DevTools #AIDevelopment #PostgresExtensions #AIinSQL
๐๏ธ "What I really like about it is the consistency; that people really managed to build something great over 30 years." - Hans Jรผrgen Schรถnig, CEO of Cybertec.
In this interview, Hans shares what makes PostgreSQL special, but also what could improveโreview bandwidth. Have thoughts? Take the State of PostgreSQL survey before October 31st!
๐ https://t.co/DqFoQEdS2d
#PostgreSQL #CommunitySurvey #Timescale
@postgresql_007
We are thrilled to announce our partnership with @TimescaleDB for the 2024 State of Postgres Survey that aims to gain insights into usage of PostgreSQL provide ideas to how we can collaboratively enhance it. Shape the future of PostgreSQL, take the survey https://t.co/CLrwuNhBmO
Calling all #PostgreSQL users - if you haven't taken the 2024 State of Postgres survey yet, *please* take a moment to do so before Sept 30! All feedback - whether you're inexperienced or have been working with #Postgres for a very long time - helps. https://t.co/HtFGrIYM21
๐ Iโve just submitted my response to the State of PostgreSQL 2024 survey! ๐ ๐ It was super easy to fill out, and I loved how the questions balanced everything from Postgres use cases and features to community and extensions. The final focus on AI was a nice touch! 1/5
"...itโs easy to think of databases like a black box where you make sure your tables are indexed sensibly and your queries arenโt doing anything silly, and the rest just happens." So how does it work, really?
https://t.co/yHAWGegbzX
#data#datamanagement#postgres#postgresql
Missed the original announcement, but seems major changes are coming about now in Google's Android app store in consequence of them engineering their Play Store as an illegal monopoly.
https://t.co/K7GlrLC4Ck
#technews#business#businessnews#google#android
#CockroachDB (distributed transactional system) plans to retire its open source "Core" product in favor of a new Enterprise licensing structure for self-hosted users after the introduction of version 24.3 in November: https://t.co/UmU3m08OYA
#opensource#foss#technews
One of the things I'm working on for PostgreSQL 18 is executor performance. Today I pushed a patch to add JIT support to increase the performance of generating hash values for Hash Join. Thanks to @AndresFreundTec for letting me know this needed attention. https://t.co/CEyKebKC25
Everybody can support Postgres today.
Take 5 minutes and pop a quick email to your Microsoft and AWS sales reps, telling them that their company's support of open source postgres development matters to you.
Its good for field teams to hear this feedback; don't assume they know!
Join geospatial expert, @michalmigurski, for a 30 minute webinar on building a modern GIS stack with @SnowflakeDB and Felt this Wednesday, Aug 21. Reserve your spot now!
https://t.co/Qd6Rd28NtO
Using @SnowflakeDB for your cloud data warehousing needs? You can now combine #SnowflakeDB with #Felt for a modern approach to geospatial data management, visualization, & collaboration. Learn more at our upcoming free webinar this Weds, Aug 21: https://t.co/oyv0IPneRg
#geospatial #gis #snowflake #datalake #webinar