Xata 🦋

Verified account

@xata

Postgres at scale — with copy-on-write branching, data masking, separated storage & compute , 100% Postgres and your own cloud

🌏 Earth

Joined November 2020

417 Following

4.1K Followers

1.4K Posts

Pinned Tweet

about 1 month ago

One Postgres per tenant, per agent, per CI run. Storage built for a few busy volumes can't carry that. Here's why we built our own engine: Xatastor.

Tudor Golubenco

about 1 month ago

We needed a storage layer that can scale to a huge number of volumes (think millions) for our Postgres platform. Most of the existing storage systems are optimized for a few volumes with really high performance. But we needed the opposite: a very large number of mostly idle volumes. So we wrote our own. It’s called Xatastor and it enables Postgres-per-tenant use cases, “ephemeral” dbs for agents, free tiers, etc. Supports copy-on-write snapshots, clones, and thin provisioning. It’s based on ZFS and NVMe-oF as key technologies. Link with all the details in the first reply.

tudor_g's tweet photo. We needed a storage layer that can scale to a huge number of volumes (think millions) for our Postgres platform.

Most of the existing storage systems are optimized for a few volumes with really high performance. But we needed the opposite: a very large number of mostly idle volumes.

So we wrote our own.

It’s called Xatastor and it enables Postgres-per-tenant use cases, “ephemeral” dbs for agents, free tiers, etc. Supports copy-on-write snapshots, clones, and thin provisioning.

It’s based on ZFS and NVMe-oF as key technologies. Link with all the details in the first reply.

2

30

13

14

8K

0

17

3

10

6K

1 day ago

Once the new store had a real retention window, we deleted the vendor client, the routing, the override header, the flag, the config. Full write-up: https://t.co/jZnrmAGqbz

0

0

0

0

89

1 day ago

We rebuilt the metrics view for every Postgres branch on @xata. Out of a central observability vendor, into a per-cell @VictoriaMetrics stack. Six weeks of work, no user-visible downtime in the console. Here's what we learned. 🧵

1

7

2

3

789

1 day ago

Best part of the cutover: both backends in code, behind a flag, with a header to flip backends per request. Same chart on both, side by side, for a week. That's how we caught a double-count where pod aggregates were summing with the per-container series.

1

0

0

0

96

Who to follow

Verified account

design engineering lead @cloudflare | teaching https://t.co/wStdLbgyHC | writing https://t.co/CttP8HWkYS

Verified account

The platform for devs who just want to ship. Powered by sandboxes that let you deploy any code with confidence.

Verified account

@Flightcontrolhq

A Vercel-style interface for AWS. Deploy apps 2-6x faster with ultimate flexibility and scalability, because you get native AWS.

7 days ago

Repo: https://t.co/ukYvCPJ1Ht

0

0

0

0

241

7 days ago

We open-sourced a Next.js + Postgres starter with 6 Claude Code skills. Each skill wraps the Xata CLI for one workflow: branch, migrate, complete, rollback, clone-with-anonymization, setup. Tutorial: https://t.co/bXG8F2osKp

1

9

2

4

505

15 days ago

@monicasarbu and the Xata team are at AWS Summit Hamburg today, in the AWS Startup Zone, Hall 4. If you're thinking about Postgres for agent scale, come find us.

xata's tweet photo. @monicasarbu and the Xata team are at AWS Summit Hamburg today, in the AWS Startup Zone, Hall 4.
If you're thinking about Postgres for agent scale, come find us. https://t.co/MhZ23seFhW

0

2

1

0

138

15 days ago

DeltaX is a Postgres extension that adds columnar storage and time-series compression. Data lives in regular Postgres tables. pg_dump, replication, and crash recovery all keep working without change. Under the hood: → Type-specific codecs (Gorilla XOR, delta-of-delta, dictionary, block-LZ4) → Vectorized Rust execution, bypasses per-row ExecQual → Segment pruning with bloom filters → Parallel aggregation → Shared-memory blob cache Status is alpha, PostgreSQL 17 and 18. Star the project: https://t.co/usGwI4XjLY

1

2

1

1

1K

15 days ago

DeltaX is now public: the TimescaleDB alternative with Apache 2.0. Preliminary ClickBench: ~4.7× faster on analytical queries.

xata's tweet photo. DeltaX is now public: the TimescaleDB alternative with Apache 2.0.

Preliminary ClickBench: ~4.7× faster on analytical queries. https://t.co/Uhg3xUdktD

2

24

2

7

1M

15 days ago

Built quietly over the past months. Star the repo and run the benchmark: https://t.co/xdmSHAXDU2

Tudor Golubenco

15 days ago

There’s something I’ve been feverishly working on, and I just turned the repo public: pg_deltax (δx) - Fast time-series extension for PostgreSQL. Basically an Apache-licensed Timescale alternative. GitHub link in the thread.

tudor_g's tweet photo. There’s something I’ve been feverishly working on, and I just turned the repo public:

pg_deltax (δx) - Fast time-series extension for PostgreSQL. Basically an Apache-licensed Timescale alternative.

GitHub link in the thread. https://t.co/50MHVSzuAc

7

76

7

24

12K

0

2

1

0

351

21 days ago

The decode-time table filter is a community contribution from @blakewatters add_tables and filter_tables go straight through to wal2json. Thanks Blake. Release: https://t.co/8kxoOVma0O

0

0

0

0

112

21 days ago

pgstream v1.0.2 is out. Post-snapshot catch-up no longer stalls on bulk INSERT or DELETE tables (Postgres sink coalesces flushes now). You can also filter tables at decode time inside the source Postgres via wal2json. Plus Go 1.26.3 security fixes; v0.9.12 backports that one too.

1

4

3

3

895

27 days ago

Every Xata Postgres branch now ships with a managed PgBouncer endpoint, included. A Postgres connection costs about 5MB of backend memory. Fine when traffic comes from a handful of app servers. Breaks when traffic comes from serverless functions, edge workers, or AI agents that open a connection per request. Postgres wants fewer, long-lived connections. Modern apps produce many short-lived ones. A pooler reconciles that. We chose to ship one PgBouncer pod per branch, on the same node as the database, in the same memory budget. Pool size auto-tunes to 0.9 of max_connections and re-tunes when the instance changes. If you connect directly to Postgres, append -pooler to your branch ID: > postgresql[:]//user:pass@branch-id-pooler[.]us-east-1[.]xata[.]sh:5432/postgres One-line change in your connection string. Branching gives you cheap copies of Postgres. Pooling gives those copies somewhere to take traffic. We needed both. https://t.co/4XvMpERSQS

0

5

4

0

1K

xata retweeted

Tudor Golubenco

about 1 month ago

We needed a storage layer that can scale to a huge number of volumes (think millions) for our Postgres platform. Most of the existing storage systems are optimized for a few volumes with really high performance. But we needed the opposite: a very large number of mostly idle volumes. So we wrote our own. It’s called Xatastor and it enables Postgres-per-tenant use cases, “ephemeral” dbs for agents, free tiers, etc. Supports copy-on-write snapshots, clones, and thin provisioning. It’s based on ZFS and NVMe-oF as key technologies. Link with all the details in the first reply.

tudor_g's tweet photo. We needed a storage layer that can scale to a huge number of volumes (think millions) for our Postgres platform.

Most of the existing storage systems are optimized for a few volumes with really high performance. But we needed the opposite: a very large number of mostly idle volumes.

So we wrote our own.

It’s called Xatastor and it enables Postgres-per-tenant use cases, “ephemeral” dbs for agents, free tiers, etc. Supports copy-on-write snapshots, clones, and thin provisioning.

It’s based on ZFS and NVMe-oF as key technologies. Link with all the details in the first reply.

2

30

13

14

8K

about 1 month ago

We rebuilt the storage engine behind Xata Cloud. And we're talking about why. Architecture deep-dive by @tudor_g https://t.co/g9aUOoKMYJ

3

78

10

19

1M

xata retweeted

about 1 month ago

We’re seeing this firsthand with AI platforms we partner with: 👉 every agent needs its own isolated Postgres DB 👉 at scale, that’s millions of databases 👉 many on free tiers → cost matters a lot That combination breaks traditional storage. Most systems are built for a few always-on volumes. Agents need the opposite: millions of mostly idle ones. So we built Xatastor. It enables Postgres-per-tenant use cases, ephemeral DBs for agents, free tiers, and supports copy-on-write snapshots, clones, and thin provisioning.

monicasarbu's tweet photo. We’re seeing this firsthand with AI platforms we partner with:

👉 every agent needs its own isolated Postgres DB
👉 at scale, that’s millions of databases
👉 many on free tiers → cost matters a lot

That combination breaks traditional storage.

Most systems are built for a few always-on volumes.
Agents need the opposite: millions of mostly idle ones.

So we built Xatastor.

It enables Postgres-per-tenant use cases, ephemeral DBs for agents, free tiers, and supports copy-on-write snapshots, clones, and thin provisioning.

1

14

5

3

1K

about 1 month ago

Richard on what changed and what didn't: https://t.co/vbEb55RxCv

0

1

0

1

116

about 1 month ago

AI codes. Humans engineer. AI is already good at boilerplate, search, command line, simple fixes with feedback loops. Outsource it. Engineers still own decisions, distillation, organizational context, code review, and breaking out of incorrect assumptions.

2

4

0

0

297

Last Seen Users on Sotwe

Trends for you

Most Popular Users