geek, scribe, coffee snob, and wanna-be cyclist. Contributor to Apache Spark and Delta Lake maintainer. Developer Relations at Databricks (opinions r my own)
Introducing the lab sponsors for the Databricks Grounded Reasoning Cup at #DataAISummit 2026: @AnthropicAI, @OpenAI, and @GoogleDeepMind.
Each lab is partnering with leading academic teams to build agents that tackle grounded reasoning over complex government data using the latest models and tooling.
Meet the teams, see the agents they’re bringing to the competition, and join us as they push the boundaries of enterprise AI reasoning live on stage! 🏆
We just launched Sites into Codex!
Software creation was always about more than writing code. Sites in Codex fundamentally gives the power of end-to-end software creation to every user, no matter their technical fluency.
These Sites are fully deployed to a URL, private to workspaces, come with authentication, can have static files, and can store dynamic data in databases.
It is in preview for business and enterprise teams and will be rolling out to all workspaces over the next day. Give it a try by typing @ Sites into Codex and ask it to build anything!
This project took a massive amount of effort across hundreds of people at OpenAI - proud that we were able to get this out and excited to see what you all build with it!
❤️ @Lovable now integrates with Databricks, providing a natural language interface that allows anyone, regardless of technical skills, to build live applications that can read and write data stored in Databricks.
See how teams can build dashboards, operational tools, internal chatbots, and other custom apps directly on governed Databricks data without ETL, replication, or sync jobs.
@jpodhoretz Hey there! It’s the first. This is at a mosque, where we were invited by the congregation to speak before prayer. In this context, head coverings are the respectful move. One of the fun parts of being a rep for NYC is learning different customs for so many communities. Cheers!
The future of databases is being built directly on top of object stores. We call this the Lakebase architecture.
For a long time, the industry treated data lakes strictly as analytical or offline storage. But the Lakebase architecture is changing that by enabling true operational databases directly on top of the lake.
I believe this is the future of data infrastructure. It is how every database, whether it's an OLTP system or a vector database, should be built moving forward.
Of course, delivering the stringent performance requirements for operational databases on top of object stores require some creative engineering. Really excited to see more real-world examples of this architecture emerging. The team at Zilliz just shared a piece on why they rebuilt their vector database using this exact approach, and it perfectly captures where the industry is heading.
Check it out here: https://t.co/xZrXtFiAzi
A decade ago, only tech giants had the engineering muscle to get real value out of their data. Databricks was founded to change that. The AI boom accelerated everything.
@hanlintang, Databricks' CTO of Neural Networks: "Companies are already deriving real value from agentic AI. I wouldn't have said that a year ago."
Seven researchers started this at Berkeley. Now we're powering the next chapter. @dhfreedman covers the full story for @Inc: https://t.co/NNXCygbgUV
How we build in a world where cloud limits are hit daily:
1. HA on both Compute and Storage
2. Hold control plane to same standards as data plane
3. Dependency minimalism
4. Control blast radius
5. Failure simulation
6. Measure everything
7. Build a world-class team with deep operating/scaling experience
Holy sh*t. Thomas Massie says he’s going to start publicly naming billionaires implicated in the Epstein files because Trump’s DOJ refuses to convict.
More courage than the rest of the GOP has combined.
#DataAISummit Session Spotlight 👇
Apache Spark™ 4.2: unified batch + streaming for AI workloads: feature pipelines, multimodal data, planner-level optimizations.
🎤 DB Tsai & Xiao Li | 🗓️ June 15–18 | 📍 San Francisco
🔗 Session details: https://t.co/LXLFbSuqDh
#ApacheSpark
James Talarico on abortion: “I trust Texas women to make decisions about their own bodies, to shape their own destinies. I don’t believe that’s a place for government. That’s a belief I hold not despite my faith, but because of my faith. Jesus never talks about abortion. The Bible is silent on abortion”
Headed to Data + AI Summit? Don't miss New Foundations of Delta Lake with Kernel + Spark's DSv2.
Delta on Spark DSv1 set the standard for the past 8 years. Rahul Potharaju & Tathagata Das will highlight the move to DSv2 and new foundations for the next decade.
🗓️June 15-18
📍San Francisco
Add it to your agenda 👇
https://t.co/qg9iXIAdPO
#DeltaLake #DataAISummit
And just like that, it’s completely VANISHED from the media.
A sitting congressman, Ted Lieu, said on the record the Epstein files are being blocked because they show Trump raped and threatened to kill children.
Lets make this viral again 👇
New Yorkers shouldn’t have to worry every time it rains. Ahead of this weekend’s storms, City workers have been out across all five boroughs clearing and inspecting catch basins in flood-prone neighborhoods.
Today I joined @NYCWater in Bushwick to clear a catch basin overflowing with debris and litter on Knickerbocker Avenue, one of the areas hit hardest by Wednesday’s storm.
If you see a clogged catch basin, call 311.
And if you’re able, help your neighbors by clearing leaves or litter from nearby grates before the rain arrives.