Harsha Chintalapani

about 22 hours ago

ChatGPT on database schemas: 10% accuracy. Same model + Collate semantic context: 77%. Context tells AI where data lives. Semantics tells AI what data means. Both layers matter. Collate named to the DBTA 100 — Companies That Matter Most in Data, 2026. https://t.co/yrs3m3jgot #SemanticContext #ContextLayer #AIforData #DBTA100

0

4

2

0

28

d3fmacro retweeted

1 day ago

Your AI is missing context that lives in Confluence pages, shared drives, documents, and the institutional knowledge your teammates carry but never wrote down. Live demo June 18, 9 am PDT: Collate Context Center. 30 minutes. Register Now: https://t.co/K8dxOhJAQn #Collate #OpenContextLayer #DataGovernance #OpenMetadata #DataEngineering

0

4

3

0

35

d3fmacro retweeted

2 days ago

Check out the latest episode of the Collate Product Demo Series. This shows many of the powerful data quality capabilities in Collate that eliminate the errors that cost your business money. Also see how to get an analytics head start with Collate AI Analytics. 🎥👉Watch here: https://t.co/Ttqhsrw8fg #DataQuality #OpenMetadata #DataEngineering #Collate #AiAnalytics

0

3

0

69

d3fmacro retweeted

Warriors on NBCS @NBCSWarriors

5 days ago

Don't compare that man to Steph ever again

469

57K

5K

1K

3M

d3fmacro retweeted

StarRocks @StarRocksLabs

7 days ago

🆕 A while back, @open_metadata added the StarRocks connector. This makes it easier to bring metadata into OpenMetadata, including schemas, tables, column types, and view definitions via StarRocks’ MySQL-compatible interface. Setup guide: https://t.co/OIhAFgdfso

1

8

2

0

210

d3fmacro retweeted

Roman Medvedev

@roomavm

7 days ago

I audited the unofficial OpenMetadata CLI from romamo/openmetadata-cli against 22 Critical CLI Agent Spec failure modes. Even though it was built during a hackathon, it has the best score of all the tools tested. Result: 1.5/3 average. Readiness: 12/15 [B]. This CLI already has a serious agent-first foundation: JSON envelopes on normal commands, schema introspection, dry-run mutation previews, MCP mode, bundled agent skills, non-TTY SSO protection, and prompt-injection tagging for external data. The remaining gaps are mostly contract gaps, not conceptual gaps. The two failing Critical checks were: §43 output size (no max-output flag, truncation metadata, or schema-declared output cap) and §74 credential scopes (the schema does not declare required_scopes and there is no check-permissions preflight). The partial failures matter too: timeout maps to GENERAL_ERROR, credential expiry maps to AUTH_REQUIRED, invocation errors can bypass JSON, and dry-run output lacks affected scope/effect fields. Bottom line: omd is close to being highly agent-ready. The next step is making every failure and credential boundary machine-readable. Full report: https://t.co/6al9QXPe4y @open_metadata

0

8

3

0

125

d3fmacro retweeted

9 days ago

Regulated banks don't get to choose whether they govern data. They govern or they fail audits. @unionbankph: 12.5M customers, 38K+ assets, lineage across @Snowflake + SageMaker + QuickSight. Started with Excel. Moved to @CollateData. Cirene Simbahan at #CollateSummit. June 10. Free. https://t.co/6usRIsUZR2 #DataGovernance #DataLineage

0

3

2

0

28

d3fmacro retweeted

Mitchell Hashimoto

@mitchellh

14 days ago

Supply chain attacks and OSS sustainability go hand in hand. I've semi-seriously joked for years that OSS upstreams should periodically purposely inject full vulns into their code and let downstreams fuck around and find out. Downstreams can pay to get the non-FAFO version. The not joke part is simply that OSS maintainers aren't a supply chain. OSS maintainers are not responsible for monitoring CVEs (because, they are not a supply chain). OSS maintainers are not at fault when bad shit happens to downstreams, because basically every OSS license (MIT, Apache, GPL, etc.) literally says: the software is provided "as-is, without warranty." You get what you pay for (that is to say: absolutely nothing!) Now, the joke part is that I do believe there is an ethical obligation to try to prevent harm downstream. But "try" is the key word. So, this isn't a serious proposal. But, if you're using OSS code and you're not paying for a license with a contract that promises some kind of warranty, you have no supply chain. You (the downstream user of an OSS lib) ARE the supply chain. To use a metaphor: physical goods have a real supply chain. Car manufacturers, chips, clothes, toys, etc. You have a signed commercial agreement with all your suppliers that promises quantity AND quality and blowback if either are missed. That's a supply chain. If someone puts some chips on the side of the road with a "FREE" sign, then you integrate those into a product, then find out those chips are hacking customers, it's your fault, not the person who dropped them on the side of the road.

48

2K

169

371

140K

d3fmacro retweeted

Jason Fried

@jasonfried

16 days ago

Bragging about how much software you’re shipping with AI is like holding down the shutter button and bragging about how many photos you took.

233

6K

673

574

241K

d3fmacro retweeted

16 days ago

@Scout24 runs @Collatedata, built on @open_metadata, in production and calls it a "context catalog." AWS, Starburst, Collate. MCPs for GitHub + Confluence. AI-generated docs, human-certified. PII tags propagate from Collate to the query engine for agent-level governance. Angelita Frozza Sanches presents the full build at Collate Summit, June 10. Free: https://t.co/tjDOwqkspP #OpenMetadata #DataEngineering #AIAgents #ContextLayer

0

2

1

0

67

d3fmacro retweeted

16 days ago

"Data catalog" became a bad word at @Scout24 after their legacy catalog failed. After modernizing with @CollateData, Head of Data Infrastructure and Governance Angelita Frozza Sanches implemented their "context catalog." The goal: give AI agents and people shared meaning, ownership, and trust signals for every data interaction. She presents the full build at Collate Summit, June 10. Free: https://t.co/rrP7aQaVDb #DataGovernance #ContextLayer #AIAgents

0

5

3

0

90

d3fmacro retweeted

17 days ago

Lukas Patzke, Analytics Architect @Airbus, is sharing his knowledge on an expert panel at Collate Summit on June 10. "Governance in an AI-First World" Experts will discuss trends in data and AI governance as self-service analytics and self-service AI scale at organizations. June 10. Free. 🔗 https://t.co/wKFETzHpfc #DataEngineering #DataGovernance #AIGovernance #AIAnalytics #CollateSummit

1

5

2

0

41

d3fmacro retweeted

17 days ago

Lukas Patzke, Analytics Architect @Airbus, is sharing his knowledge on an expert panel at Collate Summit on June 10. "Governance in an AI-First World" Experts will discuss trends in data and AI governance as self-service analytics and self-service AI scale at organizations. June 10. Free. 🔗 https://t.co/tjDOwqkspP #DataEngineering #DataGovernance #AIGovernance #AIAnalytics

1

4

1

0

104

d3fmacro retweeted

20 days ago

Context isn’t optional for enterprise AI. It’s the difference between answers and accurate, trustworthy decisions.

This new white paper from industry expert, @mikeferguson1, explores how leading organizations are building AI-ready data foundations with unified semantics and governance 👇

What you’ll learn:

→ How a unified knowledge graph eliminates semantic chaos and gives AI consistent context
→ Why AI-powered governance is key to enforcing quality, security, and policies at scale
→ How to build reusable, AI-ready data products faster with metadata-driven workflows
→ What a semantic system of record looks like - and why it acts as persistent memory for AI

If you’re serious about reducing hallucinations and scaling AI responsibly, this is a must-read.

Download here👇
🔗https://t.co/ZYoYurCGMK

#DataGovernance #AI #Metadata #KnowledgeGraph #AIAgents #DataEngineering #SemanticLayer

0

3

1

0

20

d3fmacro retweeted

21 days ago

Most governance failures mean audit findings. In clinical genetics, the stakes are higher. Dan Kostecki from @AmbryGenetics at Collate Summit '26: governed data product lifecycle, PHI-free production environment, CAP/CLIA + HIPAA + FDA Part 11 compliant. June 10 | Virtual | Free https://t.co/ZjguEWTQRE #CollateSummit #DataGovernance #DataQuality #DataEngineering #AIinProduction

0

4

2

0

27

d3fmacro retweeted

21 days ago

Before you deploy an AI agent, ask yourself one question: Does your data have an agreed-upon meaning that a machine can actually read? If the answer is no, the agent will guess. Every time. Semantic intelligence is the trust layer that fixes this. It turns metadata into machine-readable context that both humans and AI can rely on. At @CollateData Summit, we're going deep on how to build it. June 10. Free. 🔗 https://t.co/tjDOwqkspP #SemanticIntelligence #AI #DataGovernance

0

5

1

0

81

d3fmacro retweeted

24 days ago

This Wednesday at 11 AM PDT, our CEO & Co-Founder @suresh_m_s is on stage with @DataSciConnect. The question on the table: as enterprises push AI into production, how do you ensure outputs reflect the right data, tone, and constraints, every time? It's a context architecture question. And the answer is becoming foundational to trustworthy AI. Free. One hour. Worth it. Register here 👉️ https://t.co/Jcekxsh7yV #ContextLayerAI #DataScienceConnect #SemanticIntelligence #RAG #GenerativeAI #AIAgents #DataGovernance #OpenMetadata

0

1

0

77

d3fmacro retweeted

dax

@thdxr

27 days ago

how many times are you guys gonna build this

331

8K

198

710

377K

d3fmacro retweeted

26 days ago

In this month's Product Demo, Dale Kim and James Nguyen showed how Collate, the enterprise platform built on OpenMetadata, takes you from "I need data" to "here's the data I need, who owns it, and why I can trust it."

Your data teams can spend hours hunting for the right datasets, and even longer figuring out if they can trust what they find. You can't succeed with your data initiatives if you're constantly afraid of garbage-in-garbage-out.

They covered:
* A brief demo of data discovery and trust signals in Collate
* Why discovery is only the start, and what else you need to make data usable
* A quick walkthrough and discussion of Data Contracts in Collate

👉🎥Watch here: https://t.co/9IivJZlfPw

#AI #datadiscovery #dataquality #datalineage #datagovernance #dataengineering #datastewards #datacontracts

0

4

2

0

43

d3fmacro retweeted