Data provenance is becoming a key parameter in the GenAI landscape. Lawmakers are demanding it and the cost of non-compliance is extremely high.
Luckily, we've built a toolkit for building an immutable and auditable data trail. No 3rd party needed and standard-agnostic. 👇👇👇
The tools are early but the architecture makes provenance capture simple and reliable. Check out the POC demo below. We'd love your feedback, especially if you're an agent builder, researcher or practitioner.
This is why we built our provenance toolkit and MCP POC at Datafund. It lets agents capture and store provenance at creation and write it to a decentralised backend for public scrutiny. Try the POC and tell us what is missing.
Data provenance is important because it tells us how the world is vs. how somebody wants us to see it. It tell us if the source was legitimate, if the capture was authentic or if the data reflected reality.
If we cannot verify origin, authenticity or context, data becomes a liability. And as AI agents move into research, finance, science and public systems, they will act on inputs no one can trace. We need provenance captured at the source.
We're pleased to welcome @crtahlin to our Decentralized AI panel discussion! 🤩
Data flows are the pillar of the emerging AI economy. It is important to think about how AI agents can work with data and how trust can be established in decentralized AI.
👇
https://t.co/zwwpjtKXCQ
AI needs a provenance infrastructure that unifies ethics, law and technical aspects. That's what we're building for with our latest provenance POC.
👇👇👇
A recent paper (https://t.co/K4JI1KDR8N) highlights that a growing issue with provenance tools is their piecemeal nature. 🧩
Dev teams can’t reliably assess safety, legality or copyright across thousands of mixed datasets. Creators, on the other hand, can’t track their content.
6️⃣
Provenance allows us to build agentic workflows that are transparent, auditable and grounded in verifiable evidence. 🔗🛠️
That’s what we’re building with our latest provenance toolkit. 🚀
📽️ Check out the demo in the post below. 👇👇👇
1️⃣
Agents are becoming the new knowledge workers, and we need to understand how they know what they know. 🧠🤖
The best AI models sit at ~50% of human baseline for agentic work. Most are even lower. But they are grinding toward human-level capability. 📈
5️⃣
Which sources influenced its outputs and to what degree?
Who has the power to influence the information it relies on?
In all fields built on knowledge work, traceability of evidence = credibility. 📜✔️
This goes double for AI in high-stakes domains. 🏥💶⚖️🏛️
We’re tackling this head-on. Auditable, immutable provenance data on decentralised storage.
Watch the demo and drop your feedback: https://t.co/fUvAMYlZPx
Data provenance is becoming a key parameter in the GenAI landscape. Lawmakers are demanding it and the cost of non-compliance is extremely high.
Luckily, we've built a toolkit for building an immutable and auditable data trail. No 3rd party needed and standard-agnostic. 👇👇👇
A recent paper from the @data_foundation warns that without provenance, we can’t tell where AI data comes from, how it changes, or who influences it.
https://t.co/qXQpCe2d74
Now, 20 years into the platform age we know that privacy is something that should've been built in from the start. But it seems that we're making the same mistake with data traceability in AI.
Provenance should be at the core of AI, not its afterthought if we want trust.