If you want to trust AI, you have to own your data.
An operator reads an AI-generated recommendation on a screen and has seconds to act. The recommendation is clean and confident but the operator has no sense of how well it is grounded in truth.
That gap, between how sure the machine sounds and how sure it should be, is the real barrier to using AI in a fight. We have spent years fixing how we manage our data and almost none teaching our systems how to calibrate trust.
The good news: the hard part is within reach. We need to own our data and build trust into what stands on it.
If you want to own your data, you have to do something about it.
The Department of War diagnosed the data problem correctly. Our information sat locked in silos that never spoke to each other, built for a slower era. The answer is a data architecture that frees that information and governs who can reach it.
That foundation is only half built, and the will to finish it is fragile under pressure to jump straight to AI analytics. This work is unglamorous. It is the plumbing beneath everything else, easy to underfund and easy to declare prematurely finished.
Skip it and you handicap AI before you ever see its potential. That’s because the models are only as good as the data feeding it. Starving the foundation hollows out the very capabilities we are chasing.
That said, acting on AI initiatives shouldn’t wait until the perfect data layer emerges. AI can deliver trust on missing data by doing three things. It recognizes the gap, because the architecture tracks what feeds each answer and can see when the evidence is thin. It lowers its confidence or declines, rather than bluffing to fill the silence. And it tells the operator where the thin ice sits, so a human can decide whether to act, gather more, or override.
The Harsh Reality.
Even with data in hand, AI initiatives may stall, and the cause is easy to miss. The system sounds exactly as sure when it is guessing as when it knows. A fabricated answer and a sound one arrive identical, equally fluent and equally confident.
Who is going to take responsibility for bad AI? Everyone admits these systems hallucinate. Everyone agrees better data yields better results. Neither fact helps the operator who needs to know whether this answer, right now, can be trusted. The research community frames the goal more sharply. The aim is not to make people trust AI more. It is to make their trust accurate. A machine that sounds as sure when guessing as when it knows is a fortune teller, and it breeds mistrust at scale.
So before you field anything, ask the governance questions:
- Who is using your data, and can you control access?
- Can you see what data trained the model?
- Do you know how the model reaches its answer?
- Are you willing to own what it decides?
If you cannot answer these, you do not have trusted AI.
Trust you can measure, not trust you hope for
The goal is not to make people trust AI more. It is to make their trust accurate. Trust that runs ahead of what a system can do leads to misuse. Trust that falls short of it leads people to ignore a tool that would have helped.
We cannot fix this by stapling a confidence score onto a finished black box, any more than we create insight by bolting analysis onto disconnected data. Trust has to be built in from the start. That is the same lesson the federated data foundation taught us, one level up.
Calibrated AI is the result. Every answer comes with an honest read of how much to believe it and why. It flags thin ground instead of bluffing past it. Trust becomes something you can measure for this answer, on this task, for this user.
How it actually works
It starts at the data layer, and isn’t tagged by hand. The architecture does it as data moves. A source carries its origin, a timestamp records each step, and handling rules apply by policy. The model then carries its own uncertainty instead of returning one tidy answer. It weighs how strong its evidence is and lowers its confidence, or declines, when support is thin. The reasoning stays in the forefront, and the operator can always push back.
The bottom line
Own your data. Build trust into what you put on top of it. The warfighting advantage goes to the side whose people can tell, in the moment, when to trust the machine and when to override it. Is that us?
If you are fielding AI, which of those governance questions can you answer today?
The question nobody wants to answer.
The Department of War diagnosed the first problem correctly. Our data sat scattered across silos that never spoke to each other, built for a slower era. The answer is a federated data architecture that frees information from those silos using governance and trust rules.
But that data foundation is only half built, and the will to finish it is fragile given mandates to move immediately to AI implementation. This work is unglamorous. It is the plumbing beneath everything else, easy to underfund and easy to declare finished before it is. The federated data architecture demands the funding and the patience to be completed, because the next problem cannot be solved without it.
That next problem is already arriving, and it invites a mistake. As the rush to put AI first builds, leaders are tempted to treat data as solved and pour their attention into increasingly sophisticated models. However, AI is only as good as the data fed into it. Move on from the data architecture too soon and we hollow out the very capability you are chasing. You will not see its full potential without a solid data foundation.
Here is the part that’s largely unaddressed. Even with the data fully in hand, military AI stalls, and the cause is easy to miss. The system sounds exactly as sure when it is guessing as when it knows. A fabricated answer and a sound one arrive identical, equally fluent and equally confident. Everyone admits these systems hallucinate. Everyone agrees that better data yields better results. Neither fact helps the operator staring at one recommendation who needs to know whether this answer can be trusted right now.
Trust has to be built into the system from the start. Adding a confidence label once the model is finished does not create it, any more than bolting analysis onto disconnected data ever created insight. This is the same lesson the federated data architecture taught us, applied one level up.
Calibrated AI is the result. It delivers, with every answer, an honest read of how much to believe it and why, and it flags thin ground instead of bluffing past it. Researchers call it AI that reports its own uncertainty. Trust becomes something you can measure for this answer, on this task, for this user, rather than a slogan about the system as a whole.
Building it in starts at the data layer, but nobody tags this by hand. The federated data architecture does it automatically as data moves. When a source connects, it inherits that origin, and a timestamp records each step it passes through. Classification and handling rules apply by policy, the way email already marks sensitivity without a person deciding each time. Older records get caught up gradually, prioritized by mission need.
The model then carries its own uncertainty instead of returning one tidy answer. It weighs how strong its evidence is and lowers its confidence, or declines, when support is thin. The reasoning stays readable, and the operator can always push back.
This is where the federated data architecture and the AI meet. A model can calibrate its confidence honestly only if it can see the evidence behind a question. Sparse or stale data yields a guess dressed as an answer. Full, well governed data access, the thing the federated data architecture exists to provide, is what lets a system know when it stands on solid ground and when it does not. The federated data architecture is what makes results worth calibrating. The work ahead is to ground the frontier labs’ models in our own data and instrument them so a person can check every answer. That is well within reach, and it does not require building a better model than they have.
The need is plain. Decisions get made one at a time, by people who answer for them, against an adversary working to push us into the gaps where our data runs thin. A confident wrong answer is worse than no answer. Keeping a human in authority means little if that human cannot see what they are authorizing. This is the adoption accelerator the Department’s own research enterprise is now pursuing through its work on calibrated trust.
Finish the federated data architecture already underway rather than declaring it done, and require calibrated trust in what gets built on top of it. Make the systems we field show their confidence, show their evidence, and admit when they do not know. The programs that ask for this in their requirements will get it. The ones that ask only for accuracy will keep fielding fortune tellers. The advantage goes to the side whose people can tell, in the moment, when to trust the machine and when to override it.