Every procurement vendor is rushing to ship AI features. Autonomous sourcing agents. AI-powered risk scoring. Natural language analytics. The marketing is loud, and the pressure on procurement leaders to adopt is real. But here is the question almost nobody is asking: does the platform have the data foundation to make any of it actually work?
At Opstream, we work with procurement, finance, legal, and IT leaders across mid-market and enterprise organizations. The pattern we see is consistent: teams that invest in AI features before solving their data problem end up with expensive automation that produces unreliable results. The organizations that get AI right are the ones that solved the data problem first.
This post breaks down why clean, unified data is the prerequisite for procurement AI, what happens when organizations skip that step, and how to evaluate whether a platform can actually deliver on its AI promises.

The short answer: because the board is. According to the 2025 Gartner AI Adoption in Supply Chain Survey, 64% of supply chain organizations now have a formal or informal AI strategy. But 84% have limited AI deployment or are still in the pilot phase. The gap between intention and execution is enormous.
The challenges are not primarily technical. According to the same survey, 47% of organizations cite low data quality and 45% cite cultural readiness as the major obstacles to scaling AI investments (Gartner, Top Insights on AI for Chief Procurement Officers, May 2026). In other words, most teams are trying to deploy AI on data foundations that cannot support it.
The pressure is compounding. The Hackett Group’s 2026 Procurement Key Issues survey reports that procurement workloads are expected to grow by 8% this year while staffing shrinks by 1%. Top-performing teams already spend 2x more on technology per FTE than their peers. Only 19% of organizations have actually implemented GenAI tools for procurement tasks, according to Gartner’s 2025 Digital Transformation Survey. The message is clear: most teams are investing in AI before they have solved the data problem that determines whether AI works.
This has created a predictable market dynamic. Vendors across the procurement technology landscape are racing to add AI capabilities. Source-to-pay suites are bolting on orchestration and AI layers. Orchestration platforms are expanding into end-to-end S2P coverage. Art of Procurement’s State of Procurement Orchestration 2026 report found that nearly all providers now claim some level of AI capability. But the differences in substance are substantial.
As that same report notes: “Procurement decision makers must ask themselves how deeply AI is embedded in each platform and how autonomously it can operate. At the leading edge, this means agentic AI, with agents that conduct supplier risk assessments, perform first-pass contract reviews, and route work based on contextual intelligence rather than static rules.”
The distinction between “AI as a feature” and “AI as a foundation” is becoming the most important evaluation criterion in procurement technology. And that distinction starts with data.
Most procurement organizations run on a patchwork of systems. ERPs, accounts payable platforms, contract lifecycle management tools, GRC systems, spreadsheets. Critical procurement information is trapped across these systems with no unified view.
The problems go deeper than fragmentation. In a survey Opstream conducted in Q4 2025 across procurement and finance leaders:
Gartner’s research reinforces the pattern. Their Cool Vendors in Sourcing and Procurement Technology 2025 report found that “many procurement organizations struggle to realize the full potential of AI due to inconsistent, incomplete or poor-quality data. Without robust data governance practices in place, AI models are often trained on unreliable information, leading to inaccurate insights and suboptimal decision making.”
This is the environment into which organizations are deploying AI. And the results are predictable.
Consider what happens when an AI agent tries to identify duplicate suppliers across systems. In one ERP, a vendor is listed as “Acme Corp.” In the AP system, it appears as “ACME Corporation.” In the CLM, the contracting entity is “Acme Holdings LLC.” Without semantic understanding of vendor entities, an AI system treats these as three separate suppliers, producing inaccurate spend analysis, missed consolidation opportunities, and incorrect risk assessments.
Or take spend categorization. When the same categories are coded differently across systems, AI-generated analytics become unreliable. A category labeled “Professional Services” in one system might overlap with “Consulting” in another and “Advisory Services” in a third. The AI dutifully produces charts and recommendations, but they are built on inconsistent inputs.
As Gartner puts it in their Top Insights on AI for Chief Procurement Officers report: “High-quality, well-governed data is the single biggest differentiator in ROI on AI initiatives.” The average procurement function is capturing just 64% of the value it could be getting from analytics, according to Gartner’s research on procurement analytics performance. And access to and quality of procurement data explains 30% of analytics success, more than talent (18%) and technology (9%) combined.
AI without good data is just expensive automation.
The procurement orchestration category emerged to address fragmented processes and rigid workflows. These platforms created a single entry point for procurement requests and automated the routing and approval process. That was a meaningful step forward.
But most orchestration platforms were built as workflow layers, not data layers. They connect systems and move information between them. They do not understand the information they are moving. (Procurement leaders increasingly describe these platforms as dashboards on top of broken systems rather than solutions that fix the underlying problem.) (We explored this structural shift in more detail in The Rise of Data Activation.)
This creates two structural limitations:
The integration ceiling. Traditional integrations hit limits when data is not normalized. AI cannot reason over inconsistent information. When the same supplier, contract, or category is represented differently across connected systems, the orchestration layer faithfully passes through the inconsistency. It moves data. It does not reconcile it.
The configurability trap. Platforms with deep configurability often require significant implementation effort and ongoing maintenance. Art of Procurement’s 2026 report documented instances where configuration changes in complex orchestration platforms caused suppliers to bypass procurement’s risk review entirely. The report warns: “Multiple practitioners questioned the point at which the effort to build and maintain complex workflows begins to negate the efficiency gains orchestration is supposed to deliver.”
This is not a criticism of orchestration as a concept. Orchestration is necessary. But orchestration built on a rigid data model, or one that depends on third-party integration platforms to normalize data, will struggle to support the agentic AI capabilities that are rapidly becoming table stakes in this market.
The AOP report’s evaluation framework reflects this shift. It identifies five dimensions for evaluating procurement platforms in 2026: orchestration depth, S2P breadth, AI depth, operational maturity, and ability to execute. Of these, AI depth is the most difficult to assess and the most consequential to get right.
The stakes just got higher. According to the 2025 Gartner Procurement Organization Design for the Future Survey, 67% of procurement organizations are discussing or planning to implement agentic AI. But 33% are dissatisfied with the ease of implementation, and 21% worry about the rework it would introduce.
This matters because agentic AI operates with greater autonomy than generative AI. GenAI produces recommendations for a human to evaluate. Agentic AI makes decisions and executes tasks independently, escalating to humans only when it encounters exceptions. That autonomy is powerful, but it also means errors in the data foundation compound faster and with less human oversight to catch them.
Gartner is explicit on the sequence: “As AI agents emerge, they will first automate tasks where high-quality data is available, making data quality critical.” The agents will not wait for you to clean your data. They will automate where they can and leave the rest untouched. Organizations with fragmented, unnormalized data will find that agentic AI only works on a fraction of their procurement workflows, while organizations with unified data foundations get compounding automation across the board.
We hear this directly from procurement leaders in regulated industries. One CPO at a manufacturer with 17,000+ employees described wanting to build agentic AI onto all procurement workflows, but recognized the prerequisite: “Have we entered the supplier name correctly in all the systems it sits in, so that if we’re going to put AI on it, it doesn’t get confused? Because it looks for the vendor we put the spelling in, and then it misses the other four that had a spelling mistake, and so it only has half the picture.”
That is the data quality test for agentic AI in a single sentence. If your AI agent only has half the picture, its autonomous decisions are half-informed. In a traditional workflow, a human catches the gap. In an agentic workflow, the gap propagates at machine speed.
Gartner predicts that by 2028, 40% of procurement teams will have implemented at least one AI agent (Gartner, Predicts 2025: Procurement Addresses Data Challenges). The organizations that solve their data problem before that deadline will be the ones whose agents actually work.
A data-first approach inverts the typical procurement technology stack. Instead of starting with workflows and bolting on integrations, it starts with a unified data foundation and builds intelligence on top of it.
This means solving four problems before deploying AI agents:
1. Semantic entity resolution. The platform automatically identifies and merges duplicate suppliers, contracts, and purchase orders across systems. Not through static matching rules, but through contextual AI that understands “Acme Corp,” “ACME Corporation,” and “Acme Holdings LLC” are the same vendor entity.
2. Dynamic taxonomy mapping. Category codes, GL accounts, and spend classifications are harmonized in real-time using contextual understanding. When Finance calls it “Professional Services” and Procurement calls it “Consulting,” the data layer reconciles both into a coherent model that AI can reason over.
3. Custom data model generation. Every enterprise has unique business processes, approval hierarchies, and compliance requirements. A data-first platform builds a unique data model per organization based on how that business actually operates, not a one-size-fits-all template.
4. Multi-system orchestration with context. The platform reads and writes across all major ERPs and P2P systems natively, but does so with semantic awareness. It does not just pass data through. It understands what the data means in the context of each system and the organization’s policies.
When these foundations are in place, the AI capabilities built on top of them are qualitatively different. An agent evaluating vendor risk can draw on unified data across every system the organization uses. An autonomous workflow can auto-populate 87% of a typical intake form (which averages 70 fields and normally takes a requester 30 to 45 minutes) because it has access to historical POs, past requests, vendor defaults, and the organization’s data model. A natural language analytics query about supplier spend trends can return reliable answers because the underlying data is semantically consistent.
This is the difference between AI that generates plausible-looking outputs and AI that generates outputs you can act on.
Procurement technology is moving through three distinct eras. The integration era connected systems and required manual rule configuration. The orchestration era automated workflows with rigid logic. The agentic era introduces intelligent agents that make contextual decisions, with humans intervening only where judgment is required.
This is not a speculative timeline. Gartner predicts that by 2028, 40% of procurement teams will have implemented at least one AI agent. Gartner’s forecast for agentic AI in supply chain management software projects the market growing from $2 billion in 2025 to $53 billion by 2030, a five-year CAGR of 93.5%. Gartner also predicts that by 2027, 85% of procurement organizations will still be working on improving data quality to exploit efficiencies from technologies like GenAI.
That last number is the one that matters most. Even as organizations race to adopt AI agents, the vast majority will still be fighting their data quality problem. The organizations that solve it first will have a compounding advantage.
In an agentic model, the platform does not just ask “who should approve this?” It answers “what should happen next, based on everything we know?”
Here is what this looks like in practice:
A credit card charge appears for an unrecognized vendor. The agent identifies the missing vendor, triggers autonomous onboarding, collects vendor data via an email questionnaire, validates and normalizes the information, pushes it to the ERP, and flags for human approval only if risk thresholds are exceeded.
An invoice arrives with no matching purchase order. The agent cross-references contract terms, identifies the discrepancy or validates the payment, schedules it in the system, and escalates to a CFO only when the amount exceeds delegation thresholds.
A contract renewal approaches with spend above forecast. The agent surfaces the renewal automatically based on termination clause analysis, recommends renegotiation or budget reallocation, and routes to procurement with full performance history. Humans make the strategic decision; the agent did the preparation and routing.
None of these workflows are possible without a unified data foundation. The agent that identifies an unrecognized vendor needs semantic entity resolution to check against existing vendor records across every system. The agent that validates an invoice needs harmonized contract and PO data. The agent that surfaces renewal risks needs historical spend data, contract terms, and budget context from multiple sources.
Data first. Then agents.
The ROI of procurement technology is becoming more measurable. The AOP report found that across orchestration providers, customers consistently cite these outcomes:
When the data foundation is solid, these outcomes compound. K-health, a digital health company managing hundreds of vendor relationships, reduced its supplier base by 15% through intelligent spend analysis, identified $2.3M in annual savings through automated tail spend detection, achieved 100% contract compliance tracking across 600+ active agreements, and cut procurement cycle time from 14 days to 3.
“Opstream transformed our procurement from a cost center to a strategic advantage. The data clarity alone was worth the investment.”
VP Operations, K-health
A global investment bank moved from manual PO matching, limited spend visibility, and reactive compliance to automated approval flows, complete spend transparency, and proactive risk management. Their Director of Procurement noted: “The platform creates accountability across every level of the organization, from requesters to managers to executives. Requesters can no longer claim they mentioned something verbally, and approvers cannot dispute what they authorized.”
What makes these outcomes possible is not the workflow automation itself. It is the fact that the automation operates on reliable, unified, semantically consistent data. The same AI features deployed on fragmented, unnormalized data would produce a fraction of the value.
If you are evaluating procurement platforms, particularly their AI capabilities, here are the questions that separate substance from marketing:
These questions are adapted from Art of Procurement’s five-dimension evaluation framework and from the patterns we see in enterprise procurement evaluations. (For a broader comparison of platforms, see our guide to the best AI procurement platforms in 2026.) The answers will tell you whether a platform’s AI is built on solid ground or not.
Data readiness means your procurement data is unified, normalized, and semantically consistent across all systems. This includes vendor master data, spend categorizations, contract terms, and approval hierarchies. Without data readiness, AI features produce inconsistent or unreliable results because they are operating on fragmented inputs.
AI features are only as valuable as the data they operate on. A platform with advanced AI capabilities but poor data foundations will produce impressive demos but disappointing production results. Organizations that solve the data problem first see compounding returns from every AI capability deployed on top of that foundation.
Generative AI produces content, summaries, or recommendations that a human then acts on. Agentic AI operates independently: it evaluates context, makes decisions, executes tasks, and escalates to humans only when necessary. In procurement, this means agents that autonomously handle vendor onboarding, invoice reconciliation, contract review, and risk assessment rather than simply generating reports about them.
Ask what percentage of customers are using AI features in production. Ask how the platform handles your specific data model, not a standardized one. Ask whether AI behavior is configurable and auditable. And ask how the platform resolves semantic inconsistencies across your systems. Platforms with genuine AI depth will have clear answers to all four questions.
It can, but only if the platform has a semantic data layer that normalizes information across ERPs in real-time. Most enterprises operate multiple ERPs simultaneously due to M&A, regional requirements, and legacy systems. Traditional integrations break down in multi-ERP environments because they move data without understanding its context. A data-first platform resolves entity duplicates, harmonizes taxonomies, and builds a unified model across all connected systems.
The procurement platforms of tomorrow are being built today on data foundations, not integration layers. The organizations that will get the most value from AI in procurement are not the ones that adopted AI features first. They are the ones that solved their data problem first.
If your procurement data is fragmented across systems, inconsistently categorized, and manually reconciled, no amount of AI features will close that gap. Start with the data.
If you are evaluating procurement technology, make data foundations the first question you ask, not the last. The platform you choose will determine your ability to deploy agentic AI, bridge cross-functional processes, and deliver the efficiency gains that close the procurement productivity gap. The wrong choice compounds over time. The right one does too.
See how a data-first approach changes procurement operations
Opstream unifies your procurement data across every system and builds a semantic model unique to your organization. Autonomous workflows and agentic analytics are built on that foundation.
About the Author
Mor Cohen-Tal is the Co-Founder and Chief Technology Officer of Opstream. With 8 patents in cloud and AI infrastructure, Mor previously served as Cloud CTO at Turbonomic, where she led the company’s transition to a cloud-centric model before its acquisition by IBM for nearly $2B. A leading thought leader in cloud and AI, Mor has keynoted at Microsoft Ignite and AWS re:Invent. She holds an M.Eng from Cornell University and a B.Sc from the Hebrew University.