Skip to content
All topics

TOPIC BRIEFING

AI & Automation

AI has moved from differentiator to baseline in payments infrastructure. The gap between operators running production ML and those still running rules is measurable in authorisation rate, fraud loss, and operational headcount.

Production AI in payments clusters around four layers: authorisation intelligence, fraud ML, agentic commerce, and operational automation. Each has documented performance lift, specific architectures, and a noise-to-signal ratio in vendor marketing that operators must learn to filter — because the difference between real ML and relabelled rules is worth hundreds of basis points.

15 briefings Auth & routing MLFraud detectionMLOpsAgentic commerce

Stack map

The AI Payments Stack

Six layers where AI is deployed in production payment operations — from raw data to agentic interfaces. Each layer has distinct tooling, distinct performance signals, and distinct vendor markets.

  1. 01

    Data layer

    Feature stores, labelled training data, drift detection — the quality of this layer determines the ceiling for every model above it.

  2. 02

    Inference

    Sub-100ms real-time scoring for fraud, routing, and risk — where latency budgets and model complexity trade off.

  3. 03

    Decisioning

    Authorisation logic, retry orchestration, liability shift mechanics — ML output converted to a binary approve/decline with audit trail.

  4. 04

    Orchestration

    Multi-acquirer routing, ML-driven retries, smart payout selection — the layer that compounds authorisation rate improvements across the stack.

  5. 05

    Automation

    Reconciliation, dispute response, customer ops workflows — LLMs handling structured and semi-structured payment data at scale.

  6. 06

    Agentic interface

    AI agents transacting via Stripe ACP, Visa TAP, Mastercard Agent Pay — autonomous payment execution without a human at checkout.

The operator thesis

Three operator takes

01

Authorisation rate is now an ML problem

Static routing tables and rules-based retry logic left 2–5% authorisation rate on the table. Stripe Payment Fingerprint recovered $6B in falsely declined transactions in 2024. Adyen Uplift cut false declines 42% in a January 2025 update. ML routing is the new baseline — not a premium feature.

02

Fraud ML requires MLOps, not just data science

Labelling lag, population drift, and explainability obligations are the operational disciplines that separate production fraud teams from those who ship models that decay in silence. A model without a retraining cadence and drift monitoring is a liability, not an asset.

03

Agentic commerce is live infrastructure in 2026

Visa TAP, Mastercard Agent Pay, and Stripe ACP are real production rails — not roadmap items. AI agents transacting autonomously is a fraud-modelling and credential-design problem operators must solve now, before agent-initiated transaction volumes make the gap visible.

Start here

Reading paths for AI & Automation

AI in fraud and risk operations

Real-time decisioning architecture, production fraud detection, and where rules and ML coexist.

Routing, reconciliation, and operations

ML routing vs static rules, AI in reconciliation, and automated merchant onboarding at scale.

Agentic commerce and governance

The payment-agent stack, EU AI Act classification for payment systems, and keeping models accurate.

Briefings, grouped by decision

15 briefings in AI & Automation

Reference

Frequently asked

What is AI actually being used for in payments today — and what's still marketing?

Production AI in payments clusters around four categories: fraud scoring (ML models scoring transactions in real time, deployed at Stripe, Adyen, PayPal, and most large PSPs for years), smart routing (ML-driven acquirer selection to maximise authorisation rates — Stripe PFM, Adyen Uplift), reconciliation automation (LLMs and NLP matching payment records against invoices, handling remittance data extraction), and agentic commerce (AI agents initiating and executing payments autonomously — live in controlled deployments via Visa TAP, Mastercard Agent Pay, Stripe ACP as of 2025–2026). Marketing AI in payments usually means rule-based systems relabelled, simple threshold scoring, or features still in internal testing. The test: ask the vendor for the specific model architecture, retraining cadence, and a documented authorisation rate or fraud reduction lift.

How does ML fraud detection differ from traditional rule engines?

Rule engines apply explicit human-written conditions (if country = high-risk AND amount > $500 AND new device, decline). They are interpretable, fast to deploy, and predictable — but they decay as fraud patterns evolve and require constant manual updating. ML models learn statistical patterns from labelled transaction data — identifying non-obvious feature combinations that correlate with fraud. They generalise to new fraud patterns without manual rule updates, but require: sufficient labelled training data (typically millions of transactions), feature engineering, ongoing retraining as fraud drifts, and explainability infrastructure for compliance and appeals. In practice, production fraud systems at scale are hybrid: rules handle obvious blocks and business logic (declined merchant categories, velocity limits), while ML models score the ambiguous middle of the distribution.

What is agentic commerce and why does it matter for merchants in 2026?

Agentic commerce refers to AI agents — software that can browse, decide, and transact autonomously on behalf of users — completing purchases without a human present at checkout. As of 2026, this is live infrastructure: Mastercard confirmed Europe's first end-to-end AI agent payment (Santander, March 2026), Stripe's ACP endpoint handles OpenAI's Instant Checkout, and Visa's Trusted Agent Protocol provides cryptographic agent identity verification for merchants. For merchants, agentic commerce means: your checkout will be called by software, not just humans; agent-initiated transactions need separate fraud signal treatment (no mouse movement, typing cadence, or device fingerprint signals); and Visa/Mastercard network credential models (scoped tokens per agent) are the emerging authorisation standard for this traffic.

How do I evaluate whether a payments vendor's 'AI' claims are substantive?

Four questions cut through most AI marketing in payments. First, what is the specific model architecture — gradient boosting, transformer, ensemble — and when was it last retrained? Vague answers ('advanced machine learning') signal marketing language. Second, what is the documented performance lift — authorisation rate improvement in basis points, fraud reduction as percentage of GMV, FP rate reduction — with a defined time period and merchant cohort? Third, how does the vendor handle concept drift — when fraud patterns shift, how quickly is the model updated and what triggers retraining? Fourth, is the model explainable — can it produce a human-readable reason for a specific decision? Regulated operators in the EU (AI Act, credit decisions) and US (adverse action notices) increasingly need explainability for automated payment decisions.

What MLOps capabilities do production payment AI models actually need?

Payment AI models face specific MLOps challenges that generic ML infrastructure does not fully address. Labelling lag: fraud labels arrive weeks after transactions (chargebacks, confirmed disputes), so models train on incomplete ground truth. PSI monitoring: Population Stability Index tracking detects when the input distribution has drifted from training data — a leading indicator that model performance is degrading before it shows up in fraud rates. Event-driven retraining: retraining triggered by drift metrics, not just calendar schedule, because fraud patterns can shift in days during active attack campaigns. Shadow mode deployment: running new models in shadow alongside production before switching, to validate performance without live risk. Fallback logic: when the model returns low-confidence scores, rules-based fallback prevents both excessive fraud and excessive false declines.

AI has moved from differentiator to baseline in payments infrastructure. Fraud scoring, smart routing, reconciliation automation, and agentic commerce are the four production application layers that matter for operators in 2026 — and in each, the gap between operators who have invested in AI-driven capabilities and those who have not is measurable in authorisation rate basis points, fraud loss percentage, and operational headcount.

The noise-to-signal ratio in payments AI is high. Most PSP marketing conflates rule-based systems with ML models, describes internal prototypes as production features, and presents lift figures without controlling for cohort selection. The signal: Stripe’s Payment Fingerprint recovered $6B in falsely declined transactions in 2024; Adyen Uplift cut false declines by 42% and processing costs by 9.4% in a documented January 2025 update; Mastercard confirmed Europe’s first live end-to-end AI agent payment in March 2026. These are verifiable, specific claims. The briefings in this topic are grounded in the same standard — specific architectures, documented outcomes, and honest assessments of what is live versus in development.

The operator question in 2026 is no longer whether to use AI in the payment stack. It is which layers to build in-house versus vendor, how to evaluate vendor claims, and how to position for agentic commerce as it crosses from controlled pilot to mainstream checkout traffic over the next 12–24 months.