Skip to content
Ai And Automation 11 min read

AI in Merchant Onboarding: KYB Automation, Risk Scoring, and Approvals

ML compresses merchant onboarding from days to minutes. Document AI, UBO extraction, risk scoring, and straight-through processing logic at PSPs and PayFacs.

PB
By Shaun Toh
TL;DR

AI has compressed merchant onboarding from days to minutes for low-risk applicants. The ML stack covers document verification, UBO extraction, risk scoring, and instant-approval routing — with Persona, Alloy, and Sardine leading the vendor landscape.

Merchant onboarding has historically been a bottleneck — the gap between a business signing up with a PSP or PayFac and actually being able to process payments. Manual KYB review, document verification, UBO tracing, and risk assessment routinely took two to five business days. For a PSP onboarding thousands of merchants monthly, it also represented significant operational cost.

Machine learning has changed this for low-risk applicants. The ML stack can now compress the decision for a straightforward merchant application to minutes, with straight-through processing approval requiring no human involvement. The bottleneck that remains is exactly where it should be: high-risk or complex applicants where human judgment is genuinely needed.

Why Manual Onboarding Fails at Scale

A single manual KYB review for a UK limited company involves: checking Companies House for registration status and director list, running each director and ultimate beneficial owner through sanctions and PEP screening, assessing the business website and product against the stated merchant category, reviewing submitted bank statements for legitimacy, and making a risk tier assignment. For a straightforward applicant this takes 20–40 minutes of analyst time. For a multi-jurisdiction holding structure with nominee directors, it can take days.

For high-volume PayFacs sub-boarding thousands of SMEs — e-commerce enablers, SaaS platforms with embedded payments, marketplace operators — manual review of every application is operationally impossible. Risk-based automation is not optional; it is the only way to run the model.

The ML Onboarding Stack

Layer 1: Document AI and data extraction

The first bottleneck in manual KYB is document review. Analysts receive certificates of incorporation, bank statements, utility bills, and director ID documents in inconsistent formats and quality levels.

Document AI models automate this: OCR extracts text from structured and unstructured documents, classification models identify document type (certificate of incorporation vs bank statement vs driving license), and entity extraction models pull the relevant fields — company name, registration number, registered address, director names, incorporation date.

The output is structured data with a confidence score on each extracted field. High-confidence extractions proceed automatically; low-confidence extractions (poor image quality, unusual document format, handwritten fields) are flagged for human review. This immediately routes the 80–90% of clean applications away from analyst queues.

Layer 2: Entity resolution and registry verification

With structured data in hand, automated verification checks run in parallel:

  • Company registry lookup: match the extracted registration number and company name against the relevant national registry (Companies House for UK, ACRA for Singapore, Handelsregister for Germany). Confirm the entity is active, not struck off, and matches the stated registration.
  • Director and UBO verification: extract the director list from the registry record and run each individual through KYC verification — identity document matching, liveness check, PEP screening.
  • UBO chain tracing: for layered ownership structures, graph-based ML traces beneficial ownership through intermediate entities to identify the natural persons who ultimately own or control the business above the 25% threshold. This is the hardest step to automate reliably, particularly for structures involving trusts, foundations, or offshore jurisdictions.
  • Sanctions and watchlist screening: the entity, its directors, and its UBOs are screened against OFAC, EU, UN, UK, and other sanctions lists in real time.

Layer 3: Risk scoring at application

With verification complete, the ML risk scoring model assigns a risk tier. Input signals typically include:

  • Category risk: the MCC associated with the stated business activity carries an inherent risk profile. Online gambling, adult content, and crypto exchange MCCs attract higher risk scores than SaaS or professional services.
  • Volume plausibility: does the stated monthly processing volume match what a business of this size and age would plausibly process? A newly registered company stating $2M monthly processing volume without corresponding business evidence is an anomaly signal.
  • Jurisdiction risk: businesses in higher-risk jurisdictions, or with directors from higher-risk jurisdictions, receive elevated risk scores that may route to enhanced review.
  • Data consistency: mismatches between application data and registry data — different registered address, slightly different company name — are risk signals even when they are innocent data entry errors.
  • Application session behavior: device fingerprint, IP geolocation, application timing, and browser signals from the application session itself.

The output: a risk tier assignment (low / medium / high) that determines the routing decision.

Layer 4: Approval routing

The routing decision converts risk tier into action:

  • Straight-through processing (STP): low-risk, all-green applications. Automated approval with no human review. PSP account activated, processing enabled. Decision time: minutes.
  • Light-touch review: medium-risk or single-flag applications. Routed to a human reviewer with the ML assessment pre-populated — the analyst reviews the flagged items rather than starting from scratch. Decision time: hours.
  • Enhanced due diligence: high-risk or multi-flag applications. Full manual review, additional document requests, possible site visit for high-value or high-risk merchant categories. Decision time: days.
  • Decline: clear disqualifiers — confirmed sanctions match, known fraud ring connection, rejected document authenticity. Automated decline with regulatory record.

Layer 5: Post-onboarding behavioral monitoring

Onboarding is not a one-time event for AML purposes. Ongoing monitoring obligations require continuous assessment of whether the merchant’s actual behavior matches their onboarded profile.

Post-onboarding ML monitors: transaction patterns inconsistent with stated business type, unusual velocity spikes, chargebacks arriving within the first few days of activation (a reliable signal for synthetic identity merchant fraud), and network graph connections to other merchants or accounts flagged for fraud.

Alloy’s perpetual KYB product and Sardine’s merchant risk monitoring both address this layer — moving from point-in-time verification to continuous risk assessment throughout the merchant lifecycle.

The Vendor Landscape

Persona positions around automated KYB, UBO-related checks, and document verification workflows. Their platform enables configurable verification flows that adjust based on risk signals — low-risk applicants get minimal friction, high-risk applicants get additional verification steps.

Alloy focuses on perpetual KYB and customer risk assessment — ongoing monitoring as well as initial verification. Particularly strong for financial services companies with ongoing AML monitoring obligations.

Sardine explicitly positions around merchant onboarding and merchant risk monitoring for payment businesses and PayFacs. Strong on behavioral signals and fraud pattern detection.

Middesk specializes in US business verification — company registry data, UBO verification, and business identity for domestic US onboarding workflows.

For PSPs operating globally, the typical architecture combines a specialist orchestration layer (Alloy or Persona) with market-specific data sources (ACRA for Singapore, ACIS for Hong Kong, BvD Orbis for multi-market UBO data).

The EU AI Act Framing

The EU AI Act’s Annex III lists high-risk AI system categories. It includes creditworthiness assessment and access to essential services for natural persons. The EU AI Act’s risk classification explicitly carves out AI systems used to detect financial fraud from the creditworthiness high-risk category — a nuance most KYB vendors haven’t internalized yet.

Automated merchant onboarding AI assesses legal entities — not natural persons in the consumer credit sense. It does not automatically fall into the Annex III high-risk classification on these grounds.

The correct framing: merchant onboarding AI sits in a rising regulatory-risk zone. Not automatically high-risk under the EU AI Act, but increasingly expected to meet governance standards that overlap with high-risk requirements: auditability of decisions, explainability of risk scores, human review triggers for consequential decisions, and defensible documentation of the decision logic.

This is partly driven by the Act’s direction of travel, partly by existing AML supervisor expectations, and partly by enterprise customer requirements — large merchants increasingly ask their PSPs to document how automated onboarding decisions are made and how they can be challenged.

The practical implication: build the audit trail and explainability layer for your ML onboarding models now. Not because the EU AI Act formally requires it for this use case today, but because the expectation is moving in that direction and retrofitting explainability onto deployed models is substantially harder than building it in from the start.

The real-time fraud decisioning stack and the account takeover detection layer face the same governance direction — the theme across payment AI is that explainability and audit trail requirements are expanding regardless of formal high-risk classification.

Sources

Persona — Automated KYBOperator estimate

Persona provides automated KYB, UBO verification, and document verification workflows for business onboarding

Checked:

Alloy — KYB ProductOperator estimate

Alloy provides perpetual KYB and customer risk assessment for ongoing monitoring

Checked:

Sardine positions around merchant onboarding and merchant risk monitoring for payment businesses

Checked:

EU AI Act Annex III excludes AI systems used for detecting financial fraud from the creditworthiness high-risk category

Checked:

Source types explained in our Methodology.

Shaun Toh By Shaun Toh · Director, Digital Payments · Razer

Subscribers get the PSP Selection RFP Kit — 60+ structured questions, evaluation scorecard, and negotiation playbook — delivered to your inbox instantly.

More Ai And Automation briefings