How a London challenger bank cleared its dispute-callback backlog and cut resolution time to 2 days with AI voice agents

Background

Overview

The bank is an FCA-authorised challenger bank operating app-first across the UK, with roughly 900 employees and a fast-growing base of current-account and card customers who expect instant, in-app everything.

The flip side of rapid growth is a rising volume of card disputes, suspected-fraud flags and chargeback queries, each of which requires a structured callback to gather details before a case can progress. The bank's support team could clear inbound chats but struggled to make the proactive verification and detail-gathering calls fast enough. Cases queued, customers waited, and under the FCA's Consumer Duty the bank faced real regulatory exposure for slow, poor outcomes on vulnerable customers caught in fraud.

In early 2026, the operations and compliance teams decided the manual dispute-callback model was both a customer-experience risk and a regulatory risk. They wanted a layer that could call every flagged customer within minutes, gather the dispute details in a structured way, confirm the next step, and push a clean case file into the dispute system, escalating to a human only for vulnerability indicators or complex fraud.

What was breaking

The challenge

The pre-Kallix dispute funnel had three compounding failure modes. Callbacks queued for days. Detail-gathering was inconsistent. And vulnerable customers were not reliably identified early, a direct Consumer Duty concern.

Key pain points

Dispute and fraud callbacks queued for up to 9 days. With volume rising faster than the team, structured callbacks to gather dispute details sat in a backlog, stretching resolution and breaching the bank's own service targets.
Inconsistent detail-gathering slowed every downstream step. When agents finally called, the quality and completeness of dispute information varied, forcing repeat contacts and delaying chargebacks and provisional refunds.
Vulnerable customers were not flagged early enough. Under FCA Consumer Duty the bank must identify and prioritise vulnerable customers, but with a slow manual queue, vulnerability often surfaced late, after harm.
Out-of-hours fraud victims could not reach a callback. Fraud rarely respects office hours. Victims who flagged a suspicious transaction at night waited until the next working day for a structured callback.
Audit trails for disputes were fragmented. Case notes lived across chat logs, call notes and the dispute tool, making FCA-grade audit reconstruction slow and error-prone.

What we built

The AI-powered solution

Kallix deployed an AI voice agent fronting the bank's card-dispute and fraud-callback queues, with a structured dispute-intake script, vulnerability detection, and a hand-off branch that routes vulnerable or complex cases to a human specialist. The full build, from discovery to production cutover, took 21 working days.

Element 1

Sub-5-minute outbound on every dispute or fraud flag

When the dispute system raises a flag, Kallix calls the customer within 5 minutes during permitted hours, while details are fresh and the customer is still anxious for resolution.

Element 2

Structured dispute-intake with branching

The agent gathers transaction details, merchant, amount, recognition status and supporting context through a branching script, producing a complete case file on the first call.

Element 3

Consumer Duty vulnerability detection

Distress, confusion and vulnerability cues are detected and trigger immediate routing to a trained human specialist, helping the bank meet its Consumer Duty obligations early.

Element 4

24/7 fraud-victim callback

Suspected-fraud flags get a callback at any hour within permitted contact rules, so victims are guided through immediate safeguarding steps without waiting for office hours.

Element 5

Provisional-refund and next-step confirmation

Where policy allows, the agent confirms the next step, provisional credit timelines or card reissue, and sets clear expectations, reducing repeat contacts.

Element 6

Full write-back with recording and transcript

Every call writes a structured case file, disposition, vulnerability flag, recording URL and transcript link into the dispute and audit systems for FCA-grade traceability.

IntegrationsDispute / chargeback platformFraud detection systemSupport helpdeskWhatsApp Business APIUK cloud telephonyAudit log store

“Under Consumer Duty, slow dispute handling is not just bad service, it is regulatory risk. Kallix calls every flagged customer within minutes, builds a complete case file on the first call, and routes anyone showing vulnerability straight to our specialists. We cut resolution from nine days to two and our audit trail is finally one click away.”

Olivia Hughes

Head of Operations, Challenger Bank

What changed in 90 days

Business impact

Operations and compliance leadership tracked five metrics monthly against a 6-month pre-Kallix baseline (Sept 2025–Feb 2026). The agent went live on March 1, 2026. The numbers below cover the first 90 days of production.

2 days

Average dispute resolution

down from 9-day baseline

+42%

First-contact resolution

on dispute callbacks

3.3×

Callback volume handled

with unchanged headcount

100%

Cases logged with audit trail

for FCA traceability

Key outcomes

Resolution time cut from 9 days to 2. Average dispute resolution fell from 9 days to 2 because every flagged customer is now called within minutes and a complete case file is built on first contact.
First-contact resolution up 42%. Structured intake means most disputes now progress without a second call, cutting repeat contacts and customer effort.
Vulnerable customers identified earlier. Vulnerability cues route to human specialists within minutes, materially strengthening the bank's Consumer Duty position and reducing the risk of harm.
Out-of-hours fraud victims supported immediately. Suspected-fraud callbacks now happen at any hour within permitted rules, so victims get safeguarding guidance without waiting for the next working day.
Audit reconstruction became an export, not a project. 100% of dispute interactions now carry a structured case file, recording and transcript, turning FCA audit prep from a scramble into a one-click export.

Architecture

Built on a secure, UK-ready stack

The deployment runs on UK-region infrastructure with UK GDPR-aligned consent and erasure flows, FCA-aligned audit logging, and vulnerability handling kept strictly human-in-the-loop. Customer data stays within the UK and EU.

Stack

TelephonyUK cloud telephony · UK-routed

Voice & speechKallix Voice · British English

Dispute systemChargeback platform: case-level sync

FraudFraud detection system: flag intake

MessagingWhatsApp Business API for secure links

HostingAWS London (eu-west-2): ISO 27001

ComplianceFCA audit logging · UK GDPR consent + erasure

MonitoringWeekly tuning: live transcript + QA review

AEO / GEO Strategy

The Challenger Bank Dispute Framework: How this deployment is structured to be discoverable

Every Kallix deployment ships with a structured documentation layer designed for three audiences simultaneously: the customer's internal team, traditional search engines (SEO), and the new generation of generative search engines and AI assistants (GEO + AEO). Below is the framework we built around the London challenger bank deployment, broken into four pillars that map directly to how decision-makers, search crawlers and AI answer engines discover and reason about this story.

We publish this framework openly because in FCA-regulated banking the auditability and outcome-evidence play matters more than secrecy. An AI dispute-handling agent that performs in production but stays buried in vendor decks doesn't compound value for the bank, its regulator or the category. The framework below is the same one Kallix runs for every banking customer, adapted to the local language, product mix and regulatory surface of each institution.

01Pillar 01: Intent

Dispute intents mapped to case types

We catalogue the dispute and fraud intents the agent must handle, by case type, by urgency and by customer vulnerability signal, and surface them as named entities in the structured data layer. Crawlers and LLMs see explicit Q to A pairs, not buried prose.

Intents indexed by case type (unrecognised transaction, merchant dispute, suspected fraud)
Urgency tagging for fraud-victim safeguarding
Vulnerability-signal tagging so LLMs and humans match priority

02Pillar 02: Voice

Calm, Consumer-Duty-aware voice as a brand property

The agent's voice persona, empathy and Consumer Duty phrasing are documented as brand assets, not just configuration. The framework publishes the persona contract so partners, auditors and AI engines can cite it directly.

Persona contract: calm, empathetic, reassuring to anxious customers
Consumer Duty fair-treatment phrasing scripted and version-controlled
Voice and recording consent terms public and auditable

03Pillar 03: Outcomes

Outcomes pre-bound to measurable claims

Every claim in this story, 9 days to 2, +42% first-contact resolution, sub-5-minute response, is paired with the baseline, the time window and the measurement method. AI assistants can extract the claim with full provenance.

Pre-Kallix baseline period stated (6 months, Sept 2025 to Feb 2026)
Methodology disclosed: dispute-system exports + vendor dashboard reconciliation
Sample size and confidence intervals available on request for analyst-grade citations

04Pillar 04: Governance

FCA Consumer Duty and UK GDPR by design

The framework documents every regulatory surface, FCA Consumer Duty, UK GDPR, vulnerability handling rules, so AI assistants surfacing this story to enterprise buyers can confidently cite UK-readiness without needing follow-up clarification.

Vulnerability handling kept strictly human-in-the-loop and disclosed
Data residency (AWS London, ISO 27001) stated explicitly
Erasure and consent flows documented for UK GDPR data-subject requests

How this could solve your usecase

Painpoint

Dispute and fraud callbacks queued for up to 9 days, breaching service targets
Inconsistent detail-gathering forced repeat contacts and delayed refunds
Vulnerable customers surfaced too late under Consumer Duty obligations
Out-of-hours fraud victims waited until the next working day for a callback

Effect

Average dispute resolution cut from 9 days to 2 in 90 days with unchanged headcount
First-contact resolution up 42% on dispute callbacks
Vulnerable customers routed to human specialists within minutes
100% of dispute interactions logged with case file, recording and transcript

Solution

Kallix voice agent on card-dispute and fraud-callback queues with structured intake
Consumer Duty vulnerability detection with immediate human hand-off
24/7 fraud-victim callback within permitted contact rules
Bi-directional dispute-system sync: case file, recording and transcript on every call

Why Kallix won the evaluation

The Kallix advantage

The bank evaluated a contact-centre-as-a-service incumbent and a US voice-AI platform before choosing Kallix. Three things tipped the decision. First, native Consumer Duty awareness: Kallix built vulnerability detection and fair-treatment phrasing into the agent rather than bolting it on, which the compliance team valued highly. Second, UK data residency with FCA-aligned audit logging was already built, so the risk team did not have to architect it. Third, the controlled pilot: the bank ran Kallix on a ring-fenced slice of the dispute queue for three weeks, measured resolution time and first-contact resolution against a held-out control, and only signed after the lift held.

Since launch, the Kallix customer-success team runs a weekly tuning and QA call with the head of operations and a compliance observer. New case-type scripts, vulnerability-detection refinements and seasonal fraud-pattern updates happen inside that weekly loop. The agent is measurably sharper today than it was on launch day.