The "Plug & Play" Lie: Why Your Oracle CDC Pipeline Will Fail
Breakout Session
Surviving unbounded numerics, missing SMTs, and XStream configuration hell.
We are often promised that Change Data Capture (CDC) is a solved problem: "Just install the Debezium connector, point it at Oracle, and stream." In reality, connecting Debezium to Oracle XStream is not the end of the journey, it is merely the start of a complex engineering challenge.
We will share why without a rigorous platform around it, a raw XStream implementation often leads to production outages, data corruption via type mismatches, and operational gaps.
In this session, we will expose the missing pieces required to turn a raw Debezium connector into a resilient data pipeline. We will move beyond "Hello World" examples and dissect the painful realities of Oracle CDC. We will frame this discussion around our own architectural evolution: sharing the scars from our v1, the decisions that defined our current production v2, and the architectural features of a hypothetical v3 that we are still chasing.
We’ll go deep on:
The Type System Minefield: How to handle Oracle’s "Unbounded Numerics" and complex timestamps without crashing your consumers or losing precision (and why default SMTs aren't enough)
Declarative Pipeline Generation: Why handwriting connector configs is a recipe for disaster. We will demonstrate using SpecMesh and pipeline definitions to auto-generate complex Debezium and SMT configurations. We use these definitions as collaborative contracts with domain teams, agreeing on schemas and intent upstream.
Closing the Trust Gap: CDC without verification is just a best-guess. We will share an overview of our continuous reconciliation process that proves that the data matches the source of truth.
Full-Stack Local Testing: Running Kubernetes, Oracle, Kafka, and your full pipeline on a developer machine. We’ll show how to test schema evolution and SMT logic locally, long before deployment
Survival Mechanics: Deep dives into XStream position recovery, implementing heartbeat to prevent quiet tables holding on to redo logs for longer than expected and handling Confluent Cloud region failover without data loss
This is a deeply technical, practitioner-focused session aimed at engineers and architects who are interested in migrating data from Oracle databases. You’ll come away with: a mental model of how Oracle XStream works, design patterns for building resilient pipelines, concrete tips for observability and performance tuning, and a set of “day two” operational checklists.
Kiril Piskunov
MarketAxess
Declan Curran
MarketAxess