Skip to main content

Change Data Capture (Planned)

Planned — Q1 2027

CDC is on the committed roadmap for Q1 2027. Requires both Managed Kafka (streaming) and Managed PostgreSQL (databases) to be operational.

Purpose: For platform engineers and data engineers, explains the planned CDC service — scope, supported databases, lifecycle model, and dependencies.

Overview

Change Data Capture captures row-level changes from source databases and publishes them as events to Kafka topics. openCenter CDC uses Debezium connectors deployed on KafkaConnect, managed via KafkaConnector CRDs in Git and reconciled by FluxCD.

This is a narrow, opinionated CDC service — not a general-purpose data integration platform. PostgreSQL is the primary source; MySQL and MongoDB support follows.

Planned Scope

CapabilityDescription
PostgreSQL capturepgoutput logical decoding — row-level inserts, updates, deletes
MySQL capturebinlog-based capture
MongoDB captureChange streams
Kafka sinkEvents published to Kafka topics (one topic per table default)
Schema handlingIntegrates with Schema Registry (Q4 2026) when available
Connector lifecycleDeploy, pause, resume, restart, delete via KafkaConnector CRD
MonitoringPrometheus metrics for connector lag, errors, throughput
GitOps managedConnector definitions in Git, FluxCD reconciliation

Supported Databases

DatabaseMechanismMinimum VersionStatus
PostgreSQLpgoutput logical replication14+Primary target
MySQLbinlog (ROW format)8.0+Secondary
MongoDBChange streams6.0+Secondary

Lifecycle Operations

OperationMechanism
Deploy connectorApply KafkaConnector CR via Git commit
Pause captureSet state: paused in KafkaConnector CR
Resume captureSet state: running — resumes from last committed offset
Restart (error recovery)Annotation-triggered restart via operator
Schema evolutionHandled by Schema Registry compatibility checks
Upgrade DebeziumConnector image version bump in KafkaConnect CR

Architecture

Dependencies

DependencyRequiredNotes
Managed KafkaYesSink for CDC events; provides KafkaConnect runtime
Managed PostgreSQLYes (for PG source)Source database with logical replication enabled
Schema RegistryRecommendedSchema evolution without breaking consumers
cert-managerYesTLS for connector-to-database and connector-to-Kafka
kube-prometheus-stackYesConnector metrics and alerting
FluxCDYesGitOps lifecycle for KafkaConnector CRDs

Constraints

  • PostgreSQL source requires wal_level = logical (configured by Managed PostgreSQL service)
  • One Debezium connector per source database (fan-out via Kafka topics)
  • Large initial snapshots may require dedicated KafkaConnect task allocation
  • Topic naming follows convention: <connector-name>.<schema>.<table>

Further Reading