CDC Sources

Install

See the Install guide for the full setup, including Windows PowerShell.

curl -fsSL https://install.skippr.io/install.sh | shClick to copy

Installing Skippr means accepting the Skippr EULA.

Skippr reads native change logs from five source systems. Each source captures inserts, updates, and deletes and emits them as CDC events with mutation kind and order token metadata.

For reliable replay, each CDC source needs three things:

a durable resume position
mutation fidelity for inserts, updates, and deletes
an order model that can support the selected destination contract

See CDC Guarantees for the contract and CDC Operations for lag, retention, and restart guidance.

PostgreSQL

PostgreSQL CDC uses WAL logical replication with the pgoutput output plugin. Skippr creates a replication slot and publication, then streams row-level changes in real time.

Prerequisites

Set wal_level = logical in postgresql.conf (requires restart)
The replication user must have the REPLICATION attribute or be a superuser
max_replication_slots must be at least 1 (default is usually 10)

Configuration

yaml

source:
  kind: postgres
  host: localhost
  port: 5432
  user: replicator
  password: ${POSTGRES_PASSWORD}
  database: mydb
  cdc_mode: snapshot_then_cdc

Field	Default	Description
`cdc_mode`	`snapshot`	`snapshot`, `snapshot_then_cdc`, or `cdc_only`
`replication_slot_name`	`skippr_slot`	Name of the replication slot
`publication_name`	`skippr_pub`	Name of the publication

What gets captured

INSERT rows with mutation kind insert
UPDATE rows with mutation kind update (full row after image)
DELETE rows with mutation kind delete

Resume behavior

With snapshot_then_cdc, Skippr runs the initial snapshot once, stores the committed LSN (Log Sequence Number), and then streams changes. On restart, replication resumes from the stored LSN and the snapshot is skipped. With cdc_only, Skippr starts from the replication slot position without running a snapshot.

The replication slot is reused across restarts (not recreated), so PostgreSQL retains WAL segments only until Skippr has confirmed them.

MySQL

MySQL CDC uses binlog replication. Skippr connects as a replication client, reads row-level events from the binary log, and emits them as CDC mutations.

Prerequisites

Set binlog_format = ROW in my.cnf
Set binlog_row_image = FULL (ensures complete before/after images)
The replication user must have REPLICATION SLAVE and REPLICATION CLIENT privileges

Configuration

yaml

source:
  kind: mysql
  connection_string: mysql://replicator:${MYSQL_PASSWORD}@host:3306/mydb
  cdc_mode: snapshot_then_cdc

Field	Default	Description
`cdc_mode`	`snapshot`	`snapshot`, `snapshot_then_cdc`, or `cdc_only`
`server_id`	auto-generated	MySQL server ID for the replication client

What gets captured

WRITE_ROWS events (inserts)
UPDATE_ROWS events (updates with full row image)
DELETE_ROWS events (deletes)

Resume behavior

With snapshot_then_cdc, Skippr captures the binlog position, runs the initial snapshot once, and then streams from that position. On restart, the binlog stream resumes from the stored filename and position. With cdc_only, Skippr skips the snapshot and starts from the current or stored binlog position.

MongoDB

MongoDB CDC uses change streams, which are backed by the oplog. Skippr opens a change stream on the target database and receives real-time notifications for document mutations.

Prerequisites

MongoDB must be running as a replica set or sharded cluster (change streams require an oplog)
The connection user must have read access on the target database

Configuration

yaml

source:
  kind: mongodb
  connection_string: mongodb://user:${MONGO_PASSWORD}@host:27017/mydb
  cdc_mode: snapshot_then_cdc

Field	Default	Description
`cdc_mode`	`snapshot`	`snapshot`, `snapshot_then_cdc`, or `cdc_only`

What gets captured

insert operations
update operations (full document after image via fullDocument: updateLookup)
delete operations

Resume behavior

With snapshot_then_cdc, Skippr captures a change-stream resume token, runs the initial snapshot once, and then streams with resume_after. On restart, the change stream resumes from the stored token. With cdc_only, Skippr skips the snapshot and starts the change stream directly unless a stored token is available.

DynamoDB

DynamoDB CDC uses DynamoDB Streams to capture item-level changes. Skippr reads shard iterators and processes records with NEW_AND_OLD_IMAGES to get full before/after item state.

Prerequisites

Enable DynamoDB Streams on the table with StreamViewType = NEW_AND_OLD_IMAGES
The IAM role must have dynamodb:DescribeStream, dynamodb:GetShardIterator, and dynamodb:GetRecords permissions

Configuration

yaml

source:
  kind: dynamodb
  table_name: my_table
  region: us-east-1
  cdc_mode: snapshot_then_cdc

Field	Default	Description
`cdc_mode`	`snapshot`	`snapshot`, `snapshot_then_cdc`, or `cdc_only`

What gets captured

INSERT events (new items)
MODIFY events (updated items, full new image)
REMOVE events (deleted items)

Resume behavior

With snapshot_then_cdc, Skippr scans the table once, records a bootstrap checkpoint, and then consumes DynamoDB Streams. On restart, each shard resumes from the stored sequence number and the snapshot is skipped. With cdc_only, Skippr skips the scan and starts from the stream.

Kafka

Kafka CDC consumes Debezium-formatted messages from Kafka topics. Skippr parses the Debezium envelope to extract mutation kind, key fields, and payload.

Prerequisites

A Debezium connector must be running and publishing change events to the Kafka topic
Messages must use the standard Debezium envelope format with op, before, and after fields

Configuration

yaml

source:
  kind: kafka
  brokers: "localhost:9092"
  topic: dbserver1.public.customers
  cdc_mode: cdc_only

Field	Default	Description
`cdc_mode`	`snapshot`	Use `cdc_only` for Debezium CDC streams. Kafka does not support `snapshot_then_cdc`.
`debezium_format`	`false`	Parse messages as Debezium envelopes
`group_id`	`skippr-{project}`	Kafka consumer group ID

What gets captured

op: c (create / insert)
op: u (update)
op: d (delete)

Resume behavior

Skippr uses a stable group_id derived from the project name. Kafka's consumer group offset tracking provides durable resume -- on restart, consumption resumes from the last committed offset.

Install

See the Install guide for the full setup, including Windows PowerShell.

curl -fsSL https://install.skippr.io/install.sh | shClick to copy

Installing Skippr means accepting the Skippr EULA.

CDC Sources ​

Install ​

PostgreSQL ​

Prerequisites ​

Configuration ​

What gets captured ​

Resume behavior ​

MySQL ​

Prerequisites ​

Configuration ​

What gets captured ​

Resume behavior ​

MongoDB ​

Prerequisites ​

Configuration ​

What gets captured ​

Resume behavior ​

DynamoDB ​

Prerequisites ​

Configuration ​

What gets captured ​

Resume behavior ​

Kafka ​

Prerequisites ​

Configuration ​

What gets captured ​

Resume behavior ​

Install ​

CDC Sources

Install

PostgreSQL

Prerequisites

Configuration

What gets captured

Resume behavior

MySQL

Prerequisites

Configuration

What gets captured

Resume behavior

MongoDB

Prerequisites

Configuration

What gets captured

Resume behavior

DynamoDB

Prerequisites

Configuration

What gets captured

Resume behavior

Kafka

Prerequisites

Configuration

What gets captured

Resume behavior

Install