debezium/dbz#1530 Support connector generation and epoch incrementing by twthorn · Pull Request #269 · debezium/debezium-connector-vitess

twthorn · 2026-01-15T23:37:17Z

Description

Add the connector generation config, used to manually update the epoch of a connector's shards.

We need this to allow users to migrate to transaction chunking in this PR #268

This can also be useful for maintenance in general in case the ordering semantics are ever changed, generation is incremented to trigger an epoch bump.

Why Do We Need Manual Epoch Increments

The only known use case is in #268. It is necessary to preserve metadata correctness when enabling that feature.

As taken from Debezium docs:

applications that consume from the topic can apply the following logic to determine which of the two event to apply (the newer event) and which to discard:

If the values for transaction_epoch are not equal, return the event with the higher transaction_epoch value. Otherwise, continue.

If the values for transaction_rank are not equal, return the event with the higher transaction_rank value. Otherwise, continue.

Return the event with a greater total_order value.

So what chunking does is change how the transaction rank field is computed.

Let

ti be transaction i
rk be the rank computed for transaction k
let all transactions t operate on pkA, the primary key A, and pkA - col1: v1 denote setting col1 to the value v1
let order be the total_order of the transaction

Without transaction chunking: Since an entire transaction is received at once, we can compute the rank from the GTID of the transaction (the last event received in the transaction)
t1 -> r1, pkA - col1: v1, total_order = 3
t2 -> r2, pkA - col1: v2, total_order = 2
t3 -> r3, pkA - col1: v3, total_order = 1

With transaction chunking: Instead we receive chunks of a transaction. Only the last chunk will contain the GTID of the transaction. Thus the only GTID available to compute rank is the one of the preceding transaction (the only possible alternative would be to buffer until we receive the last chunk but that defeats the purpose of chunking). Thus the ranks are computed as:
t1 -> r0, pkA - col1: v1, total_order = 3
t2 -> r1, pkA - col1: v2, total_order = 2
t3 -> r2, pkA - col1: v3, total_order = 1

So in the case of enabling transaction chunking there is one of two cases:

Case 1: Sent messages had offsets committed

t1 -> r1, pkA - col1: v1, total_order = 3
t2 -> r2, pkA - col1: v2, total_order = 2 (offsets committed)
<restart, enable chunking>
t3 -> r2, pkA - col1: v3, total_order = 1

Problem: by the previous algorithm, t2 r2 with order 2 wins setting col1, stale value finalized as v2

Case 2: Sent message did not have offset comitted:
t1 -> r1, pkA - col1: v1, total_order = 3
t2 -> r2, pkA - col1: v2, total_order = 2 (offsets not committed)
<restart, enable chunking>
t2 -> r1, pkA - col1: v2, total_order = 2
t3 -> r2, pkA - col1: v3, total_order = 1

Problem: by the previous algorithm, two stale values are incorrectly chosen:

t1 -> r1 v1 has order 3 which beats t2->r1, stale value takes precedence v1
t2-> r2 has order 2 which beats t3->r2, stale value finalized as v2

Overall, the downstream system cannot correctly use metadata to determine which change is the latest for pk1. At a minimum values will be tied (which may be broken with offset check assuming no concurrent kafka re-partition). But with particular transaction ordering values (decreasing rather than increasing like offsets), it may even wrongfully set to old values (stale and incorrect indefinitely until a new update comes along for the pkey)

Fixes debezium/dbz#1530

Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com>

twthorn · 2026-01-21T01:01:44Z

@jpechane this is ready for review

jpechane · 2026-01-23T11:38:25Z

@twthorn Thanks for the PR. I wonder if i is something that should be implemented as connector config or a connector-specific signal, WDYT?

twthorn · 2026-01-23T22:25:08Z

@jpechane thanks for the review. I think that a config fits better here as its inherently persistent.

Consider a busy shard -80 (epoch: 1) and idle shard 80- (epoch: 3). We trigger a new connector generation with a signal. the busy shard -80 would soon update its state (epoch: 2). If there has been no activity for 80- in several days or a week the signal record may fall out of retention of its torage medium (e.g., kafka). Then when a record is finally received for the idle shard 80- the epoch won't be incremented correctly (epoch: 3 remains, should be 4). By making the signal medium retention infinite, users could get around this, but that's a point of failure where the user may misconfigure.

Since it's permanent in source control and a history, I think it makes sense as a config.

twthorn · 2026-01-27T18:49:32Z

@jpechane let me know your thoughts on this, thanks!

Naros

Just a few inline comments @twthorn, if you could take a look.

Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com>

twthorn · 2026-01-30T18:11:22Z

@Naros addressed all comments. Can you take another look when you get the chance? Thanks!

Naros · 2026-02-03T20:05:04Z

@twthorn before I merge this, did you and @jpechane eventually agree on config over signal?

twthorn · 2026-02-04T22:59:39Z

@Naros What do you think of config vs signal? I have not heard back from Jiri, but I believe config is the appropriate fit here.

Naros · 2026-02-05T02:10:10Z

Hi @twthorn, I can see the advantages and disadvantages of both, but I'm not overly familiar with the internal workings of Vitess and its shard system to say either way.

From a general point of view, the signal approach paired with a heartbeat.action.query would be what we recommend for other relational connectors to handle low-activity scenarios where the signal may be sourced from outside the database. But I am not sure whether the use of heartbeat.action.query targets all or specific shards, given your comment above about the low-activity shard?

With the configuration-style approach, everything is driven from a single place, with no moving parts. It does not require database write permissions, nor does it become a concern in the low activity scenario you mentioned.

I think the biggest downside of the configuration approach is that it triggers a connector restart, whereas the benefit of the signal might be that it avoids downtime or restarts.

So suffice it to say, with my limited understanding, I can see how the configuration approach looks better.

But I would appreciate it if you could evaluate my comment above with signal + heartbeats and explain how that might look or work, or why it doesn't, for my own edification.

jpechane · 2026-02-13T13:07:02Z

@twthorn I am fine with merging the current implementation. Still I'd like to see your opnion about approach proposed in debezium/dbz#1602
Do you think that this is something feasible in future - so the code would be reworked to support signal-based trigger and at the same time it owuld be possible to automatically trigger that signal based on connector/offset config?

twthorn · 2026-02-17T00:15:49Z

Thanks for the feedback @Naros and @jpechane.

But I would appreciate it if you could evaluate my comment above with signal + heartbeats and explain how that might look or work, or why it doesn't, for my own edification.

@Naros Signal + heartbeats could work. However, we do not require users to enable heartbeats in order to run Debezium Vitess Connector. In order to enable heartbeats, users need one of the newest Vitess versions to get the new feature, and it must be enabled on both Vitess & Debezium sides. Thus, I don't think it's reasonable to make correctness require heartbeats as most users do not have it enabled.

Still I'd like to see your opinion about approach proposed in debezium/dbz#1602
Do you think that this is something feasible in future - so the code would be reworked to support signal-based trigger and at the same time it would be possible to automatically trigger that signal based on connector/offset config?

@jpechane I replied in that issue thread, hope the context helps.

Overall, I think config makes the most sense with the current requirements and usage patterns. However, I can see two ways of such a config approach working

Connector generation (this PR)

Decouple incrementing epochs from the logic that requires them to be incremented. It allows operators to increment them when they see fit. The flow is:

Roll out a connector without chunking, generation: ) (default to 0)
Update config: generation: 1, chunking: 1234
On restart the connector updates all epochs incrementing them by 1

Cons: users must change two configs (some manual error risk)
Pros: it allows for most flexibility going forward (eg if we have another change that requires epoch to be bumped, it can be done without extra code change)

Automated epoch handling specific to binlog chunking #268

Store a specific property in state: chunking_enabled. Used to track if our rank ever falls backward (requires epoch bump to maintain correct ordering)

Roll out a connector without chunking
Update config: chunking: 1234
On restart the connector sees it went from chunking disabled to chunking enabled, and increments its epoch

Cons: it is tightly coupled, if another change requires epochs to be bumped, it must be added to state as well
Pros: reduces manual error risk and handles any flips back and forth of the config implementing epoch as needed

Overall I prefer the decoupled approach which is what this PR adds. Let me know if you all think one makes more sense. I will also add some more details of why we need this in the PR description

jpechane · 2026-02-17T12:33:39Z

@twthorn Thanks for he thorough explanantion. I think this is good to go as is and we can think about future updates when needed. Good job!

jpechane · 2026-02-17T12:37:39Z

@twthorn Please make sure not only the option but the process is documented as well, thanks!

twthorn added 2 commits January 15, 2026 15:36

debezium/dbz#1530 Support connector generation and epoch incrementing

d3f1128

Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com>

debezium/dbz#1530 Fix itest bug since shard is picked at random

fa3f755

Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com>

twthorn mentioned this pull request Jan 20, 2026

Support VStream transaction chunking (prevents out of memory errors) #268

Open

Naros reviewed Jan 28, 2026

View reviewed changes

Comment thread src/main/java/io/debezium/connector/vitess/pipeline/txmetadata/VitessEpochProvider.java Outdated

Comment thread src/main/java/io/debezium/connector/vitess/pipeline/txmetadata/VitessEpochProvider.java Outdated

debezium/dbz#1530 Use get default, avoid unnecessary map inits

3879f33

Signed-off-by: Thomas Thornton <thomaswilliamthornton@gmail.com>

twthorn requested a review from Naros January 29, 2026 00:08

Naros approved these changes Feb 3, 2026

View reviewed changes

jpechane merged commit bb53dfc into debezium:main Feb 17, 2026
5 checks passed

twthorn mentioned this pull request Feb 20, 2026

Add docs on connector generation debezium/debezium#7089

Merged

Conversation

twthorn commented Jan 15, 2026 • edited by jpechane Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Why Do We Need Manual Epoch Increments

Uh oh!

twthorn commented Jan 21, 2026

Uh oh!

jpechane commented Jan 23, 2026

Uh oh!

twthorn commented Jan 23, 2026

Uh oh!

twthorn commented Jan 27, 2026

Uh oh!

Naros left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

twthorn commented Jan 30, 2026

Uh oh!

Naros commented Feb 3, 2026

Uh oh!

twthorn commented Feb 4, 2026

Uh oh!

Naros commented Feb 5, 2026

Uh oh!

jpechane commented Feb 13, 2026

Uh oh!

twthorn commented Feb 17, 2026

Connector generation (this PR)

Automated epoch handling specific to binlog chunking #268

Uh oh!

Uh oh!

jpechane commented Feb 17, 2026

Uh oh!

jpechane commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

twthorn commented Jan 15, 2026 •

edited by jpechane

Loading