Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The resolvedTs of a changefeed get stuck during initialization if there are more than 400k resolvedTs per second sending from upstream. #11066

Closed
fubinzh opened this issue May 9, 2024 · 4 comments · Fixed by #10506
Assignees
Labels
affects-7.5 area/ticdc Issues or PRs related to TiCDC. severity/major This is a major bug. type/bug This is a bug.

Comments

@fubinzh
Copy link

fubinzh commented May 9, 2024

What did you do?

  1. TiDB cluster with 8.1 TiKV and 7.5 CDC
  2. create kafka simple protocol changefeed
  3. no workload running

What did you expect to see?

CDC resolved ts should be normal

What did you see instead?

CDC resolved ts stucks
image
Uploading image.png…

Versions of the cluster

TiKV version
Release Version: 8.1.0
Git Commit Hash: 56613f7c3e28c02853cc51d15bc1b77f68b58be8

CDC version:
Release Version: 7.5
Git Commit Hash: 29dae6c

@fubinzh fubinzh added area/ticdc Issues or PRs related to TiCDC. type/bug This is a bug. labels May 9, 2024
@github-actions github-actions bot added this to Need Triage in Question and Bug Reports May 9, 2024
@fubinzh
Copy link
Author

fubinzh commented May 9, 2024

/assign @asddongmen

@asddongmen
Copy link
Contributor

It shows that the frontier consume a huge amount of cpu:
img_v3_02an_f2a63726-5d98-47f6-b129-6480c450248g

And log below can be found in the upstream TiKV:

> wc -l tikv-2024-05-08T15-09-29.950.log
 6186489 tikv-2024-05-08T15-09-29.950.log
> grep "cdc send event failed, full" tikv-2024-05-08T15-09-29.950.log | wc -l
 6186448

It seems that the issue arises because the number of tables and the resolvedTs value exceed the frontier's ability.

@asddongmen asddongmen changed the title Resolve ts stucks The resolvedTs of a changefeed get stuck during initialization if there are more than 400k resolvedTs per second sending from upstream. May 13, 2024
@asddongmen
Copy link
Contributor

asddongmen commented May 13, 2024

cc @hicqu To fix this issue, #10506 is needed cherry-pick to v7.5.2.

@fubinzh
Copy link
Author

fubinzh commented May 13, 2024

/severity major

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
affects-7.5 area/ticdc Issues or PRs related to TiCDC. severity/major This is a major bug. type/bug This is a bug.
Development

Successfully merging a pull request may close this issue.

4 participants