ebs br: backup tolerates the rolling restart #5436

Closed
BornChanger opened this issue Dec 3, 2023 · 1 comment

Comments

@BornChanger
Contributor

Feature Request

Is your feature request related to a problem? Please describe:

For a large cluster with more than 120 TiKV nodes, a rolling restart can last more than 10 hours. An EBS snapshot backup taken during that window can fail because the lightning suspension cannot be kept in effect on all TiKV nodes while the snapshot checkpoint is being generated.

Describe the feature you'd like:

We need to pause the evict-leader-scheduler and add retry logic for the lightning suspension requests sent to TiKV nodes.
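To illustrate the retry part of the request, here is a minimal sketch in Python. It is not the BR implementation. The names `suspend_with_retry`, `send_suspend`, and `StoreUnavailable` are hypothetical, and the backoff policy (exponential, doubling from `base_delay`) is an assumption. The idea is that a store that is mid-restart fails the request, and only the failed stores are retried on the next round.

```python
import time
from typing import Callable, Iterable, List


class StoreUnavailable(Exception):
    """Hypothetical error: raised when a TiKV node cannot be reached (e.g. restarting)."""


def suspend_with_retry(
    stores: Iterable[str],
    send_suspend: Callable[[str], None],  # hypothetical per-store request function
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Send the suspension request to every store, retrying only the failures.

    Returns the stores that still failed after max_attempts rounds
    (an empty list means every store accepted the request).
    """
    pending = list(stores)
    for attempt in range(max_attempts):
        failed = []
        for store in pending:
            try:
                send_suspend(store)
            except StoreUnavailable:
                # The node is probably being restarted; try it again next round.
                failed.append(store)
        if not failed:
            return []
        pending = failed
        if attempt < max_attempts - 1:
            # Exponential backoff between rounds: base_delay, 2x, 4x, ...
            sleep(base_delay * 2 ** attempt)
    return pending
```

A caller would treat a non-empty return value as a failure to hold the suspension and abort or postpone the snapshot. Injecting `sleep` keeps the function testable without real waiting.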

Describe alternatives you've considered:

Teachability, Documentation, Adoption, Migration Strategy:

@BornChanger
Contributor Author

The fix is in TiDB PR pingcap/tidb#49154.
