Fix Backup getting restarted #227

Merged

@Akrog Akrog commented Jul 11, 2023

Due to a bug in the cinder backup service [1], the service may appear to be down while it runs multiple concurrent operations.

When the service appears to be down, the probe detects it and ends up restarting the service, which aborts all ongoing operations.

Until the bug is fixed, we increase service_down_time to 3 minutes. This is not ideal because it also affects the cinder volume service, although it has the side benefit of hiding the false down state from human operators.

The alternative would be to change the probing configuration just for the backup service.

[1]: https://bugs.launchpad.net/cinder/+bug/2026877
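
The effect of the longer timeout can be sketched as follows. This is only a simplified illustration of a heartbeat-age check, not cinder's actual code: the `appears_up` helper, the timestamps, and the 90-second stall are hypothetical, and the 60-second value assumes the upstream default for service_down_time.

```python
from datetime import datetime, timedelta, timezone


def appears_up(last_heartbeat, now, service_down_time):
    """Return True if the last heartbeat is within service_down_time seconds.

    Hypothetical helper: a simplified stand-in for a heartbeat-age check.
    """
    elapsed = abs((now - last_heartbeat).total_seconds())
    return elapsed <= service_down_time


# Hypothetical scenario: the backup service is busy with concurrent
# operations and has not reported a heartbeat for 90 seconds.
now = datetime(2023, 7, 11, 15, 0, 0, tzinfo=timezone.utc)
last_heartbeat = now - timedelta(seconds=90)

# With an assumed 60-second threshold the service looks down.
print(appears_up(last_heartbeat, now, 60))

# With the 3-minute threshold from this PR the same stall is tolerated.
print(appears_up(last_heartbeat, now, 180))
```

Under these assumptions, a stall shorter than 3 minutes no longer flips the service to down, so the probe has nothing to react to. A stall longer than that would still be reported as down.
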
@openshift-ci openshift-ci bot requested review from abays and dprince July 11, 2023 15:12

Akrog commented Jul 14, 2023

/test all


fmount commented Jul 21, 2023

/test cinder-operator-build-deploy-kuttl


openshift-ci bot commented Jul 21, 2023

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: Akrog, fmount

@openshift-merge-robot openshift-merge-robot merged commit 2570d0e into openstack-k8s-operators:main Jul 21, 2023
2 checks passed
ASBishop pushed a commit to ASBishop/cinder-operator that referenced this pull request Mar 11, 2024
…rators/renovate/openstack-k8s-operators

Update openstack-k8s-operators