Extend liveness probe timing #156

bshephar · 2023-07-03T01:50:52Z

The Horizon pod is fairly frequently killed due to the current liveness probe timing. This seems to happen more frequently during the initial deployment while the system is under load. This change extends the liveness probe to provide some extra time for kubelet to validate the functionality of the Horizon pod before terminating it.

abays · 2023-07-03T08:43:40Z

The Horizon pod is fairly frequently killed due to the current liveness probe timing. This seems to happen more frequently during the initial deployment while the system is under load. This change extends the liveness probe to provide some extra time for kubelet to validate the functionality of the Horizon pod before terminating it.

If this failure happens during the initial deployment, I wonder if it would be prudent to consider adding a startup probe [1]?

[1] https://kubernetes.io/docs/tasks/configure-pod-container/configure-liveness-readiness-startup-probes/#define-startup-probes

The Horizon pod is fairly frequently killed due to the current liveness probe timing. This seems to happen more frequently during the initial deployment while the system is under load. This change adds the startup probe to provide some extra time for kubelet to validate the functionality of the Horizon pod before terminating it. Signed-off-by: Brendan Shephard <[email protected]>

bshephar · 2023-07-03T11:51:04Z

The Horizon pod is fairly frequently killed due to the current liveness probe timing. This seems to happen more frequently during the initial deployment while the system is under load. This change extends the liveness probe to provide some extra time for kubelet to validate the functionality of the Horizon pod before terminating it.

If this failure happens during the initial deployment, I wonder if it would be prudent to consider adding a startup probe [1]?

[1] https://kubernetes.io/docs/tasks/configure-pod-container/configure-liveness-readiness-startup-probes/#define-startup-probes

Sounds reasonable. Let's give it a run and see how it goes. I changed my commit to reflect this.

abays

/lgtm

openshift-ci · 2023-07-03T14:00:41Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: abays, bshephar

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [abays,bshephar]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci bot requested review from abays and viroel July 3, 2023 01:51

openshift-ci bot added the approved label Jul 3, 2023

bshephar force-pushed the extend-liveness-check branch 2 times, most recently from be40c02 to 74d33e0 Compare July 3, 2023 02:15

bshephar force-pushed the extend-liveness-check branch from 74d33e0 to e0abf14 Compare July 3, 2023 11:50

abays approved these changes Jul 3, 2023

View reviewed changes

openshift-ci bot assigned abays Jul 3, 2023

openshift-ci bot added the lgtm label Jul 3, 2023

openshift-merge-robot merged commit fb6d1d7 into openstack-k8s-operators:main Jul 3, 2023
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend liveness probe timing #156

Extend liveness probe timing #156

bshephar commented Jul 3, 2023

abays commented Jul 3, 2023

bshephar commented Jul 3, 2023

abays left a comment

openshift-ci bot commented Jul 3, 2023

Extend liveness probe timing #156

Extend liveness probe timing #156

Conversation

bshephar commented Jul 3, 2023

abays commented Jul 3, 2023

bshephar commented Jul 3, 2023

abays left a comment

Choose a reason for hiding this comment

openshift-ci bot commented Jul 3, 2023