Karpenter 1.1.0 not scaling down empty nodes #7466
Labels
bug
Something isn't working
lifecycle/stale
triage/needs-information
Marks that the issue still needs more information to properly triage
Description
After upgrading to 1.1.0 I noticed that karpenter will not delete nodeclaim and terminate nodes. My nodeclass:
When there are no pods on the node (excluding daemonset pods), Karpenter marks node for deletion:
And indeed, the node is tainted. However, nothing happens after that. Nodeclaim is never deleted, and as a result empty nodes are just hanging around unschedulable (because of the taint).
I enabled debug logs but found nothing there, no errors or warnings. I am 100% certain there are no non-daemonset pods on the dangling nodes. Sometimes, scale down happens, maybe 1 time out of 10. In all cases I tested by scheduling 2 pods so that Karpenter creates 2 nodes. There are no other karpenter managed nodes in the cluster.
The text was updated successfully, but these errors were encountered: