Labels
- sig/node: Categorizes an issue or PR as relevant to SIG Node.
- tracked/no: Denotes an enhancement issue is NOT actively being tracked by the Release Team.
- wg/device-management: Categorizes an issue or PR as relevant to WG Device Management.
Enhancement Description
DRA drivers may encounter errors such that the NodePrepareResources gRPC call to the driver can never succeed for the devices allocated by kube-scheduler for a pod. Currently, pods in that state are retried indefinitely, wasting CPU cycles in the kubelet and the DRA driver. This proposal describes a way to break that cycle of retries that are known to fail.
/sig node
/wg device-management
/assign @nojnhuh
/cc @pohly @lauralorenz @SergeyKanzhelev
- One-line enhancement description (can be used as a release note): DRA: Handle permanent driver failures
- Kubernetes Enhancement Proposal: https://github.com/nojnhuh/enhancements/blob/5322-dra-perma-err/keps/sig-node/5322-dra-driver-permanent-failure/README.md
- Discussion Link:
- Primary contact (assignee): @nojnhuh
- Responsible SIGs: SIG Node
- Enhancement target (which target equals to which milestone):
- Alpha release target (x.y):
- Beta release target (x.y):
- Stable release target (x.y):
- Alpha
  - KEP (`k/enhancements`) update PR(s):
  - Code (`k/k`) update PR(s):
  - Docs (`k/website`) update PR(s):
Please keep this description up to date. This will help the Enhancement Team to track the evolution of the enhancement efficiently.