Add env variable to signal skip vfio-pci unbind #2079

karthikvetrivel · 2026-01-29T15:15:59Z

Relevant PR: NVIDIA/k8s-driver-manager#146

Description

Pass GPU_WORKLOAD_CONFIG environment variable to k8s-driver-manager init container in vfio-manager DaemonSet to prevent unnecessary GPU unbind/rebind operations during rolling updates.

Problem

During rolling updates of the vfio-manager DaemonSet, k8s-driver-manager unconditionally unbinds all GPUs from vfio-pci on startup. When the desired state is already vfio-pci binding, this causes unnecessary disruption to active VM workloads using GPU passthrough (KubeVirt, Kata Containers).

Design Rationale

We know that vfio-manager only runs on vm-passthrough nodes: The DaemonSet's nodeSelector requires nvidia.com/gpu.deploy.vfio-manager: "true", which is only set for gpuWorkloadConfigVMPassthrough nodes. This is true regardless of whether the workload config comes from an explicit node label or sandboxWorkloads.defaultWorkload.

Checklist

No secrets, sensitive information, or unrelated changes
Lint checks passing (make lint)
Generated assets in-sync (make validate-generated-assets)
Go mod artifacts in-sync (make validate-modules)
Test cases are added for new code paths

Signed-off-by: Karthik Vetrivel <kvetrivel@nvidia.com>

Add env variable to signal skip vfio-pci unbind

31361f1

Signed-off-by: Karthik Vetrivel <kvetrivel@nvidia.com>

karthikvetrivel requested review from ArangoGutierrez, cdesiniotis, elezar, shivamerla and tariq1890 as code owners January 29, 2026 15:16

karthikvetrivel mentioned this pull request Jan 29, 2026

Skip vfio-pci unbind when GPUs already bound in VFIO mode NVIDIA/k8s-driver-manager#146

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add env variable to signal skip vfio-pci unbind #2079

Add env variable to signal skip vfio-pci unbind #2079

karthikvetrivel commented Jan 29, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add env variable to signal skip vfio-pci unbind #2079

Are you sure you want to change the base?

Add env variable to signal skip vfio-pci unbind #2079

Conversation

karthikvetrivel commented Jan 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Problem

Design Rationale

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

karthikvetrivel commented Jan 29, 2026 •

edited

Loading