nvidia_runtime_setup
Ansible role to configure NVIDIA driver/runtime readiness on Debian-family hosts.
What it does
- Detects NVIDIA GPU hardware via
lspci - Auto-selects a recommended driver on Ubuntu (or uses an explicit package pin)
- Installs the NVIDIA driver package
- Optionally installs CUDA toolkit and NVIDIA container toolkit
- Handles optional reboot logic
- Verifies readiness with
nvidia-smi
Safe defaults
- Reboot is disabled by default (
nvidia_runtime_reboot_if_needed: false) - CUDA and container toolkit installs are disabled by default
- Validation is enabled by default and fails if
nvidia-smiis unavailable
Example
---
- name: Configure NVIDIA runtime for AI nodes
hosts: ai_nodes
become: true
roles:
- role: nvidia_runtime_setup
vars:
nvidia_runtime_reboot_if_needed: true