GPUs are frequently used to speed up highly parallel processes such as rendering or machine learning applications. This runbook contains commands that check the compute, memory utilization, and other metrics for your GPUs and helps to debug potential issues related to this.
Parameters
Debug
Check the configuration of the graphics card
List all NVIDIA GPUs
Displays NVIDIA gpu info, including driver version, number of GPUs, product name, display mode, etc.
Display gpu utilization.
Display gpu performance.
Display gpu clock information.
Display gpu power information.
Display gpu memory usage.
Display and constantly refresh all info about the NVIDIA GPU
Repair
Update or install NVIDIA driver.
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.