it couldn‘t communicate with the NVIDIA driver报错

前段时间刚装的驱动,用GPU跑模型时,竟然提示不存在,什么破?

NVIDIA-SMI has failed because it couldn‘t communicate with the NVIDIA driver

 

谢邀,出现这种情况最直接的方式是重装驱动。

如果你觉得重装比较麻烦,也可以尝试下面的方法:

(1) 打开终端,先用nvidia-smi查看一下,看是否发现如下报错:

NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. 
Make sure that the latest NVIDIA driver is installed and running.

(2) 使用nvcc -V检查驱动和cuda

发现驱动是存在的,于是进行下一步

(3) 查看已安装驱动的版本信息

ls /usr/src | grep nvidia

(4) 第四步,依次输入以下命令

sudo apt-get install dkms

sudo dkms install -m nvidia -v 450.57

等待安装完成后,再次输入nvidia-smi,查看GPU使用状态, 就没问题了。