![華碩筆記型電腦無法安裝NVIDIA驅動程式:NVIDIA-SMI因無法與NVIDIA驅動程式通訊而失敗](https://rvso.com/image/1711455/%E8%8F%AF%E7%A2%A9%E7%AD%86%E8%A8%98%E5%9E%8B%E9%9B%BB%E8%85%A6%E7%84%A1%E6%B3%95%E5%AE%89%E8%A3%9DNVIDIA%E9%A9%85%E5%8B%95%E7%A8%8B%E5%BC%8F%EF%BC%9ANVIDIA-SMI%E5%9B%A0%E7%84%A1%E6%B3%95%E8%88%87NVIDIA%E9%A9%85%E5%8B%95%E7%A8%8B%E5%BC%8F%E9%80%9A%E8%A8%8A%E8%80%8C%E5%A4%B1%E6%95%97.png)
我嘗試在我的全新 ASUS Vivobook n580gd 上安裝 NVIDIA CUDA 驅動程序,但沒有成功。經過大量谷歌搜尋後我仍然沒有找到解決方案。
我預先感謝您的任何建議/建議。
我的筆記型電腦有兩張顯示卡:一張 Intel UHD Graphics 630 和一張 NVIDIA GeForce GTX 1050 Mobile。它附帶 Windows10,但我使用 CentOS-7.6 雙重啟動它。
這就是我到目前為止所做的:
- 我在 Windows 10(已存在於筆記型電腦中)旁邊執行了 CentOS-7 的雙重開機安裝。為了順利地從 USB 啟動 CentOS,我需要使用「nomodeset」標誌。
- 我按照官方安裝指南進行的https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html
- 我安裝了 GCC 和其他必要的軟體包
- 我從這裡下載rpm(本地版本)文件https://developer.nvidia.com/cuda-downloads?target_os=Linux&target_arch=x86_64&target_distro=CentOS&target_version=7&target_type=rpmlocal
- 我使用指南中建議的命令安裝了 cuda 和驅動程式。 (特別是“sudo yum install cuda”)
- 我將 cuda 二進位檔案新增至 PATH 變量
然後我重新啟動並嘗試檢查驅動程式:
$ nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
$ nvidia-settings
ERROR: NVIDIA driver is not loaded
ERROR: Unable to load info from any available system
$ ll /dev/nvidia*
ls: cannot access /dev/nvidia*: No such file or directory
$ sudo lshw -numeric -C display
*-display UNCLAIMED
description: 3D controller
product: GP107M [GeForce GTX 1050 Mobile] [10DE:1C8D]
vendor: NVIDIA Corporation [10DE]
physical id: 0
bus info: pci@0000:01:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress bus_master cap_list
configuration: latency=0
resources: memory:a3000000-a3ffffff memory:90000000-9fffffff memory:a0000000-a1ffffff ioport:4000(size=128) memory:a4000000-a407ffff
*-display
description: VGA compatible controller
product: UHD Graphics 630 (Mobile) [8086:3E9B]
vendor: Intel Corporation [8086]
physical id: 2
bus info: pci@0000:00:02.0
version: 00
width: 64 bits
clock: 33MHz
capabilities: pciexpress msi pm vga_controller bus_master cap_list rom
configuration: driver=i915 latency=0
resources: irq:128 memory:a2000000-a2ffffff memory:80000000-8fffffff ioport:5000(size=64)
**一些額外的信息**
作業系統:
$ cat /etc/redhat-release
CentOS Linux release 7.6.1810 (Core)
海灣合作委員會:
$ gcc --version
gcc (GCC) 4.8.5 20150623 (Red Hat 4.8.5-36)
核心版本:
$ uname -a
Linux centos 3.10.0-957.21.2.el7.x86_64 #1 SMP Wed Jun 5 14:26:44 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
安全啟動關閉:
$ mokutil --sb-state
SecureBoot disabled
顯示卡:
$ nvidia-detect -v
Probing for supported NVIDIA devices...
[8086:3e9b] Intel Corporation UHD Graphics 630 (Mobile)
[10de:1c8d] NVIDIA Corporation GP107M [GeForce GTX 1050 Mobile]
This device requires the current 418.74 NVIDIA driver kmod-nvidia
An Intel display controller was also detected
有關顯示卡的詳細信息
$ sudo lspci -v -s 01:00.0
01:00.0 3D controller: NVIDIA Corporation GP107M [GeForce GTX 1050 Mobile] (rev a1)
Subsystem: ASUSTeK Computer Inc. Device 1fc0
Flags: bus master, fast devsel, latency 0, IRQ 255
Memory at a3000000 (32-bit, non-prefetchable) [size=16M]
Memory at 90000000 (64-bit, prefetchable) [size=256M]
Memory at a0000000 (64-bit, prefetchable) [size=32M]
I/O ports at 4000 [disabled] [size=128]
Expansion ROM at a4000000 [disabled] [size=512K]
Capabilities: [60] Power Management version 3
Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Endpoint, MSI 00
Capabilities: [100] Virtual Channel
Capabilities: [250] Latency Tolerance Reporting
Capabilities: [258] L1 PM Substates
Capabilities: [128] Power Budgeting <?>
Capabilities: [420] Advanced Error Reporting
Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
Capabilities: [900] #19
Kernel modules: nouveau
$ sudo lspci -v -s 00:02.0
00:02.0 VGA compatible controller: Intel Corporation UHD Graphics 630 (Mobile) (prog-if 00 [VGA controller])
Subsystem: ASUSTeK Computer Inc. Device 1fc0
Flags: bus master, fast devsel, latency 0, IRQ 127
Memory at a2000000 (64-bit, non-prefetchable) [size=16M]
Memory at 80000000 (64-bit, prefetchable) [size=256M]
I/O ports at 5000 [size=64]
[virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
Capabilities: [40] Vendor Specific Information: Len=0c <?>
Capabilities: [70] Express Root Complex Integrated Endpoint, MSI 00
Capabilities: [ac] MSI: Enable+ Count=1/1 Maskable- 64bit-
Capabilities: [d0] Power Management version 2
Capabilities: [100] Process Address Space ID (PASID)
Capabilities: [200] Address Translation Service (ATS)
Capabilities: [300] Page Request Interface (PRI)
Kernel driver in use: i915
Kernel modules: i915
安裝的(?)驅動程式:
$ yum list installed | grep nvidia
dkms-nvidia.x86_64 3:418.67-1.el7 @cuda
nvidia-detect.x86_64 418.74-2.el7.elrepo @elrepo
nvidia-driver.x86_64 3:418.67-4.el7 @cuda
nvidia-driver-NVML.x86_64 3:418.67-4.el7 @cuda
nvidia-driver-NvFBCOpenGL.x86_64 3:418.67-4.el7 @cuda
nvidia-driver-cuda.x86_64 3:418.67-4.el7 @cuda
nvidia-driver-cuda-libs.x86_64 3:418.67-4.el7 @cuda
nvidia-driver-devel.x86_64 3:418.67-4.el7 @cuda
nvidia-driver-libs.x86_64 3:418.67-4.el7 @cuda
nvidia-libXNVCtrl.x86_64 3:418.67-1.el7 @cuda
nvidia-libXNVCtrl-devel.x86_64 3:418.67-1.el7 @cuda
nvidia-modprobe.x86_64 3:418.67-1.el7 @cuda
nvidia-persistenced.x86_64 3:418.67-1.el7 @cuda
nvidia-settings.x86_64 3:418.67-1.el7 @cuda
nvidia-xconfig.x86_64 3:418.67-1.el7 @cuda
Nvcc編譯器:
$ nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Wed_Apr_24_19:10:27_PDT_2019
Cuda compilation tools, release 10.1, V10.1.168
先感謝您!
答案1
我解決了這個問題。我的錯,我忘了安裝 kernel-devel 套件。
安裝後:
sudo yum install kernel-devel-$(uname -r)
並重新啟動,驅動程式正在工作。
$ nvidia-smi
Tue Jun 11 23:15:28 2019
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.67 Driver Version: 418.67 CUDA Version: 10.1 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 GeForce GTX 1050 Off | 00000000:01:00.0 Off | N/A |
| N/A 44C P8 N/A / N/A | 0MiB / 4042MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| No running processes found |
+-----------------------------------------------------------------------------+
$ sudo lshw -numeric -C display
*-display
description: 3D controller
product: GP107M [GeForce GTX 1050 Mobile] [10DE:1C8D]
vendor: NVIDIA Corporation [10DE]
physical id: 0
bus info: pci@0000:01:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress bus_master cap_list rom
configuration: driver=nvidia latency=0
resources: irq:508 memory:a3000000-a3ffffff memory:90000000-9fffffff memory:a0000000-a1ffffff ioport:4000(size=128) memory:a4000000-a407ffff
*-display
description: VGA compatible controller
product: UHD Graphics 630 (Mobile) [8086:3E9B]
vendor: Intel Corporation [8086]
physical id: 2
bus info: pci@0000:00:02.0
version: 00
width: 64 bits
clock: 33MHz
capabilities: pciexpress msi pm vga_controller bus_master cap_list rom
configuration: driver=i915 latency=0
resources: irq:128 memory:a2000000-a2ffffff memory:80000000-8fffffff ioport:5000(size=64)
ll /dev/nvidia*
crw-rw-rw-. 1 root root 195, 0 Jun 11 23:12 /dev/nvidia0
crw-rw-rw-. 1 root root 195, 255 Jun 11 23:12 /dev/nvidiactl
crw-rw-rw-. 1 root root 195, 254 Jun 11 23:12 /dev/nvidia-modeset
crw-rw-rw-. 1 root root 241, 0 Jun 11 23:12 /dev/nvidia-uvm
crw-rw-rw-. 1 root root 241, 1 Jun 11 23:12 /dev/nvidia-uvm-tools