Razer Core X eGPU not working on Thinkpad + Lubuntu 18.04

I just connected a Razer Core X with an eGPU to my Thinkpad for the first time. The fans are spinning, but nvidia-smi does not show the eGPU.

What can I do?

Answer 1

First, check dmesg | tail -n 200. It may show something like this:

[   74.959198] thunderbolt 0000:06:00.0: current switch config:
[   74.959201] thunderbolt 0000:06:00.0:  Switch: 8086:15da (Revision: 6, TB Version: 2)
[   74.959202] thunderbolt 0000:06:00.0:   Max Port Number: 11
[   74.959203] thunderbolt 0000:06:00.0:   Config:
[   74.959204] thunderbolt 0000:06:00.0:    Upstream Port Number: 1 Depth: 1 Route String: 0x3 Enabled: 1, PlugEventsDelay: 254ms
[   74.959205] thunderbolt 0000:06:00.0:    unknown1: 0x0 unknown4: 0x0
[   74.999560] thunderbolt 0000:06:00.0: 3: reading drom (length: 0x56)
[   75.301575] thunderbolt 0000:06:00.0: 3: uid: 0x1279cc9b0ba8400
[   75.301686] thunderbolt 0000:06:00.0:  Port 0: 8086:15d3 (Revision: 6, TB Version: 1, Type: Port (0x1))
[   75.301689] thunderbolt 0000:06:00.0:   Max hop id (in/out): 7/7
[   75.301692] thunderbolt 0000:06:00.0:   Max counters: 8
[   75.301694] thunderbolt 0000:06:00.0:   NFC Credits: 0x800000
[   75.302174] thunderbolt 0000:06:00.0:  Port 1: 8086:15d3 (Revision: 6, TB Version: 1, Type: Port (0x1))
[   75.302178] thunderbolt 0000:06:00.0:   Max hop id (in/out): 15/15
[   75.302180] thunderbolt 0000:06:00.0:   Max counters: 16
[   75.302183] thunderbolt 0000:06:00.0:   NFC Credits: 0x7800000
[   75.302681] thunderbolt 0000:06:00.0:  Port 2: 8086:15d3 (Revision: 6, TB Version: 1, Type: Port (0x1))
[   75.302683] thunderbolt 0000:06:00.0:   Max hop id (in/out): 15/15
[   75.302685] thunderbolt 0000:06:00.0:   Max counters: 16
[   75.302687] thunderbolt 0000:06:00.0:   NFC Credits: 0x0
[   75.302689] thunderbolt 0000:06:00.0: 3:3: disabled by eeprom
[   75.302691] thunderbolt 0000:06:00.0: 3:4: disabled by eeprom
[   75.302692] thunderbolt 0000:06:00.0: 3:5: disabled by eeprom
[   75.302806] thunderbolt 0000:06:00.0:  Port 6: 8086:15d3 (Revision: 6, TB Version: 1, Type: PCIe (0x100102))
[   75.302808] thunderbolt 0000:06:00.0:   Max hop id (in/out): 8/8
[   75.302809] thunderbolt 0000:06:00.0:   Max counters: 2
[   75.302811] thunderbolt 0000:06:00.0:   NFC Credits: 0x800000
[   75.302960] thunderbolt 0000:06:00.0:  Port 7: 8086:15d3 (Revision: 6, TB Version: 1, Type: PCIe (0x100101))
[   75.302962] thunderbolt 0000:06:00.0:   Max hop id (in/out): 8/8
[   75.302964] thunderbolt 0000:06:00.0:   Max counters: 2
[   75.302966] thunderbolt 0000:06:00.0:   NFC Credits: 0x800000
[   75.302967] thunderbolt 0000:06:00.0: 3:8: disabled by eeprom
[   75.302969] thunderbolt 0000:06:00.0: 3:9: disabled by eeprom
[   75.302971] thunderbolt 0000:06:00.0: 3:a: disabled by eeprom
[   75.302973] thunderbolt 0000:06:00.0: 3:b: disabled by eeprom

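This points to a permission/security problem: the Thunderbolt controller sees the enclosure, but the PCIe connection to it has not been authorized.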
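The kernel log is noisy, so it can help to look only at the lines from the thunderbolt driver. The following is a minimal sketch; the helper name tb_log_lines is made up for illustration and is not part of any tool used in this answer.

```shell
# Hypothetical helper: keep only kernel log lines from the thunderbolt driver.
# It reads from stdin, so it works on live dmesg output or on a saved log file.
tb_log_lines() {
    grep -i 'thunderbolt'
}

# Usage on a live system (reading dmesg may require root on some setups):
#   dmesg | tb_log_lines | tail -n 50
```

Piping through a function like this makes it easier to spot driver messages that would otherwise scroll past among unrelated output.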

Let's install the Thunderbolt management tools so that we can fix it:

sudo apt install thunderbolt-tools

Now, let's check whether Thunderbolt can see the enclosure:

root@mymachine:~# tbtadm devices
0-4 Razer Core X  non-authorized  not in ACL

It can!
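The same state is exposed by the kernel under /sys/bus/thunderbolt/devices, the path that also appears in the tbtadm output in this answer. As a rough sketch, and assuming the standard device_name and authorized attributes of the Linux thunderbolt driver, it can be listed without any extra tools. The function name tb_list_devices is hypothetical.

```shell
# Hypothetical helper: print name and authorization state of each
# Thunderbolt device found under a sysfs directory.
# $1: directory to scan (defaults to the real sysfs location).
tb_list_devices() {
    base="${1:-/sys/bus/thunderbolt/devices}"
    for dev in "$base"/*; do
        # Entries such as domain0 have no "authorized" attribute; skip them.
        [ -f "$dev/authorized" ] || continue
        name=$(cat "$dev/device_name" 2>/dev/null || echo unknown)
        printf '%s\t%s\tauthorized=%s\n' \
            "$(basename "$dev")" "$name" "$(cat "$dev/authorized")"
    done
}

# Usage: tb_list_devices
```

A value of authorized=0 corresponds to the "non-authorized" state that tbtadm reports.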

Now, let's authorize the enclosure:

tbtadm approve 0-4

This prints:

Authorizing "/sys/bus/thunderbolt/devices/0-4"
Already in ACL
system:5 Input/output error

Unplugging and replugging the enclosure and checking dmesg again shows:

[11187.232181] thunderbolt 0000:06:00.0: PCIe tunnel creation failed

So let's take another look at Thunderbolt:

root@mymachine:~# tbtadm devices
0-4 Razer Core X  non-authorized  in ACL

In fact, we can see that the enclosure is connected:

root@mymachine:~# tbtadm acl
0XXXXXb0-XXXX-XXXX-ffff-ffffffffffff  Razer Core X  connected

Let's try to authorize it manually:

root@mymachine:~# echo '1' > /sys/bus/thunderbolt/devices/0-4/authorized 
-bash: echo: write error: Input/output error

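At this point, I suspected that the BIOS might be the problem. So reboot and bring up the BIOS settings. The Thunderbolt security level was set to "User Authorization", but let's go for the drastic "No Security" option (it may be worth figuring out later how to lock things down again). [Screenshot: BIOS Thunderbolt settings]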

Now boot the machine again.

Before plugging in the GPU, make sure the Nvidia driver is loaded:

sudo modprobe nvidia-uvm

Then plug in the enclosure and try to find the GPU:

nvidia-smi

Success!

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.87.00    Driver Version: 418.87.00    CUDA Version: 10.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce RTX 208...  Off  | 00000000:3D:00.0 Off |                  N/A |
| 15%   36C    P0     1W / 250W |      0MiB / 10989MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
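If nvidia-smi still shows nothing at this stage, one thing worth ruling out is that the kernel modules never loaded. Here is a small sketch that extracts the Nvidia modules from lsmod output; the helper name nvidia_modules_loaded is invented for this example. Note that modprobe nvidia-uvm shows up in lsmod as nvidia_uvm.

```shell
# Hypothetical helper: print the names of loaded modules that start with
# "nvidia". Reads lsmod-style output on stdin (module name in column 1).
nvidia_modules_loaded() {
    awk '$1 ~ /^nvidia/ { print $1 }'
}

# Usage:
#   lsmod | nvidia_modules_loaded
# To check that the card is visible on the PCI bus at all:
#   lspci | grep -i nvidia
```

Empty output from the first command suggests a driver problem rather than a Thunderbolt problem.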

According to this document, the BIOS settings mean:

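  • No Security: Allow Thunderbolt devices to be connected automatically.
  • User Authorization: Allow Thunderbolt devices to be connected after user authorization.
  • Secure Connect: Allow Thunderbolt devices to be connected using a saved key that has been approved by the user.
  • Display Port and USB: Allow only display outputs and USB devices to be connected. Thunderbolt devices are not allowed to connect.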
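The Linux thunderbolt driver reports the active security level in the security attribute of the domain, with values such as none, user, secure and dponly. The sketch below maps those values onto the BIOS labels listed above. The mapping is my assumption based on the descriptions (the BIOS wording varies by vendor), and the function name tb_security_label and the domain0 path are used for illustration only.

```shell
# Hypothetical helper: translate a kernel Thunderbolt security level into
# the BIOS label listed above. The label mapping is an assumption.
tb_security_label() {
    case "$1" in
        none)   echo "No Security" ;;
        user)   echo "User Authorization" ;;
        secure) echo "Secure Connect" ;;
        dponly) echo "Display Port and USB" ;;
        *)      echo "unknown ($1)" ;;
    esac
}

# Usage (assumes the controller is the first domain, domain0):
#   tb_security_label "$(cat /sys/bus/thunderbolt/devices/domain0/security)"
```

After switching the BIOS to "No Security", the attribute would be expected to read none, which gives a quick way to confirm that the BIOS change took effect.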
