Ubuntu 20.04 隨機當機,懷疑 Nvidia 驅動

Ubuntu 20.04 隨機當機,懷疑 Nvidia 驅動

Ubuntu 20.04 隨機凍結,有時在 20 分鐘後,有時在幾個小時後。

這意味著我無法移動滑鼠,也無法使用鍵盤執行任何操作。如果正在播放聲音,則會在約 2 秒的時間內循環播放。鍵盤組合沒有任何作用。只有使用電腦電源開關進行硬重啟才能重新啟動我的電腦。

根據這個帖子我收集了一些信息,儘管我是這些事情的新手,並且無法真正理解其中的任何內容:

free -h

total        used        free      shared  buff/cache   available
Mem:           15Gi       4,1Gi       7,3Gi       103Mi       4,0Gi        10Gi
Swap:         2,0Gi          0B       2,0Gi
sysctl vm.swappiness

vm.swappiness = 60
ls -al /var/crash
total 8
drwxrwsrwt  2 root whoopsie 4096 Dez 17 12:03 .
drwxr-xr-x 14 root root     4096 Feb  9  2021 ..
sudo lshw -C memory
*-firmware                
   description: BIOS
   vendor: AMI
   physical id: 0
   version: F.42
   date: 05/28/2020
   size: 64KiB
   capacity: 16MiB
   capabilities: pci pnp upgrade shadowing cdboot bootselect edd int5printscreen int9keyboard int14serial int17printer acpi usb biosbootspecification netboot uefi
*-memory
   description: System Memory
   physical id: 42
   slot: System board or motherboard
   size: 16GiB
 *-bank:0
      description: DIMM DDR4 Synchronous 2666 MHz (0,4 ns)
      product: M378A1G43TB1-CTD
      vendor: Samsung
      physical id: 0
      serial: 719AF9F3
      slot: ChannelA-DIMM0
      size: 8GiB
      width: 64 bits
      clock: 2666MHz (0.4ns)
 *-bank:1
      description: DIMM DDR4 Synchronous 2666 MHz (0,4 ns)
      product: M378A1G43TB1-CTD
      vendor: Samsung
      physical id: 1
      serial: 719AF9F7
      slot: ChannelB-DIMM0
      size: 8GiB
      width: 64 bits
      clock: 2666MHz (0.4ns)
*-cache:0
   description: L1 cache
   physical id: 4c
   slot: L1 Cache
   size: 384KiB
   capacity: 384KiB
   capabilities: synchronous internal write-back unified
   configuration: level=1
*-cache:1
   description: L2 cache
   physical id: 4d
   slot: L2 Cache
   size: 1536KiB
   capacity: 1536KiB
   capabilities: synchronous internal write-back unified
   configuration: level=2
*-cache:2
   description: L3 cache
   physical id: 4e
   slot: L3 Cache
   size: 12MiB
   capacity: 12MiB
   capabilities: synchronous internal write-back unified
   configuration: level=3
*-memory UNCLAIMED
   description: RAM memory
   product: Cannon Lake PCH Shared SRAM
   vendor: Intel Corporation
   physical id: 14.2
   bus info: pci@0000:00:14.2
   version: 10
   width: 64 bits
   clock: 33MHz (30.3ns)
   capabilities: pm cap_list
   configuration: latency=0
   resources: memory:a3516000-a3517fff memory:a3520000-a3520fff
    total 8
    drwxrwsrwt  2 root whoopsie 4096 Dez 17 12:03 .
    drwxr-xr-x 14 root root     4096 Feb  9  2021 ..
sudo lshw -C memory

*-firmware                
   description: BIOS
   vendor: AMI
   physical id: 0
   version: F.42
   date: 05/28/2020
   size: 64KiB
   capacity: 16MiB
   capabilities: pci pnp upgrade shadowing cdboot bootselect edd int5printscreen int9keyboard int14serial int17printer acpi usb biosbootspecification netboot uefi
*-memory
   description: System Memory
   physical id: 42
   slot: System board or motherboard
   size: 16GiB
 *-bank:0
      description: DIMM DDR4 Synchronous 2666 MHz (0,4 ns)
      product: M378A1G43TB1-CTD
      vendor: Samsung
      physical id: 0
      serial: 719AF9F3
      slot: ChannelA-DIMM0
      size: 8GiB
      width: 64 bits
      clock: 2666MHz (0.4ns)
 *-bank:1
      description: DIMM DDR4 Synchronous 2666 MHz (0,4 ns)
      product: M378A1G43TB1-CTD
      vendor: Samsung
      physical id: 1
      serial: 719AF9F7
      slot: ChannelB-DIMM0
      size: 8GiB
      width: 64 bits
      clock: 2666MHz (0.4ns)
*-cache:0
   description: L1 cache
   physical id: 4c
   slot: L1 Cache
   size: 384KiB
   capacity: 384KiB
   capabilities: synchronous internal write-back unified
   configuration: level=1
*-cache:1
   description: L2 cache
   physical id: 4d
   slot: L2 Cache
   size: 1536KiB
   capacity: 1536KiB
   capabilities: synchronous internal write-back unified
   configuration: level=2
*-cache:2
   description: L3 cache
   physical id: 4e
   slot: L3 Cache
   size: 12MiB
   capacity: 12MiB
   capabilities: synchronous internal write-back unified
   configuration: level=3
*-memory UNCLAIMED
   description: RAM memory
   product: Cannon Lake PCH Shared SRAM
   vendor: Intel Corporation
   physical id: 14.2
   bus info: pci@0000:00:14.2
   version: 10
   width: 64 bits
   clock: 33MHz (30.3ns)
   capabilities: pm cap_list
   configuration: latency=0
   resources: memory:a3516000-a3517fff memory:a3520000-a3520fff
grep -i swap /etc/fstab
/swapfile                                 none            swap    sw              0       0

sudo dmidecode -s bios-version

F.42

如果與此無關,我很抱歉。我懷疑我的 GPU 無法正常工作,因為我的程式運行速度比同一系統上的雙啟動 Windows 慢得多。另外,在執行一些程式時,我遇到了一些 libcudart 和 libcublas 錯誤,據我所知,這些錯誤似乎與我的 GPU 有關。

您對如何排查或解決此問題有什麼建議嗎?

根據要求,我運行了命令dmesgcat /var/log/syslogjournalctl結果如下:

github 文件

貓 ...

日誌控制

由於正文限制,我無法將所有程式碼貼到此處,因此我將其放在 GitHub 上。

答案1

我將給出可能的解決方案:

  1. 重新安裝 Gnome:
sudo apt-get update 
sudo apt-get install --reinstall systemd gnome-settings-daemon gnome-settings-daemon-common

我來更新一下...

相關內容