我在桌上型電腦 Intel i9 Core 16Gb RAM ASUS 主機板上使用 Ubuntu 20.04。有時,當我運行 OBS Studio、Skype、Chrome 等應用程式時,我的電腦會突然重新啟動。我不知道原因,也找不到可以幫助解決此問題的合適文章。接下來我將解釋我的嘗試,試圖找出我的硬體可能存在的問題。
輸入後的結果last reboot
顯示,我之前的 Ubuntu 運行在意外重新啟動後顯示為「仍在運行」:
reboot system boot 5.4.0-42-generic Wed Aug 26 11:00 still running
reboot system boot 5.4.0-42-generic Tue Aug 25 06:20 still running
reboot system boot 5.4.0-42-generic Mon Aug 24 06:38 - 00:06 (17:28)
reboot system boot 5.4.0-42-generic Sun Aug 23 18:52 - 23:36 (04:44)
reboot system boot 5.4.0-42-generic Sun Aug 23 06:32 - 23:36 (17:04)
reboot system boot 5.4.0-42-generic Thu Aug 20 09:42 - 18:17 (2+08:35)
reboot system boot 5.4.0-42-generic Mon Aug 17 21:55 - 22:22 (00:26)
reboot system boot 5.4.0-42-generic Mon Aug 17 09:22 - 21:55 (12:33)
reboot system boot 5.4.0-42-generic Mon Aug 17 09:00 - 21:55 (12:54)
reboot system boot 5.4.0-42-generic Mon Aug 17 08:55 - 21:55 (12:59)
reboot system boot 5.4.0-42-generic Mon Aug 17 05:56 - 07:37 (01:40)
reboot system boot 5.4.0-42-generic Mon Aug 17 05:34 - 07:37 (02:02)
reboot system boot 5.4.0-42-generic Sun Aug 16 21:09 - 00:07 (02:58)
reboot system boot 5.4.0-42-generic Sun Aug 16 20:52 - 21:09 (00:17)
reboot system boot 5.4.0-42-generic Sun Aug 16 20:38 - 20:51 (00:12)
reboot system boot 5.4.0-42-generic Sun Aug 16 20:14 - 20:38 (00:23)
reboot system boot 5.4.0-42-generic Sun Aug 16 20:05 - 20:38 (00:33)
reboot system boot 5.4.0-42-generic Sun Aug 16 19:31 - 20:38 (01:07)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:39 - 19:30 (00:51)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:27 - 18:38 (00:11)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:22 - 18:27 (00:04)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:18 - 18:27 (00:08)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:16 - 18:27 (00:10)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:11 - 18:27 (00:15)
reboot system boot 5.4.0-42-generic Sun Aug 16 16:42 - 18:11 (01:28)
reboot system boot 5.4.0-42-generic Sun Aug 16 16:30 - 16:42 (00:11)
reboot system boot 5.4.0-42-generic Sun Aug 16 16:22 - 16:30 (00:08)
reboot system boot 5.4.0-42-generic Sun Aug 16 16:13 - 16:22 (00:08)
reboot system boot 5.4.0-42-generic Sun Aug 16 15:50 - 16:13 (00:23)
reboot system boot 5.4.0-42-generic Sun Aug 16 15:46 - 16:13 (00:27)
reboot system boot 5.4.0-42-generic Sun Aug 16 14:01 - 15:42 (01:41)
reboot system boot 5.4.0-42-generic Sun Aug 16 13:50 - 14:00 (00:09)
電腦硬體配置如下:
00:01.0 PCI bridge: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor PCIe Controller (x16) (rev 0d)
00:02.0 VGA compatible controller: Intel Corporation UHD Graphics 630 (Desktop 9 Series) (rev 02)
00:14.0 USB controller: Intel Corporation 200 Series/Z370 Chipset Family USB 3.0 xHCI Controller
00:16.0 Communication controller: Intel Corporation 200 Series PCH CSME HECI #1
00:17.0 SATA controller: Intel Corporation 200 Series PCH SATA controller [AHCI mode]
00:1c.0 PCI bridge: Intel Corporation 200 Series PCH PCI Express Root Port #5 (rev f0)
00:1c.7 PCI bridge: Intel Corporation 200 Series PCH PCI Express Root Port #8 (rev f0)
00:1d.0 PCI bridge: Intel Corporation 200 Series PCH PCI Express Root Port #11 (rev f0)
00:1f.0 ISA bridge: Intel Corporation Device a2ca
00:1f.2 Memory controller: Intel Corporation 200 Series/Z370 Chipset Family Power Management Controller
00:1f.3 Audio device: Intel Corporation 200 Series PCH HD Audio
00:1f.4 SMBus: Intel Corporation 200 Series/Z370 Chipset Family SMBus Controller
01:00.0 VGA compatible controller: NVIDIA Corporation GK208 [GeForce GT 710] (rev a1)
01:00.1 Audio device: NVIDIA Corporation GF119 HDMI Audio Controller (rev a1)
03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
當我第一次安裝 Ubuntu 時,我多次嘗試讓 Nvidia 驅動程式正常工作,但任何官方 nvidia 驅動程式都成功識別我的 nvidia 卡。因此,我目前正在運行 Noveau 驅動程式。
我使用該工具對我的 CPU 進行了壓力測試stress-ng
,並安裝了該powertop
工具來檢查我的硬體設備的功耗。我的電腦連接到不間斷(600 Va),壓力測試期間我的硬體最大耗電量為104W。根據sensors
,壓力測試時我的cpu核心溫度為:
coretemp-isa-0000
Adapter: ISA adapter
Package id 0: +92.0°C (high = +86.0°C, crit = +100.0°C)
Core 0: +91.0°C (high = +86.0°C, crit = +100.0°C)
Core 1: +87.0°C (high = +86.0°C, crit = +100.0°C)
Core 2: +92.0°C (high = +86.0°C, crit = +100.0°C)
Core 3: +91.0°C (high = +86.0°C, crit = +100.0°C)
Core 4: +92.0°C (high = +86.0°C, crit = +100.0°C)
Core 5: +91.0°C (high = +86.0°C, crit = +100.0°C)
Core 6: +89.0°C (high = +86.0°C, crit = +100.0°C)
Core 7: +89.0°C (high = +86.0°C, crit = +100.0°C)
acpitz-acpi-0
Adapter: ACPI interface
temp1: +27.8°C (crit = +119.0°C)
temp2: +29.8°C (crit = +119.0°C)
powertop
相同壓力測試期間的輸出:
System baseline power is estimated at 104 W
Power est. Usage Device name
85.4 W 1065% CPU core
9.68 W 1065% CPU misc
1.01 W 1065% DRAM
100,0% PCI Device: NVIDIA Corporation GK208 [GeForce GT 710]
100,0% USB device: xHCI Host Controller
100,0% USB device: USB Optical Mouse (Logitech)
100,0% USB device: USB Keyboard (USB)
100,0% PCI Device: Intel Corporation 200 Series/Z370 Chipset Family Power Management Controller
100,0% PCI Device: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller
100,0% PCI Device: Intel Corporation 200 Series PCH SATA controller [AHCI mode]
100,0% PCI Device: Intel Corporation 200 Series PCH PCI Express Root Port #5
100,0% PCI Device: Intel Corporation Device a2ca
100,0% PCI Device: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor PCIe Controller (x16)
100,0% PCI Device: Intel Corporation 200 Series PCH PCI Express Root Port #8
100,0% PCI Device: Intel Corporation 200 Series PCH HD Audio
100,0% PCI Device: Intel Corporation 8th Gen Core 8-core Desktop Processor Host Bridge/DRAM Registers [Coffee
100,0% PCI Device: Intel Corporation 200 Series PCH PCI Express Root Port #11
100,0% PCI Device: Intel Corporation UHD Graphics 630 (Desktop 9 Series)
100,0% PCI Device: Intel Corporation 200 Series/Z370 Chipset Family USB 3.0 xHCI Controller
100,0% Audio codec hwC0D0: Realtek
18,6 pkts/s Network interface: enp3s0 (r8169)
誰能告訴我我的電腦出了什麼問題?我很欣賞這些建議!
謝謝!
答案1
CPU溫度
此stress-ng
工具顯示所有 8 個 CPU 的 CPU 溫度均為 87.0°C 至 92.0°C(幾乎 200°F)。這些溫度會毀掉你的機器。
檢查您的風扇是否正確接線、連接和運作。
檢查您的 BIOS 中的自訂風扇設定。
盡快降低溫度!
超頻
如果您的 CPU 或 RAM 已超頻,請將其恢復為預設值。
BIOS
華碩 PRIME H310M-E R2.0/BR
您的 BIOS 版本為 1402,日期為 2020 年 5 月 21 日。
有更新的 BIOS 可用,版本 1605,日期為 2020 年 8 月 14 日,可下載這裡。
注意:驗證我是否擁有適合您主機板的正確網頁。
注意:更新 BIOS 之前請先做好備份。
英偉達
NVIDIA 公司 GK208 [GeForce GT 710]
關於Nvidia問題...目前驅動程式版本為450.66,可下載這裡。
確認 BIOS 中已停用安全啟動。
清除所有目前的 Nvidia 驅動程序,然後安裝新驅動程式。
更新#1:
您從 Nvidia 驅動程式傳回的訊息表示 450.66 不支援您的顯示卡,因此它們不適用於您的配置。您需要聯絡 Nvidia 支援以詢問要使用哪個驅動程式。在此之前,請選擇 Nouveau 視訊驅動程序,然後再次清除所有 Nvidia 內容。
答案2
的輸出ps auxc | grep therm
是:
root 228 0.0 0.0 0 0 ? I< 07:39 0:00 acpi_thermal_pm
root 872 0.0 0.0 134500 9892 ? Ssl 07:40 0:00 thermald
我成功更新了BIOS版本並安裝了Nvidia驅動程式450,但在安裝過程中電腦自動重新啟動。
我的電腦閒置時的溫度如下:
sensors
nct6796-isa-0290
Adapter: ISA adapter
Vcore: 328.00 mV (min = +0.00 V, max = +1.74 V)
in1: 1.02 V (min = +0.00 V, max = +0.00 V) ALARM
AVCC: 3.39 V (min = +2.98 V, max = +3.63 V)
+3.3V: 3.41 V (min = +2.98 V, max = +3.63 V)
in4: 1.02 V (min = +0.00 V, max = +0.00 V) ALARM
in5: 160.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in6: 128.00 mV (min = +0.00 V, max = +0.00 V) ALARM
3VSB: 3.39 V (min = +2.98 V, max = +3.63 V)
Vbat: 3.17 V (min = +2.70 V, max = +3.63 V)
in9: 1000.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in10: 152.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in11: 128.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in12: 144.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in13: 128.00 mV (min = +0.00 V, max = +0.00 V) ALARM
in14: 136.00 mV (min = +0.00 V, max = +0.00 V) ALARM
fan1: 0 RPM (min = 0 RPM)
fan2: 1220 RPM (min = 0 RPM)
fan3: 0 RPM (min = 0 RPM)
fan4: 0 RPM (min = 0 RPM)
fan5: 0 RPM (min = 0 RPM)
fan7: 0 RPM (min = 0 RPM)
SYSTIN: +32.0°C (high = +98.0°C, hyst = +95.0°C) sensor = thermistor
CPUTIN: +31.5°C (high = +80.0°C, hyst = +75.0°C) sensor = thermistor
AUXTIN0: +110.0°C sensor = thermistor
AUXTIN1: +115.0°C sensor = thermistor
AUXTIN2: +114.0°C sensor = thermistor
AUXTIN3: +115.0°C sensor = thermistor
PECI Agent 0: +34.0°C (high = +98.0°C, hyst = +95.0°C)
(crit = +100.0°C)
PECI Agent 0 Calibration: +31.5°C
PCH_CHIP_CPU_MAX_TEMP: +0.0°C
PCH_CHIP_TEMP: +0.0°C
intrusion0: OK
intrusion1: ALARM
beep_enable: disabled
acpitz-acpi-0
Adapter: ACPI interface
temp1: +27.8°C (crit = +119.0°C)
temp2: +29.8°C (crit = +119.0°C)
coretemp-isa-0000
Adapter: ISA adapter
Package id 0: +38.0°C (high = +86.0°C, crit = +100.0°C)
Core 0: +35.0°C (high = +86.0°C, crit = +100.0°C)
Core 1: +34.0°C (high = +86.0°C, crit = +100.0°C)
Core 2: +38.0°C (high = +86.0°C, crit = +100.0°C)
Core 3: +35.0°C (high = +86.0°C, crit = +100.0°C)
Core 4: +33.0°C (high = +86.0°C, crit = +100.0°C)
Core 5: +34.0°C (high = +86.0°C, crit = +100.0°C)
Core 6: +35.0°C (high = +86.0°C, crit = +100.0°C)
Core 7: +34.0°C (high = +86.0°C, crit = +100.0°C)
重新啟動後,我看到 Nvidia 450 驅動程式已安裝,但當我輸入 時nvidia-smi
,我收到訊息:
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
PS:這台電腦很新穎…我兩週前買的。