
內核日誌顯示以下 EDAC 錯誤的許多實例:
EDAC MC0: 1 CE ie31200 CE on unknown memory (csrow:3 channel:1 page:0x0 offset:0x0 grain:1 syndrome:0x1c)
問題是......我的系統上沒有csrow #3
(輸出被截斷以提高可見性):
$ ls -l /sys/devices/system/edac/mc/mc0
drwxr-xr-x 3 root root 0 May 19 10:53 csrow0
drwxr-xr-x 3 root root 0 May 19 10:53 csrow1
怎麼可能?有沒有實際上儲存設備故障?我怎麼能辨識它是哪一個?
更多可能有幫助的資訊:
$ cat /sys/devices/system/edac/mc/mc0/ce_count
1069
$ cat /sys/devices/system/edac/mc/mc0/csrow?/ce_count
0
0
$ sudo edac-util -v
mc0: 0 Uncorrected Errors with no DIMM info
mc0: 0 Corrected Errors with no DIMM info
mc0: csrow0: 0 Uncorrected Errors
mc0: csrow0: mc#0csrow#0channel#0: 0 Corrected Errors
mc0: csrow0: mc#0csrow#0channel#1: 0 Corrected Errors
mc0: csrow1: 0 Uncorrected Errors
mc0: csrow1: mc#0csrow#1channel#0: 0 Corrected Errors
mc0: csrow1: mc#0csrow#1channel#1: 0 Corrected Errors
edac-util: No errors to report.
- 作業系統:ArchLinux / 5.17.8-arch1-1 #1 SMP 搶佔
- CPU:至強E-2124
- 主機板:超微X11SCH-LN4F
謝謝