35 GB/día de "Error de bus PCIe: gravedad=Corregido, tipo=Capa de enlace de datos..." en syslog

35 GB/día de "Error de bus PCIe: gravedad=Corregido, tipo=Capa de enlace de datos..." en syslog
  • Ubuntu 20.04.4 LTS
  • Dell XPS 8940
  • BIOS más reciente 2.4.0

Nueva instalación y el sistema parece estar funcionando bien, pero tengo 35 gigas/día de mensajes de error de syslog como:

Feb 25 00:00:10 mumsilar kernel: [32409.088886] pcieport 0000:00:01.0: AER: Multiple Corrected error received: 0000:00:01.0
Feb 25 00:00:10 mumsilar kernel: [32409.088907] pcieport 0000:00:01.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
Feb 25 00:00:10 mumsilar kernel: [32409.088908] pcieport 0000:00:01.0:   device [8086:4c01] error status/mask=00000040/00002000
Feb 25 00:00:10 mumsilar kernel: [32409.088910] pcieport 0000:00:01.0:    [ 6] BadTLP
Feb 25 00:00:19 mumsilar kernel: [32418.024062] pcieport 0000:00:01.0: AER: Multiple Corrected error received: 0000:00:01.0
Feb 25 00:00:19 mumsilar kernel: [32418.024100] pcieport 0000:00:01.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Feb 25 00:00:19 mumsilar kernel: [32418.024102] pcieport 0000:00:01.0:   device [8086:4c01] error status/mask=00000001/00002000
Feb 25 00:00:19 mumsilar kernel: [32418.024103] pcieport 0000:00:01.0:    [ 0] RxErr
Feb 25 00:00:20 mumsilar kernel: [32418.431966] pcieport 0000:00:01.0: AER: Multiple Corrected error received: 0000:00:01.0
Feb 25 00:00:20 mumsilar kernel: [32418.432012] pcieport 0000:00:01.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Feb 25 00:00:20 mumsilar kernel: [32418.432014] pcieport 0000:00:01.0:   device [8086:4c01] error status/mask=00000001/00002000
Feb 25 00:00:20 mumsilar kernel: [32418.432016] pcieport 0000:00:01.0:    [ 0] RxErr
Feb 25 00:00:20 mumsilar kernel: [32418.443484] pcieport 0000:00:01.0: AER: Corrected error received: 0000:00:01.0
Feb 25 00:00:20 mumsilar kernel: [32418.443492] pcieport 0000:00:01.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
Feb 25 00:00:20 mumsilar kernel: [32418.443494] pcieport 0000:00:01.0:   device [8086:4c01] error status/mask=00000040/00002000
Feb 25 00:00:20 mumsilar kernel: [32418.443495] pcieport 0000:00:01.0:    [ 6] BadTLP
...

Aquí está el resultado de los siguientes comandos.

sudo lspci -nn
sudo lspci -tv
sudo lshw -C network
sudo sysctl vm.swappiness
inxi -Fxxxrz
sudo lspci -s 00:01.0 -vvv

https://pastebin.com/XnvMbxm5 ( lspci -s 00:01.0 -vvvsalida agregada)

Gracias por cualquier ayuda que usted puede dar. Salud.

PD

Había añadido esta línea a/etc/modprobe.d/alsa-base.conf

# apparently after power saving shuts down the audio, the next time it turns on
# it will audibly pop.  Turn off shutting down the audio to prevent the popping.
# see https://superuser.com/questions/1493096/linux-ubuntu-speakers-popping-every-few-seconds#:~:text=The%20operation%20system's%20default%20behavior,value%20from%201%20to%200.
options snd-hda-intel power_save=0 power_save_controller=N

Respuesta1

Parece que su tarjeta Nvidia está causando el problema de syslog.


Feb 25 00:00:10 mumsilar kernel: [32409.088886] pcieport 0000:00:01.0: AER: Multiple Corrected error received: 0000:00:01.0
Feb 25 00:00:10 mumsilar kernel: [32409.088907] pcieport 0000:00:01.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
Feb 25 00:00:10 mumsilar kernel: [32409.088908] pcieport 0000:00:01.0:   device [8086:4c01] error status/mask=00000040/00002000

00:01.0 PCI bridge [0604]: Intel Corporation Device [8086:4c01] (rev 01)

-[0000:00]-+-00.0-[ff]--
           +-01.0-[02]--+-00.0  NVIDIA Corporation TU116 [GeForce GTX 1660 Ti]
           |            +-00.1  NVIDIA Corporation TU116 High Definition Audio Controller
           |            +-00.2  NVIDIA Corporation TU116 USB 3.1 Host Controller
           |            \-00.3  NVIDIA Corporation TU116 [GeForce GTX 1650 SUPER]

Pruebe Nvidia 510.54. Pruebe con una tarjeta de video diferente, si puede acceder a una.

Para silenciar el ruido de syslog, haga esto...

sudo -H gedit /etc/default/grub# editar este archivo

Encontrar:

GRUB_CMDLINE_LINUX_DEFAULT="quiet splash"

Cámbielo a:

GRUB_CMDLINE_LINUX_DEFAULT="quiet splash pci=noaer"

Guarda el archivo.

sudo update-grub# actualizar GRUB

reboot# reinicia la computadora

Actualización n.º 1:

Se solucionó apagando ASPM: pcie_aspm=off

https://forums.developer.nvidia.com/t/pcie-bus-error-severity-corrected-on-jetson-nano/155780

información relacionada