우리는 아래 사양의 사전 구축된 머신을 주문했고 SSD에 Ubuntu 17.04가 설치되어 있었습니다. Ubuntu는 네 가지 다른 지점에서 무작위로 멈췄습니다(Python용 anaconda를 설치하는 동안 포함). 우리는 동일한 SSD 드라이브에서 16.04(어쨌든 일상 업무에서 이 OS에 더 익숙했기 때문에)로 이동하기로 결정했지만 문제는 지속되었습니다.
이는 정지 시간 중 하나와 관련된 syslog의 출력입니다.
Jun 5 02:22:08 PsertainTech org.gnome.evolution.dataserver.Sources5[1648]: ** (evolution-source-registry:1869): WARNING **: secret_service_search_sync: must specify at least one attribute to match
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736597] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736609] 7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=7188
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736613] (detected by 3, t=15002 jiffies, g=241027, c=241026, q=3366)
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736620] Task dump for CPU 7:
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736622] swapper/7 R running task 0 0 1 0x00000008
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736628] 0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736632] 7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736636] ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736640] Call Trace:
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736650] [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736655] [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736658] [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun 5 02:22:18 PsertainTech kernel: [ 3648.736662] [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752786] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752796] 7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=27789
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752799] (detected by 4, t=60007 jiffies, g=241027, c=241026, q=11266)
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752803] Task dump for CPU 7:
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752805] swapper/7 R running task 0 0 1 0x00000008
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752810] 0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752815] 7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752819] ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752823] Call Trace:
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752833] [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752839] [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752843] [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun 5 02:25:18 PsertainTech kernel: [ 3828.752848] [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun 5 02:25:28 PsertainTech org.gnome.evolution.dataserver.Sources5[2720]: ** (evolution-source-registry:2988): WARNING **: secret_service_search_sync: must specify at least one attribute to match
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773198] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773210] 7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=48323
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773214] (detected by 0, t=105012 jiffies, g=241027, c=241026, q=19082)
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773221] Task dump for CPU 7:
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773223] swapper/7 R running task 0 0 1 0x00000008
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773228] 0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773232] 7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773236] ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773240] Call Trace:
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773251] [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773256] [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773259] [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun 5 02:28:18 PsertainTech kernel: [ 4008.773264] [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789674] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789686] 7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=68966
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789690] (detected by 0, t=150017 jiffies, g=241027, c=241026, q=26836)
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789696] Task dump for CPU 7:
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789698] swapper/7 R running task 0 0 1 0x00000008
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789703] 0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789707] 7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789710] ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789715] Call Trace:
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789725] [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789730] [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789733] [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun 5 02:31:18 PsertainTech kernel: [ 4188.789738] [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808848] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808861] 7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=89546
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808865] (detected by 3, t=195022 jiffies, g=241027, c=241026, q=34852)
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808871] Task dump for CPU 7:
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808873] swapper/7 R running task 0 0 1 0x00000008
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808879] 0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808883] 7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808887] ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808891] Call Trace:
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808901] [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808906] [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808909] [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun 5 02:34:18 PsertainTech kernel: [ 4368.808914] [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828850] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828862] 7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=110118
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828865] (detected by 9, t=240027 jiffies, g=241027, c=241026, q=42598)
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828871] Task dump for CPU 7:
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828873] swapper/7 R running task 0 0 1 0x00000008
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828878] 0000000000000010 0000000000000246 ffff8c86585b7e70 0000000000000018
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828882] 7735940000000000 00000343907a93bf 0000000000000007 ffff8c86585b8000
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828886] ffff8c865f3e2900 ffffffffac4b92e0 ffff8c86585b4000 ffff8c86585b7eb8
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828889] Call Trace:
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828900] [<ffffffffabd157f7>] ? cpuidle_enter+0x17/0x20
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828904] [<ffffffffab6c7a5a>] ? call_cpuidle+0x2a/0x50
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828907] [<ffffffffab6c7e3e>] ? cpu_startup_entry+0x29e/0x350
Jun 5 02:37:18 PsertainTech kernel: [ 4548.828912] [<ffffffffab651891>] ? start_secondary+0x151/0x190
Jun 5 02:40:18 PsertainTech kernel: [ 4728.843394] INFO: rcu_sched detected stalls on CPUs/tasks:
Jun 5 02:40:18 PsertainTech kernel: [ 4728.843406] 7-...: (433 GPs behind) idle=07d/1/0 softirq=176166/176166 fqs=130672
Jun 5 02:40:18 PsertainTech kernel: [ 4728.843410] (detected by 3, t=285032 jiffies, g=241027, c=241026, q=50334)
c-state 수정을 시도했지만 실패했습니다.GRUB_CMDLINE_LINUX_DEFAULT="quiet splash intel_idle.max_cstate=1"
- Mobo: Asus Intel X99-A II USB 3.1 시리즈 DDR4/ Quad CrossFireX&Quad SLI/ SATA3 & USB3.1
- RAM: 64GB DDR4-2133/2400 PC4-17000/19200 4X16GB
- CPU: 코어 i7-6850K 브로드웰-E 6xCore 3.6GHz 2011 v3
- SSD: 삼성 850 EVO 시리즈 500GB 솔리드 스테이트 드라이브
- HDD: 2TB Western Digital 블랙 7200RPM SATA-3 6Gb/s 64MB 캐시
- GPU: 엔비디아 지포스 GT 730 2GB DDR PCI-익스프레스
free -h
산출:
total used free shared buff/cache available
Mem: 62G 2.1G 59G 55M 1.4G 60G
Swap: 63G 0B 63G
스왑 출력:
이름 유형 사용된 크기 PRIO /dev/sdb5 파티션 63.9G 0B -1
sudo blkid 출력:
/dev/sda1: UUID="a8fb3b82-1a85-4377-99e3-20d22e63a451" TYPE="swap" PARTUUID="7f08bb24-9185-47bf-a949-a316a0b63f5b"
/dev/sdb1: UUID="1c22f4a6-4c23-46c1-bdcc-7644ed39e193" TYPE="ext4" PARTUUID="43ee50b9-01"
/dev/sdb5: UUID="c7a7c6b7-e3a2-4942-b4a1-7d66eccb0915" TYPE="swap" PARTUUID="43ee50b9-05"
/dev/sdb6: UUID="8668e4db-4dd9-4b76-8d06-b6241d521800" TYPE="ext4" PARTUUID="43ee50b9-06"
고양이 /etc/fstab 출력:
# /etc/fstab: static file system information.
#
# Use 'blkid' to print the universally unique identifier for a
# device; this may be used with UUID= as a more robust way to name devices
# that works even if disks are added and removed. See fstab(5).
#
# <file system> <mount point> <type> <options> <dump> <pass>
# / was on /dev/sdb1 during installation
UUID=1c22f4a6-4c23-46c1-bdcc-7644ed39e193 / ext4 errors=remount-ro 0 1
# /home was on /dev/sdb6 during installation
UUID=8668e4db-4dd9-4b76-8d06-b6241d521800 /home ext4 defaults 0 2
# swap was on /dev/sdb5 during installation
UUID=c7a7c6b7-e3a2-4942-b4a1-7d66eccb0915 none swap sw 0 0
cat /etc/crypt* 출력: cat: '/etc/crypt*': 해당 파일이나 디렉터리가 없습니다.
ls -alh /swapfile 출력: ls: '/swapfile'에 액세스할 수 없습니다. 해당 파일이나 디렉터리가 없습니다.
Jun 8 08:29:22 psertain kernel: [37593.535883] nouveau 0000:01:00.0: fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]
Jun 8 08:29:22 psertain kernel: [37593.535898] nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Jun 8 08:29:22 psertain kernel: [37593.535911] nouveau 0000:01:00.0: fifo: channel 3: killed
Jun 8 08:29:22 psertain kernel: [37593.535918] nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Jun 8 08:29:22 psertain kernel: [37593.535998] nouveau 0000:01:00.0: Xorg[1105]: channel 3 killed!
Jun 8 08:29:33 psertain kernel: [37604.030229] [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:37:head-0] hw_done timed out
Jun 8 08:29:43 psertain kernel: [37614.269426] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:37:head-0] hw_done timed out
Jun 8 08:29:53 psertain kernel: [37624.508634] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:37:head-0] flip_done timed out
Jun 8 08:30:59 psertain rsyslogd: [origin software="rsyslogd" swVersion="8.16.0" x-pid="909" x-info="http://www.rsyslog.com"] start
답변1
그래서 새로운 테스트 버전이 아닌 Linux를 다시 설치하고 nvidia 드라이버의 340 버전을 활성화해야 했으며 지금까지 24시간 동안 정지되지 않았습니다. 대기 모드에서 나오는 것도 작동합니다. 이것이 계속되기를 바랍니다. –@sousuffer