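We recently purchased a new Dell R340XL server and installed CentOS 8 on it. It was working fine a few days ago, but as of two days ago we can no longer see the RAID volume. We got on the phone with Dell, and they helped us upgrade the firmware on several devices, but we still cannot see the RAID volume. Any help would be greatly appreciated.
When I run lspci, the device shows up as: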
02:00.0 RAID bus controller: Broadcom / LSI MegaRAID SAS-3 3008 [Fury] (rev 02)
Subsystem: Dell PERC H330 Adapter
...
Kernel driver in use: megaraid_sas
Kernel modules: megaraid_sas
I downloaded MegaCli and can get all sorts of information from it. The command
MegaCli64 -LdGetNum -aAll
tells me this:
Number of Virtual Drives Configured on Adapter 0: 1
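If we reboot the server over and over, we occasionally see the RAID array show up as a block device, but it appears as sda, which pushes the boot drive to sdb.
I looked through dmesg and the system logs. I am not sure what to look for, but I did see things like this: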
[ 1.296976] megasas: 07.707.51.00-rc1
[ 1.301092] megaraid_sas 0000:02:00.0: FW now in Ready state
[ 1.301095] megaraid_sas 0000:02:00.0: 63 bit DMA mask and 32 bit consistent mask
[ 1.301363] megaraid_sas 0000:02:00.0: firmware supports msix : (96)
[ 1.301364] megaraid_sas 0000:02:00.0: current msix/online cpus : (12/12)
[ 1.301365] megaraid_sas 0000:02:00.0: RDPQ mode : (disabled)
[ 1.301366] megaraid_sas 0000:02:00.0: Current firmware supports maximum commands: 928 LDIO threshold: 237
[ 1.301477] megaraid_sas 0000:02:00.0: Configured max firmware commands: 927
[ 1.303185] megaraid_sas 0000:02:00.0: FW supports sync cache : No
…
[ 1.680004] megaraid_sas 0000:02:00.0: FW provided supportMaxExtLDs: 0 max_lds: 32
[ 1.680005] megaraid_sas 0000:02:00.0: controller type : iMR(0MB)
[ 1.680005] megaraid_sas 0000:02:00.0: Online Controller Reset(OCR) : Enabled
[ 1.680006] megaraid_sas 0000:02:00.0: Secure JBOD support : No
[ 1.680006] megaraid_sas 0000:02:00.0: NVMe passthru support : No
[ 1.680007] megaraid_sas 0000:02:00.0: FW provided TM TaskAbort/Reset timeout : 0 secs/0 secs
[ 1.702120] megaraid_sas 0000:02:00.0: INIT adapter done
[ 1.702121] megaraid_sas 0000:02:00.0: Jbod map is not supported megasas_setup_jbod_map 5371
[ 1.728949] megaraid_sas 0000:02:00.0: pci id : (0x1000)/(0x005f)/(0x1028)/(0x1f44)
[ 1.728950] megaraid_sas 0000:02:00.0: unevenspan support : yes
[ 1.728950] megaraid_sas 0000:02:00.0: firmware crash dump : no
[ 1.728951] megaraid_sas 0000:02:00.0: jbod sync map : no
[ 1.729017] scsi host0: Avago SAS based MegaRAID driver
[ 1.730804] scsi 11:0:0:0: Processor Marvell Console 1.01 PQ: 0 ANSI: 5
[ 1.732057] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732076] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732094] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732112] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732131] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732149] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732167] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732185] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732206] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732224] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732242] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
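The allocation failures look worrying, but when I tried Googling the message I did not find much. One person was trying to create more than 200 logical devices, but we are definitely not doing that here.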
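To make it easier to compare boots where the volume appears against boots where it does not, the failure lines can be counted from a saved copy of the log. This is only a minimal sketch: the helper names `count_lines_containing` and `first_line_containing` are made up for illustration, and `SAMPLE` is just three lines copied from the excerpt above.

```python
def count_lines_containing(log_text, needle):
    """Return how many lines of log_text contain the substring needle."""
    return sum(1 for line in log_text.splitlines() if needle in line)


def first_line_containing(log_text, needle):
    """Return the first line containing needle (stripped), or None."""
    for line in log_text.splitlines():
        if needle in line:
            return line.strip()
    return None


# Three lines copied from the dmesg excerpt above (illustrative input only).
SAMPLE = """\
[ 1.729017] scsi host0: Avago SAS based MegaRAID driver
[ 1.732057] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
[ 1.732076] scsi_alloc_sdev: Allocation failure during SCSI scanning, some SCSI devices might not be configured
"""

if __name__ == "__main__":
    # Number of allocation-failure lines in the sample.
    print(count_lines_containing(SAMPLE, "scsi_alloc_sdev: Allocation failure"))
    # The line where the MegaRAID SCSI host is registered.
    print(first_line_containing(SAMPLE, "MegaRAID driver"))
```

For a real comparison, the output of `dmesg` from each boot would be saved to a text file and its contents passed in place of `SAMPLE`.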
Thanks in advance.
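Answer 1
Double-check that it is an H330 and not an HBA330. If it really is an H330, check whether someone accidentally put it into HBA/pass-through mode. Hopefully it is something that simple, but I suspect there is more to it.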
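As a rough sketch of the first check, the subsystem name can be pulled out of saved lspci output like the one in the question. The function names `raid_subsystems` and `looks_like_hba330` are hypothetical, and the assumption that an HBA330 would report a subsystem name containing "HBA330" is mine, not something confirmed in the question. This only looks at the model name; whether the controller has been switched to HBA/pass-through mode has to be checked in the controller's own configuration utility.

```python
import re

# A PCI device line starts with an address such as "02:00.0"
# (optionally prefixed by a domain such as "0000:").
DEVICE_LINE = re.compile(r"^(?:[0-9a-f]{4}:)?[0-9a-f]{2}:[0-9a-f]{2}\.[0-9a-f]\s")


def raid_subsystems(lspci_text):
    """Return the Subsystem names listed under RAID bus controller entries."""
    found = []
    current = None  # device line of the RAID controller being read, if any
    for raw in lspci_text.splitlines():
        line = raw.strip()
        if DEVICE_LINE.match(line):
            current = line if "RAID bus controller" in line else None
        elif current is not None and line.startswith("Subsystem:"):
            found.append(line.split(":", 1)[1].strip())
            current = None
    return found


def looks_like_hba330(subsystem):
    """Assumption: an HBA330 reports a name containing 'HBA330'."""
    return "HBA330" in subsystem.upper().replace(" ", "")


# lspci lines copied from the question.
SAMPLE = """\
02:00.0 RAID bus controller: Broadcom / LSI MegaRAID SAS-3 3008 [Fury] (rev 02)
Subsystem: Dell PERC H330 Adapter
Kernel driver in use: megaraid_sas
Kernel modules: megaraid_sas
"""

if __name__ == "__main__":
    for name in raid_subsystems(SAMPLE):
        print(name, "-> HBA330?", looks_like_hba330(name))
```

Going by the lspci output in the question, the subsystem is reported as a PERC H330 Adapter, so the pass-through mode question is probably the more useful one to chase.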