Projet

Général

Profil

Actions

Anomalie #4254

fermé

Alertes RAID sur virola

Ajouté par Christian P. Momon il y a presque 5 ans. Mis à jour il y a plus de 4 ans.

Statut:
Fermé
Priorité:
Urgente
Assigné à:
Christian P. Momon
Catégorie:
-
Version cible:
Début:
09/02/2020
Echéance:
% réalisé:

0%

Temps estimé:
Difficulté:
2 Facile

Description

Icinga remonte les deux alertes suivantes :

RAID 1st sunday during resync
Active checks are being scheduled as normal - passive checks are disabled
    WARNING     2020-02-09 23:07:26     0d 2h 16m 15s     4/4     WARNING: mdstat:[md0(1.79 TiB raid1):UU (resync:10.7% 34803K/sec ETA: 820.9min)]     

RAID all days without resync
Active checks are being scheduled as normal - passive checks are disabled
    WARNING     2020-02-09 23:07:26     0d 2h 15m 14s     4/4     WARNING: mdstat:[md0(1.79 TiB raid1):UU (resync:10.7% 34807K/sec ETA: 820.8min)]

Mis à jour par Christian P. Momon il y a presque 5 ans

Mais pourquoi ?

(April) root@virola:/var/log# cat /proc/mdstat 
Personalities : [raid1] [linear] [multipath] [raid0] [raid6] [raid5] [raid4] [raid10] 
md0 : active raid1 sdb1[1] sda1[0]
      1920618880 blocks [2/2] [UU]
      [==>..................]  resync = 12.1% (232842880/1920618880) finish=1260.4min speed=22316K/sec

unused devices: <none>

(April) root@virola:/var/log#  mdadm --detail /dev/md0
/dev/md0:
           Version : 0.90
     Creation Time : Mon Mar  7 16:02:57 2016
        Raid Level : raid1
        Array Size : 1920618880 (1831.64 GiB 1966.71 GB)
     Used Dev Size : 1920618880 (1831.64 GiB 1966.71 GB)
      Raid Devices : 2
     Total Devices : 2
   Preferred Minor : 0
       Persistence : Superblock is persistent

       Update Time : Sun Feb  9 22:26:00 2020
             State : active, resyncing 
    Active Devices : 2
   Working Devices : 2
    Failed Devices : 0
     Spare Devices : 0

Consistency Policy : resync

     Resync Status : 12% complete

              UUID : 3ed83f38:2f77c02f:848f0d91:5dd9f907
            Events : 0.47417

    Number   Major   Minor   RaidDevice State
       0       8        1        0      active sync   /dev/sda1
       1       8       17        1      active sync   /dev/sdb1

Mis à jour par Christian P. Momon il y a presque 5 ans

Le boot :

Feb  9 19:02:12 virola kernel: [    2.147742] ata2.00: ATA-8: TOSHIBA MG03ACA200,     FL1D, max UDMA/100
Feb  9 19:02:12 virola kernel: [    2.147747] ata2.00: 3907029168 sectors, multi 16: LBA48 NCQ (depth 32), AA
Feb  9 19:02:12 virola kernel: [    2.149638] ata2.00: configured for UDMA/100
Feb  9 19:02:12 virola kernel: [    2.149939] scsi 1:0:0:0: Direct-Access     ATA      TOSHIBA MG03ACA2 FL1D PQ: 0 ANSI: 5
Feb  9 19:02:12 virola kernel: [    2.369520] usb 2-9.1: new high-speed USB device number 3 using xhci_hcd
Feb  9 19:02:12 virola kernel: [    2.465918] ata6: SATA link down (SStatus 0 SControl 300)
Feb  9 19:02:12 virola kernel: [    2.476104] sd 0:0:0:0: [sda] 3907029168 512-byte logical blocks: (2.00 TB/1.82 TiB)
Feb  9 19:02:12 virola kernel: [    2.476110] sd 0:0:0:0: [sda] Write Protect is off
Feb  9 19:02:12 virola kernel: [    2.476120] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Feb  9 19:02:12 virola kernel: [    2.476154] sd 1:0:0:0: [sdb] 3907029168 512-byte logical blocks: (2.00 TB/1.82 TiB)
Feb  9 19:02:12 virola kernel: [    2.476161] sd 1:0:0:0: [sdb] Write Protect is off
Feb  9 19:02:12 virola kernel: [    2.476173] sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Feb  9 19:02:12 virola kernel: [    2.487685]  sdb: sdb1
Feb  9 19:02:12 virola kernel: [    2.488047] sd 1:0:0:0: [sdb] Attached SCSI disk
Feb  9 19:02:12 virola kernel: [    2.499036] usb 2-9.1: New USB device found, idVendor=0624, idProduct=0249, bcdDevice= 0.00
Feb  9 19:02:12 virola kernel: [    2.499039] usb 2-9.1: New USB device strings: Mfr=4, Product=5, SerialNumber=6
Feb  9 19:02:12 virola kernel: [    2.499041] usb 2-9.1: Product: Keyboard/Mouse Function
Feb  9 19:02:12 virola kernel: [    2.499042] usb 2-9.1: Manufacturer: Avocent
Feb  9 19:02:12 virola kernel: [    2.499044] usb 2-9.1: SerialNumber: 20121018
Feb  9 19:02:12 virola kernel: [    2.509618] hidraw: raw HID events driver (C) Jiri Kosina
Feb  9 19:02:12 virola kernel: [    2.514290] usbcore: registered new interface driver usbhid
Feb  9 19:02:12 virola kernel: [    2.514290] usbhid: USB HID core driver
Feb  9 19:02:12 virola kernel: [    2.515086] input: Avocent Keyboard/Mouse Function as /devices/pci0000:00/0000:00:14.0/usb2/2-9/2-9.1/2-9.1:1.0/0003:0624:0249.0001/input/input0
Feb  9 19:02:12 virola kernel: [    2.573721] hid-generic 0003:0624:0249.0001: input,hidraw0: USB HID v1.00 Keyboard [Avocent Keyboard/Mouse Function] on usb-0000:00:14.0-9.1/input0
Feb  9 19:02:12 virola kernel: [    2.573895] input: Avocent Keyboard/Mouse Function as /devices/pci0000:00/0000:00:14.0/usb2/2-9/2-9.1/2-9.1:1.1/0003:0624:0249.0002/input/input1
Feb  9 19:02:12 virola kernel: [    2.573981] hid-generic 0003:0624:0249.0002: input,hidraw1: USB HID v1.00 Mouse [Avocent Keyboard/Mouse Function] on usb-0000:00:14.0-9.1/input1
Feb  9 19:02:12 virola kernel: [    2.574117] input: Avocent Keyboard/Mouse Function as /devices/pci0000:00/0000:00:14.0/usb2/2-9/2-9.1/2-9.1:1.2/0003:0624:0249.0003/input/input2
Feb  9 19:02:12 virola kernel: [    2.574204] hid-generic 0003:0624:0249.0003: input,hidraw2: USB HID v1.00 Mouse [Avocent Keyboard/Mouse Function] on usb-0000:00:14.0-9.1/input2
Feb  9 19:02:12 virola kernel: [    2.657634]  sda: sda1
Feb  9 19:02:12 virola kernel: [    2.658168] sd 0:0:0:0: [sda] Attached SCSI disk
Feb  9 19:02:12 virola kernel: [    2.787590] md/raid1:md0: not clean -- starting background reconstruction
Feb  9 19:02:12 virola kernel: [    2.787591] md/raid1:md0: active with 2 out of 2 mirrors
Feb  9 19:02:12 virola kernel: [    2.787600] md0: detected capacity change from 0 to 1966713733120
Feb  9 19:02:12 virola kernel: [    2.925417] raid6: sse2x1   gen() 13333 MB/s
Feb  9 19:02:12 virola kernel: [    2.993403] raid6: sse2x1   xor()  9392 MB/s
Feb  9 19:02:12 virola kernel: [    3.061414] raid6: sse2x2   gen() 15486 MB/s
Feb  9 19:02:12 virola kernel: [    3.129416] raid6: sse2x2   xor() 10306 MB/s
Feb  9 19:02:12 virola kernel: [    3.197404] raid6: sse2x4   gen() 18777 MB/s
Feb  9 19:02:12 virola kernel: [    3.265404] raid6: sse2x4   xor() 11595 MB/s
Feb  9 19:02:12 virola kernel: [    3.333402] raid6: avx2x1   gen() 25975 MB/s
Feb  9 19:02:12 virola kernel: [    3.401402] raid6: avx2x1   xor() 17639 MB/s
Feb  9 19:02:12 virola kernel: [    3.469403] raid6: avx2x2   gen() 29472 MB/s
Feb  9 19:02:12 virola kernel: [    3.537402] raid6: avx2x2   xor() 18631 MB/s
Feb  9 19:02:12 virola kernel: [    3.605402] raid6: avx2x4   gen() 34131 MB/s
Feb  9 19:02:12 virola kernel: [    3.673401] raid6: avx2x4   xor() 21595 MB/s
Feb  9 19:02:12 virola kernel: [    3.673402] raid6: using algorithm avx2x4 gen() 34131 MB/s
Feb  9 19:02:12 virola kernel: [    3.673402] raid6: .... xor() 21595 MB/s, rmw enabled
Feb  9 19:02:12 virola kernel: [    3.673403] raid6: using avx2x2 recovery algorithm
Feb  9 19:02:12 virola kernel: [    3.674190] xor: automatically using best checksumming function   avx
Feb  9 19:02:12 virola kernel: [    3.674625] async_tx: api initialized (async)
Feb  9 19:02:12 virola kernel: [    3.707536] device-mapper: uevent: version 1.0.3
Feb  9 19:02:12 virola kernel: [    3.707601] device-mapper: ioctl: 4.39.0-ioctl (2018-04-03) initialised: dm-devel@redhat.com
Feb  9 19:02:12 virola kernel: [    4.280523] md: resync of RAID array md0
Feb  9 19:02:12 virola kernel: [    4.448908] EXT4-fs (dm-0): mounted filesystem with ordered data mode. Opts: (null)
Feb  9 19:02:12 virola kernel: [   62.996028] EXT4-fs (dm-0): re-mounted. Opts: errors=remount-ro

Mis à jour par Quentin Gibeaux il y a presque 5 ans

Rien de spécial dans les données smart :
/dev/sdb :

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   050    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0004   100   100   000    Old_age   Offline      -       0
  3 Spin_Up_Time            0x0027   100   100   001    Pre-fail  Always       -       6251
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       15
  5 Reallocated_Sector_Ct   0x0033   100   100   050    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000a   100   100   000    Old_age   Always       -       0
  8 Seek_Time_Performance   0x0004   100   100   000    Old_age   Offline      -       0
  9 Power_On_Hours          0x0032   001   001   000    Old_age   Always       -       40260
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       7
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       4
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       243
194 Temperature_Celsius     0x0022   100   100   000    Old_age   Always       -       35 (Min/Max 20/38)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
241 Total_LBAs_Written      0x0032   100   100   000    Old_age   Always       -       327674860307
242 Total_LBAs_Read         0x0032   100   100   000    Old_age   Always       -       218450628875

/dev/sda
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   050    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0004   100   100   000    Old_age   Offline      -       0
  3 Spin_Up_Time            0x0027   100   100   001    Pre-fail  Always       -       11252
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       15
  5 Reallocated_Sector_Ct   0x0033   100   100   050    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000a   100   100   000    Old_age   Always       -       0
  8 Seek_Time_Performance   0x0004   100   100   000    Old_age   Offline      -       0
  9 Power_On_Hours          0x0032   001   001   000    Old_age   Always       -       40271
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       7
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       4
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       237
194 Temperature_Celsius     0x0022   100   100   000    Old_age   Always       -       38 (Min/Max 20/41)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
241 Total_LBAs_Written      0x0032   100   100   000    Old_age   Always       -       314816304858
242 Total_LBAs_Read         0x0032   100   100   000    Old_age   Always       -       319256236691

Mis à jour par Christian P. Momon il y a presque 5 ans

  • Assigné à mis à Christian P. Momon

Avancement :

Personalities : [raid1] [linear] [multipath] [raid0] [raid6] [raid5] [raid4] [raid10] 
md0 : active raid1 sdb1[1] sda1[0]
      1920618880 blocks [2/2] [UU]
      [==========>..........]  resync = 52.8% (1014439296/1920618880) finish=1442.1min speed=10472K/sec

Mis à jour par Christian P. Momon il y a presque 5 ans

  • Statut changé de Nouveau à Résolu

Fin du resync :

(April) root@virola:~# date ; cat /proc/mdstat 
Tue 11 Feb 2020 03:19:07 AM UTC
Personalities : [raid1] [linear] [multipath] [raid0] [raid6] [raid5] [raid4] [raid10] 
md0 : active raid1 sdb1[1] sda1[0]
      1920618880 blocks [2/2] [UU]

Tout étant rentré dans l'ordre et puisque pas d'explication supplémentaire du phénomène, passage du ticket à résolu.

Mis à jour par Quentin Gibeaux il y a plus de 4 ans

  • Statut changé de Résolu à Fermé
Actions

Formats disponibles : Atom PDF