[CentOS] SAS Hard Disk 15K rpm on CentOS 5.8

Kaushal Shriyan

kaushalshriyan at gmail.com
Mon Jul 30 05:10:15 UTC 2012


Hi,

I get the output below. Please help me understand what "SMART Health
Status: FAILURE PREDICTION THRESHOLD EXCEEDED: ascq=0x5 [asc=5d,
ascq=5]" means. Does it mean that SAS disk 0 is failing and needs to be
replaced? Also, do I need to run the health status test with different
options, flags, or switches?
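
Both reports below come from the same smartctl invocation, with only the
disk number after "megaraid," changed. A minimal sketch of that pattern in
Python 3, assuming the smartctl path and device node used in this post; the
helper name smartctl_argv is hypothetical and not part of smartmontools:

```python
# Sketch only: builds the command lines, it does not run smartctl.
SMARTCTL = "/usr/sbin/smartctl"  # path as used in this post


def smartctl_argv(disk_id, device="/dev/sda", options=("-a",)):
    """Return the argv for one physical disk behind a MegaRAID controller.

    disk_id is the number passed after "megaraid," (0 and 1 in this post).
    """
    if disk_id < 0:
        raise ValueError("disk_id must be non-negative")
    return [SMARTCTL, "-d", "megaraid,%d" % disk_id] + list(options) + [device]


if __name__ == "__main__":
    # Print the two command lines used for the reports in this post.
    for disk_id in (0, 1):
        print(" ".join(smartctl_argv(disk_id)))
```

The list can be passed to subprocess as-is if the script is run as root on
the host with the controller.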

/usr/sbin/smartctl -d megaraid,0 -a /dev/sda
smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.18-308.1.1.el5] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Vendor:               SEAGATE
Product:              ST3300657SS
Revision:             ES65
User Capacity:        300,000,000,000 bytes [300 GB]
Logical block size:   512 bytes
Logical Unit id:      0x5000c500477d63bb
Serial number:        6SJ46RZB
Device type:          disk
Transport protocol:   SAS
Local Time is:        Fri Jul 27 06:50:49 2012 IST
Device supports SMART and is Enabled
Temperature Warning Disabled or Not Supported
SMART Health Status: FAILURE PREDICTION THRESHOLD EXCEEDED: ascq=0x5 [asc=5d, ascq=5]

Current Drive Temperature:     33 C
Drive Trip Temperature:        68 C
Elements in grown defect list: 2048
Vendor (Seagate) cache information
  Blocks sent to initiator = 2518358542
  Blocks received from initiator = 3507177513
  Blocks read from cache and sent to initiator = 19620730
  Number of read and write commands whose size <= segment size = 35772014
  Number of read and write commands whose size > segment size = 103
Vendor (Seagate/Hitachi) factory information
  number of hours powered up = 3903.10
  number of minutes until next internal SMART test = 32

Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:    7599922     9416         0   7609338    7609338        360.930           0
write:         0        0         0         0          0       1851.172           0
verify: 96309427  1108629         0  97418056   97419201      62032.969        3455

Non-medium error count:        3

SMART Self-test log
Num  Test              Status                 segment  LifeTime  LBA_first_err [SK ASC ASQ]
     Description                              number   (hours)
# 1  Background long   Completed                  32       1                 - [-   -    -]
# 2  Background short  Completed                  32       1                 - [-   -    -]

Long (extended) Self Test duration: 3200 seconds [53.3 minutes]

[root@~]# /usr/sbin/smartctl -d megaraid,1 -a /dev/sda
smartctl 5.42 2011-10-20 r3458 [x86_64-linux-2.6.18-308.1.1.el5] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Vendor:               SEAGATE
Product:              ST3300657SS
Revision:             ES65
User Capacity:        300,000,000,000 bytes [300 GB]
Logical block size:   512 bytes
Logical Unit id:      0x5000c500477d49fb
Serial number:        6SJ44JEH
Device type:          disk
Transport protocol:   SAS
Local Time is:        Fri Jul 27 06:50:55 2012 IST
Device supports SMART and is Enabled
Temperature Warning Disabled or Not Supported
SMART Health Status: OK

Current Drive Temperature:     32 C
Drive Trip Temperature:        68 C
Elements in grown defect list: 0
Vendor (Seagate) cache information
  Blocks sent to initiator = 1288378070
  Blocks received from initiator = 3519302272
  Blocks read from cache and sent to initiator = 19977846
  Number of read and write commands whose size <= segment size = 35969964
  Number of read and write commands whose size > segment size = 102
Vendor (Seagate/Hitachi) factory information
  number of hours powered up = 3903.00
  number of minutes until next internal SMART test = 32

Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:   16718097        0         0  16718097   16718097        352.119           0
write:         0        0         0         0          0       1810.643           0
verify: 197332427        0         0  197332427   197332427       5156.549           0

Non-medium error count:        0

SMART Self-test log
Num  Test              Status                 segment  LifeTime  LBA_first_err [SK ASC ASQ]
     Description                              number   (hours)
# 1  Background long   Completed                  32       1                 - [-   -    -]
# 2  Background short  Completed                  32       1                 - [-   -    -]

Long (extended) Self Test duration: 3200 seconds [53.3 minutes]
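
The two reports differ mainly in the health status line, the grown defect
list, and the error counters. A minimal sketch in Python 3 for pulling the
first two fields out of saved "smartctl -a" text, assuming the SCSI/SAS
output layout shown in this post. The names summarize and needs_attention
are hypothetical, and the "attention" rule is only an illustrative
heuristic, not a vendor or smartmontools threshold:

```python
import re

# Sample lines copied from the two reports in this post.
DISK0 = """Serial number:        6SJ46RZB
SMART Health Status: FAILURE PREDICTION THRESHOLD EXCEEDED: ascq=0x5 [asc=5d, ascq=5]
Elements in grown defect list: 2048
"""

DISK1 = """Serial number:        6SJ44JEH
SMART Health Status: OK
Elements in grown defect list: 0
"""


def summarize(report):
    """Extract serial number, health status, and grown defect count."""
    def field(pattern):
        match = re.search(pattern, report, re.MULTILINE)
        return match.group(1).strip() if match else None

    defects = field(r"^Elements in grown defect list:\s*(\d+)")
    return {
        "serial": field(r"^Serial number:\s*(\S+)"),
        "health": field(r"^SMART Health Status:\s*(.+)$"),
        "grown_defects": int(defects) if defects is not None else None,
    }


def needs_attention(summary):
    """Assumed heuristic: health is not OK, or any grown defects exist."""
    return summary["health"] != "OK" or (summary["grown_defects"] or 0) > 0


if __name__ == "__main__":
    for text in (DISK0, DISK1):
        info = summarize(text)
        print(info["serial"], info["health"], info["grown_defects"],
              needs_attention(info))
```

The same functions can be fed the full saved output of each smartctl run
instead of the shortened samples.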

Regards

Kaushal


