The Community Forums

Interact with an entire community of cPanel & WHM users!
  1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

S.M.A.R.T. Should I be worried?

Discussion in 'General Discussion' started by haze, May 24, 2002.

  1. haze

    haze Well-Known Member

    Joined:
    Dec 21, 2001
    Messages:
    1,550
    Likes Received:
    3
    Trophy Points:
    38
    I just got the following email, should I be worried?

    IMPORTANT: Do not ignore this email.
    You should backup all the data on the hard drives listed below and replace them as soon as possible.
    S.M.A.R.T has detected that they are not peforming within normal operating paramaters.

    Excessive ATA Errors on disk /dev/hda. Please consider replacing this drive.

    SMART Error Log:
    SMART Error Logging Version: 1
    Error Log Data Structure Pointer: 05
    ATA Error Count: 16
    Non-Fatal Count: 0

    Error Log Structure 1:
    DCR FR SC SN CL SH D/H CR Timestamp
    08 00 08 4c a2 23 e0 ca 3147640
    08 00 08 24 d3 8a e0 c8 3147640
    08 00 60 2c d3 8a e0 c8 3147640
    08 00 18 8c d3 8a e0 c8 3147640
    08 da 00 00 4f c2 e0 b0 3147640
    00 04 00 0b 4f c2 e0 51 32124

    Error Log Structure 2:
    DCR FR SC SN CL SH D/H CR Timestamp
    08 00 08 7c a5 22 e0 ca 3582248
    08 00 08 8c 90 22 e0 ca 3582248
    08 00 02 41 00 00 e0 c8 3582248
    08 00 08 0c 30 03 e0 c8 3582248
    08 00 01 01 00 00 a0 08 3582260
    00 04 01 01 00 00 a0 51 32175

    Error Log Structure 3:
    DCR FR SC SN CL SH D/H CR Timestamp
    08 00 08 fc c0 76 e4 c8 3616438
    08 00 08 24 d3 8a e0 c8 3616438
    08 00 60 2c d3 8a e0 c8 3616438
    08 00 18 8c d3 8a e0 c8 3616439
    08 da 00 00 4f c2 e0 b0 3616439
    00 04 00 0b 4f c2 e0 51 32144

    Error Log Structure 4:
    DCR FR SC SN CL SH D/H CR Timestamp
    08 00 08 7c a5 22 e0 ca 519082
    08 00 08 8c 90 22 e0 ca 519082
    08 00 02 41 00 00 e0 c8 519082
    08 00 08 0c 30 03 e0 c8 519082
    08 00 01 01 00 00 a0 08 519095
    00 04 01 01 00 00 a0 51 32162

    Error Log Structure 5:
    DCR FR SC SN CL SH D/H CR Timestamp
    08 00 08 b4 a6 76 e4 c8 579232
    08 00 08 24 d3 8a e0 c8 579232
    08 00 60 2c d3 8a e0 c8 579232
    08 00 18 8c d3 8a e0 c8 579233
    08 da 00 00 4f c2 e0 b0 579233
    00 04 00 0b 4f c2 e0 51 32138
     
  2. Daniel

    Daniel Well-Known Member

    Joined:
    Aug 13, 2001
    Messages:
    165
    Likes Received:
    0
    Trophy Points:
    16
    I received an email like this also. The problem is it doesn't tell what server so I have no idea what server to check. :p
     
  3. kwimberl

    kwimberl Well-Known Member

    Joined:
    Aug 13, 2001
    Messages:
    123
    Likes Received:
    0
    Trophy Points:
    16
    Look at where the Email came from and it will give you the server name.
     
  4. haze

    haze Well-Known Member

    Joined:
    Dec 21, 2001
    Messages:
    1,550
    Likes Received:
    3
    Trophy Points:
    38
    Well, I got the DC to confirm there is a problem. I need to back everything up. This is a personal server, so I havent had any back up in place ( other than having a hard copy of my sites on my HD ). The question, is, how do I back up everything? I assume I will be needing to install CPanel again.
     
  5. TRAIN YARD SOFTWARE

    TRAIN YARD SOFTWARE Well-Known Member

    Joined:
    Dec 20, 2001
    Messages:
    224
    Likes Received:
    0
    Trophy Points:
    16
    SMART

    We just got this error yesterday. DC has replaced drive.


    -Ed
    TYS
     
  6. shaun

    shaun Well-Known Member

    Joined:
    Nov 9, 2001
    Messages:
    698
    Likes Received:
    0
    Trophy Points:
    16
    Location:
    San Clemente, Ca
    A client also got this message on one of our servers and i took the server down and ran the Manufacture Scan util on the drive and it came out good....
     
  7. shaun

    shaun Well-Known Member

    Joined:
    Nov 9, 2001
    Messages:
    698
    Likes Received:
    0
    Trophy Points:
    16
    Location:
    San Clemente, Ca
    A client also got this message on one of our servers and i took the server down and ran the Manufacture Scan util on the drive and it came out good....
     
  8. Brownie

    Brownie Well-Known Member

    Joined:
    Aug 10, 2001
    Messages:
    145
    Likes Received:
    0
    Trophy Points:
    16
    Im also getting this error :\
     
  9. DefHosting

    DefHosting Member

    Joined:
    May 23, 2002
    Messages:
    10
    Likes Received:
    0
    Trophy Points:
    1
    Got this error also the other day. Ran smartcheck and everything appears to be ok. Keeping a close eye on it though.
     
  10. Brownie

    Brownie Well-Known Member

    Joined:
    Aug 10, 2001
    Messages:
    145
    Likes Received:
    0
    Trophy Points:
    16
    can I just ask what kernel you're all running? I remember seeing a thread a long time ago about smart not liking a certain kernel.

    Im using 2.4.9-31
     
  11. TRAIN YARD SOFTWARE

    TRAIN YARD SOFTWARE Well-Known Member

    Joined:
    Dec 20, 2001
    Messages:
    224
    Likes Received:
    0
    Trophy Points:
    16
    Kernel Version 2.4.17 (SMP)
    Kernel Version 2.4.18 (SMP)
    Kernel Version 2.4.18 (SMP)
    Kernel Version 2.4.18 (SMP)
    Kernel Version 2.4.18 (SMP)
    Kernel Version 2.4.18 (SMP)
    Kernel Version 2.4.18 (SMP)
     
  12. bdraco

    bdraco Guest

    Since SMART errors are logged by the device itself it almost never wrong (unless there is something wrong with the drive, it which case it would be a good idea to replace it anyways). Check your dmesg as well, you will probably find disk errors.
     
  13. jumpdomain

    jumpdomain Well-Known Member

    Joined:
    Aug 12, 2001
    Messages:
    109
    Likes Received:
    0
    Trophy Points:
    16
    SMART check has been correct 100% of the time on our servers... Every time, there were also drive errors in the dmesg such as seek errors.
     
  14. TRAIN YARD SOFTWARE

    TRAIN YARD SOFTWARE Well-Known Member

    Joined:
    Dec 20, 2001
    Messages:
    224
    Likes Received:
    0
    Trophy Points:
    16
    [quote:f54f75b5a7][i:f54f75b5a7]Originally posted by bdraco[/i:f54f75b5a7]

    Since SMART errors are logged by the device itself it almost never wrong (unless there is something wrong with the drive, it which case it would be a good idea to replace it anyways). Check your dmesg as well, you will probably find disk errors.
    [/quote:f54f75b5a7]

    What is the OK time frame to fix problem. from right when email comes in at 3am or etc.?

    -Ed
    TYS
     
  15. Brownie

    Brownie Well-Known Member

    Joined:
    Aug 10, 2001
    Messages:
    145
    Likes Received:
    0
    Trophy Points:
    16
    its been confirmed by a tech - my servers drive is forked :\ Waiting on a replacement now
     
  16. bdraco

    bdraco Guest

    [quote:5c2e8c2eaf][i:5c2e8c2eaf]Originally posted by TRAIN YARD SOFTWARE[/i:5c2e8c2eaf]

    [quote:5c2e8c2eaf][i:5c2e8c2eaf]Originally posted by bdraco[/i:5c2e8c2eaf]

    Since SMART errors are logged by the device itself it almost never wrong (unless there is something wrong with the drive, it which case it would be a good idea to replace it anyways). Check your dmesg as well, you will probably find disk errors.
    [/quote:5c2e8c2eaf]

    What is the OK time frame to fix problem. from right when email comes in at 3am or etc.?

    -Ed
    TYS[/quote:5c2e8c2eaf]

    There are two levels of warnings. If you get and error that says
    & Please consider replacing this drive&, you probably have a while till failure. If you get &Disk Failure soon on ????& then you better backup everything before rebooting again.
     
  17. kwimberl

    kwimberl Well-Known Member

    Joined:
    Aug 13, 2001
    Messages:
    123
    Likes Received:
    0
    Trophy Points:
    16
    After having 16 drives all show that they are failing SMART with this new script, I began to really look into this.

    They are all of my Samsung drives.

    I spent nearly an hour on the phone with 3 samsung reps this afternoon. I had already run all of their own diagnostic utils on them and they all say they are fine.

    After much ado, it appears that this smartcheck is NOT compatible wth samsung drives. There is NOT a problem according to samsung with my drives.

    Anyway, I need a way to disable this check? I edited the script myself to disable it, but the rsync takes care of that after the next update.

    Nick?
     
  18. shaun

    shaun Well-Known Member

    Joined:
    Nov 9, 2001
    Messages:
    698
    Likes Received:
    0
    Trophy Points:
    16
    Location:
    San Clemente, Ca
    ya a cupple clients get this message too, i tell them to run manufacture util on them and they come back fine. one of them was a maxtor drive.
     
  19. Brownie

    Brownie Well-Known Member

    Joined:
    Aug 10, 2001
    Messages:
    145
    Likes Received:
    0
    Trophy Points:
    16
    [quote:82908c0f37][i:82908c0f37]Originally posted by kwimberl[/i:82908c0f37]

    After having 16 drives all show that they are failing SMART with this new script, I began to really look into this.

    They are all of my Samsung drives.

    I spent nearly an hour on the phone with 3 samsung reps this afternoon. I had already run all of their own diagnostic utils on them and they all say they are fine.

    After much ado, it appears that this smartcheck is NOT compatible wth samsung drives. There is NOT a problem according to samsung with my drives.

    Anyway, I need a way to disable this check? I edited the script myself to disable it, but the rsync takes care of that after the next update.

    Nick?[/quote:82908c0f37]

    chattr +i /scripts/nameofscript should stop the script been overwritten :)
     
  20. andyf

    andyf Well-Known Member

    Joined:
    Jan 7, 2002
    Messages:
    246
    Likes Received:
    0
    Trophy Points:
    16
    Location:
    UK
    I've got this error on a dev box, thats running a drive which has been working fine for 3 years running a windows OS, so I suspect the errors can also be caused by a mis-configuration and incorrect settings for the IDE controller, not just a failing disk.

    Andy
     
Loading...

Share This Page