The Community Forums


[cPanel smartcheck] Possible Hard Drive Failure Soon

Discussion in 'General Discussion' started by ncconquer, Oct 1, 2006.

  1. ncconquer

    ncconquer Well-Known Member

    Hi,
    We've been getting this message from a lot of our servers:

    S.M.A.R.T Errors on /dev/hda
    From Command: /usr/sbin/smartctl -q errorsonly -H -l selftest -l error /dev/hda
    Please note the following marginal Attributes:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    3 Spin_Up_Time 0x0007 067 001 011 Pre-fail Always In_the_past 5000
    ----END /dev/hda--
    Is anybody else having this problem?
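For readers unsure how to read the attribute row in the report above, here is a minimal Python sketch that splits the row into its columns and applies the usual smartmontools rule: an attribute counts as failed when its normalized VALUE is at or below THRESH, and WHEN_FAILED shows "In_the_past" when only WORST has been that low. The names `SmartAttr`, `parse_attr` and `classify` are made up for illustration and are not part of smartmontools or cPanel.

```python
from typing import NamedTuple


class SmartAttr(NamedTuple):
    attr_id: int
    name: str
    value: int    # current normalized value
    worst: int    # lowest normalized value recorded so far
    thresh: int   # vendor threshold
    attr_type: str
    when_failed: str
    raw: str


def parse_attr(line: str) -> SmartAttr:
    """Parse one row of the smartctl attribute table.

    Expected columns: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH
    TYPE UPDATED WHEN_FAILED RAW_VALUE
    """
    parts = line.split()
    return SmartAttr(
        attr_id=int(parts[0]),
        name=parts[1],
        value=int(parts[3]),
        worst=int(parts[4]),
        thresh=int(parts[5]),
        attr_type=parts[6],
        when_failed=parts[8],
        raw=" ".join(parts[9:]),
    )


def classify(attr: SmartAttr) -> str:
    """Return a rough status label (the labels are illustrative)."""
    if attr.value <= attr.thresh:
        return "failing_now"
    if attr.worst <= attr.thresh:
        return "failed_in_past"
    return "ok"


row = "3 Spin_Up_Time 0x0007 067 001 011 Pre-fail Always In_the_past 5000"
attr = parse_attr(row)
print(attr.name, classify(attr))
```

For the row in this report, VALUE (67) is above THRESH (11) but WORST (1) is not, which matches the "In_the_past" flag: the drive dipped below the threshold at some point and is above it now.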
     
  2. katmai

    katmai Well-Known Member

    I strongly recommend taking an offsite backup and reloading the OS on a new hard drive. This message indicates that the drive is likely to fail soon, and it is not kidding. I got this warning once and ignored it; the server eventually crashed, and restoring the data was a real pain.

    My advice: replace the hard drive ASAP.
     
  3. chirpy

    chirpy Well-Known Member

    Although, not always. While it can indicate the impending failure of the drive, it might not. If the error count doesn't increase, it could simply be an incompatibility between smartmontools and the drive. Search the forums; many threads have dealt with the same issue in the past.
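One way to check whether a count is actually increasing is to record the numbers from two reports taken some time apart and compare them. The sketch below assumes you have already copied the counters into two dictionaries by hand or by script; the function name `grown_counters` and the dictionary keys are purely illustrative.

```python
from typing import Dict


def grown_counters(earlier: Dict[str, int], later: Dict[str, int]) -> Dict[str, int]:
    """Return {name: increase} for counters that are higher in the later reading.

    Counters that appear in only one of the two readings are ignored.
    """
    return {
        name: later[name] - earlier[name]
        for name in later
        if name in earlier and later[name] > earlier[name]
    }
```

An empty result between two readings is consistent with the "not getting worse" case described above, though it does not prove the drive is healthy.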
     
  4. AndyReed

    AndyReed Well-Known Member
    PartnerNOC

    Although it depends on your hard drive's make and type, the RAW_VALUE of 5000 is what concerns me the most. Ask your data center to check the condition of the drive. Overall, I suggest you take katmai's advice.
     
  5. essentials

    essentials Member

    At the command prompt:

    dmesg

    See if any errors show up.

    If you see any errors, I'd have the DC mirror the failing drive and replace it. That is much easier than restoring. ;)
     
  6. ujr

    ujr Well-Known Member

    You may want to add that dmesg prints the kernel ring buffer, which includes the boot messages. It is also useful for checking whether a hard drive is failing, but it is not a sure-fire way.
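Since dmesg output can be long, a small filter helps narrow it to lines that mention the drive together with an error. The Python sketch below is one rough way to do that; the function name `disk_error_lines` and the keyword pattern ("error" or "fail") are assumptions chosen for illustration, so adjust them to whatever your kernel actually logs.

```python
import re
from typing import Iterable, List


def disk_error_lines(lines: Iterable[str], device: str = "hda") -> List[str]:
    """Keep kernel log lines that mention the device and look like errors.

    The device pattern also matches partitions such as hda1.
    The keyword list is a guess, not an exhaustive set of kernel messages.
    """
    dev = re.compile(r"\b" + re.escape(device) + r"\d*\b")
    err = re.compile(r"error|fail", re.IGNORECASE)
    return [ln for ln in lines if dev.search(ln) and err.search(ln)]
```

To feed it live data, pass in `subprocess.run(["dmesg"], capture_output=True, text=True).stdout.splitlines()`. As noted above, an empty result does not prove the drive is healthy.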
     
  7. katmai

    katmai Well-Known Member

    I suggested replacing the drive as a preventive measure. To keep a stable server and business, you don't want to take chances, such as the drive failing while you are on vacation. On top of that, you may lose important data, and the restore can be a real pain. A planned replacement is better than an unplanned one.
     
