The Community Forums

Interact with an entire community of cPanel & WHM users!
  1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Smartcheck drive failure.

Discussion in 'General Discussion' started by paulm, Apr 26, 2007.

  1. paulm

    paulm Well-Known Member

    Joined:
    Oct 13, 2003
    Messages:
    60
    Likes Received:
    0
    Trophy Points:
    6
    I got my first smartcheck drive error last night, Not sure if it is some type of false reading but checked all my backups and they are on the secondary drive ready for use if needed.

    Not having actually done a restore from cPanel backups serverwide before I assume I will need to resetup cpanel, nameservers, bind, cpanel config wizard etc then just move all of the backups to home for a restore of all of them which will create my zone files, add to httpd.conf and everything else that needs to be done.

    My question is if there is anything else I should backup that cPanel would not keep in the site files themselves? Should I backup the httpd.conf and purftpd.conf or just let cPanel handle it when restoring the sites?
     
  2. paulm

    paulm Well-Known Member

    Joined:
    Oct 13, 2003
    Messages:
    60
    Likes Received:
    0
    Trophy Points:
    6
    Here are my errors:

    root@srv1 [/scripts]# ./smartcheck
    Using smartcheck config 5.32 for smartctl(5.1)
    Checking /dev/hda....
    Errors:
    SMART overall-health self-assessment test result: FAILED!
    Drive failure expected in less than 24 hours. SAVE ALL DATA.
    Failed Attributes:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE WHEN_FAILED RAW_VALUE
    5 Reallocated_Sector_Ct 0x0033 005 005 005 Pre-fail FAILING_NOW 1803

    Checking /dev/hdd....Ok
    Notification => me@domain.com via EMAIL [level => 3]
    root@srv1 [/scripts]# /usr/local/cpanel/3rdparty/bin/smartctl -a /dev/hda
    smartctl version 5.1-11 Copyright (C) 2002-3 Bruce Allen
    Home page is http://smartmontools.sourceforge.net/

    === START OF INFORMATION SECTION ===
    Device Model: HDS722512VLAT20
    Serial Number: VN631ECCD85KBD
    Firmware Version: V33OA6EA
    Device is: Not in smartctl database [for details use: -P showall]
    ATA Version is: 6
    ATA Standard is: ATA/ATAPI-6 T13 1410D revision 3a
    Local Time is: Thu Apr 26 05:57:59 2007 EDT
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled

    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: FAILED!
    Drive failure expected in less than 24 hours. SAVE ALL DATA.
    See vendor-specific Attribute list for failed Attributes.

    General SMART Values:
    Off-line data collection status: (0x00) Offline data collection activity was
    never started.
    Auto Off-line Data Collection: Disabled.
    Self-test execution status: ( 0) The previous self-test routine completed
    without error or no self-test has ever
    been run.
    Total time to complete off-line
    data collection: (2707) seconds.
    Offline data collection
    capabilities: (0x1b) SMART execute Offline immediate.
    Automatic timer ON/OFF support.
    Suspend Offline collection upon new
    command.
    Offline surface scan supported.
    Self-test supported.
    No Conveyance Self-test supported.
    No Selective Self-test supported.
    SMART capabilities: (0x0003) Saves SMART data before entering
    power-saving mode.
    Supports SMART auto save timer.
    Error logging capability: (0x01) Error logging supported.
    General Purpose Logging supported.
    Short self-test routine
    recommended polling time: ( 1) minutes.
    Extended self-test routine
    recommended polling time: ( 45) minutes.

    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0x000b 084 084 060 Pre-fail - 13435004
    2 Throughput_Performance 0x0005 100 100 050 Pre-fail - 0
    3 Spin_Up_Time 0x0007 192 192 024 Pre-fail - 154 (Average 157)
    4 Start_Stop_Count 0x0012 100 100 000 Old_age - 22
    5 Reallocated_Sector_Ct 0x0033 005 005 005 Pre-fail FAILING_NOW 1803
    7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail - 0
    8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail - 0
    9 Power_On_Hours 0x0012 098 098 000 Old_age - 20057
    10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail - 0
    12 Power_Cycle_Count 0x0032 100 100 000 Old_age - 22
    192 Power-Off_Retract_Count 0x0032 100 100 050 Old_age - 475
    193 Load_Cycle_Count 0x0012 100 100 050 Old_age - 475
    194 Temperature_Celsius 0x0002 203 203 000 Old_age - 27 (Lifetime Min/Max 19/40)
    196 Reallocated_Event_Count 0x0032 100 100 000 Old_age - 2523
    197 Current_Pending_Sector 0x0022 100 100 000 Old_age - 0
    198 Offline_Uncorrectable 0x0008 100 100 000 Old_age - 0
    199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age - 0

    SMART Error Log Version: 1
    No Errors Logged

    SMART Self-test log, version number 1
    No self-tests have been logged
     
  3. katmai

    katmai Well-Known Member

    Joined:
    Mar 13, 2006
    Messages:
    526
    Likes Received:
    0
    Trophy Points:
    16
    Location:
    Brno, Czech Republic
    i so much suggest getting a new drive and doing a restore before it's too late. no matter what, the backups could be corrupted if you keep using that drive. i had 3 drives crashing after had that error, and recovery was a pain. big one. strongly suggest not to ignore it and reload the os on a new drive, or replace that drive.
     
Loading...

Share This Page