  1. #1
    Registered User
    Join Date
    Dec 2004
    Posts
    3

    Hard Disk Problems

    OK,

    Really hoping somebody can help here. Our dedicated server started locking up a few days ago: we could ping it, but had no ssh or http access. Rebooting solved the problem.

    It happened again, then again. Some investigation showed:
    1) All the filesystems were READ-ONLY
    2) The following messages appeared in /var/log/messages

    Jan 30 11:24:26 srv1 kernel: blk: queue c0402f40, I/O limit 4095Mb (mask 0xffffffff)
    Jan 30 11:24:26 srv1 kernel: blk: queue c0403080, I/O limit 4095Mb (mask 0xffffffff)
    Jan 30 11:24:26 srv1 kernel: hda: status error: status=0x58 { DriveReady SeekComplete DataRequest }
    Jan 30 11:24:26 srv1 kernel:
    Jan 30 11:24:26 srv1 kernel: hda: drive not ready for command
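
    For readers following along: the status=0x58 value in the log above is a bitmask, and the names in braces are simply the bits that are set. A minimal Python sketch of that decoding, assuming the standard ATA status register layout and the labels the Linux IDE driver prints (the function name is made up for illustration):

```python
# ATA status register bits, highest bit first.
# Names follow the labels the Linux IDE driver prints in these messages.
ATA_STATUS_BITS = [
    (0x80, "Busy"),
    (0x40, "DriveReady"),
    (0x20, "DeviceFault"),
    (0x10, "SeekComplete"),
    (0x08, "DataRequest"),
    (0x04, "CorrectedError"),
    (0x02, "Index"),
    (0x01, "Error"),
]


def decode_ata_status(status):
    """Return the names of all bits set in an ATA status byte."""
    return [name for mask, name in ATA_STATUS_BITS if status & mask]


print(decode_ata_status(0x58))
# prints ['DriveReady', 'SeekComplete', 'DataRequest']
```

    Note that the Error bit (0x01) is not set here. DataRequest being set suggests the drive was still waiting on a data transfer when the kernel tried to issue the next command, which by itself does not say whether the drive, the cable, or the DMA configuration is to blame.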

    Has anybody seen this before? My guess is we have been hacked, or have a bad hard drive.

    HELP - any thoughts.

    Simon
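
    The read-only symptom from point 1 above can be checked without waiting for a lock-up. This is a rough Python sketch that scans text in the /proc/mounts format for filesystems mounted with the "ro" option; the sample data and device names are invented for illustration and are not from the affected server:

```python
def read_only_mounts(mounts_text):
    """Return mount points whose option list contains the exact option 'ro'."""
    result = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        mount_point = fields[1]
        options = fields[3].split(",")
        # Compare whole options so 'errors=remount-ro' does not match.
        if "ro" in options:
            result.append(mount_point)
    return result


# Hypothetical sample in /proc/mounts format: device, mount point, type, options.
SAMPLE_MOUNTS = """\
/dev/hda3 / ext3 ro,data=ordered 0 0
/dev/hda1 /boot ext3 rw,errors=remount-ro 0 0
/dev/hda5 /home ext3 ro 0 0
proc /proc proc rw 0 0
"""

print(read_only_mounts(SAMPLE_MOUNTS))
# prints ['/', '/home']
```

    On a live Linux box the same function could be fed the contents of /proc/mounts. ext2/ext3 filesystems can be set to remount read-only when the kernel hits I/O errors, so a filesystem that has flipped to "ro" on its own is usually worth treating as a symptom of the disk errors rather than a separate problem.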

  2. #2
    Member
    Join Date
    Dec 2001
    Posts
    1,558

    What OS is this? Have you clicked the option to enable DMA from within WHM? Have you tried turning off DMA to see if this resolves the problem? My guess is it may just be a misconfiguration.
    Beau Henderson

  3. #3
    chirpy, Super Moderator
    Join Date
    Jun 2002
    Location
    Go on, have a guess
    Posts
    13,495

    Also, if they're SMART capable, it might be worth building the latest smartmontools and doing some tests and reports on the drives:
    http://smartmontools.sourceforge.net/
    Jonathan Michaelson

    Need your cPanel servers secured and tuned?
    cPanel Server Configuration, Security, Recovery and Antivirus/AntiSpam Services
    Developers of the most effective (and free) Firewall & Security Solution for cPanel Servers - csf
    http://www.configserver.com

  4. #4
    jester.ro, cPanel Partner NOC
    Join Date
    Feb 2004
    Location
    Bucharest, Romania
    Posts
    304

    looks like a hdd preparing to crash.
    backup and ask the dc to change the harddrive.

  5. #5
    Member
    Join Date
    Dec 2001
    Posts
    1,558

    Originally posted by jester.ro:
        "looks like a hdd preparing to crash.
        backup and ask the dc to change the harddrive."

    Although that's a possibility, it's not a direct sign of failure. I've seen this sort of error most commonly with misconfigured DMA settings.
    Beau Henderson

  6. #6
    jester.ro, cPanel Partner NOC
    Join Date
    Feb 2004
    Location
    Bucharest, Romania
    Posts
    304

    Maybe, but turning DMA off drags performance down so far that you can't really use the server anymore.

    Run hdparm /dev/hda (if your drive is hda)
    and paste the results here.
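
    For anyone comparing results, the line to look for in that output is using_dma. Below is a small Python sketch that pulls the numeric settings out of hdparm-style output; the sample text is an illustrative assumption of what such output typically looks like, not output from the server in this thread, and the function name is made up:

```python
import re


def parse_hdparm_settings(output):
    """Map each 'name = number' line of hdparm output to an integer."""
    settings = {}
    for line in output.splitlines():
        match = re.match(r"\s*(\w+)\s*=\s*(\d+)", line)
        if match:
            settings[match.group(1)] = int(match.group(2))
    return settings


# Hypothetical sample output, for illustration only.
SAMPLE_OUTPUT = """
/dev/hda:
 multcount    = 16 (on)
 IO_support   =  0 (default 16-bit)
 unmaskirq    =  0 (off)
 using_dma    =  1 (on)
 keepsettings =  0 (off)
 readonly     =  0 (off)
 readahead    =  8 (on)
"""

settings = parse_hdparm_settings(SAMPLE_OUTPUT)
print("DMA enabled" if settings.get("using_dma") == 1 else "DMA disabled")
# prints DMA enabled
```

    hdparm -d /dev/hda reports just the DMA flag, and hdparm -d0 or -d1 turns it off or on for a quick test, keeping in mind the performance cost mentioned above.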

    Also, I only have Fedora 1 on my cPanel boxes, and I see that smartmontools is installed by default (never used it though).

    Run smartctl -a /dev/hda
    and look at the results.

    Mine look like this:


    ID# ATTRIBUTE_NAME          FLAG   VALUE WORST THRESH TYPE     UPDATED WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x000b 200   200   051    Pre-fail Always  -           0
      3 Spin_Up_Time            0x0007 100   253   021    Pre-fail Always  -           0
      4 Start_Stop_Count        0x0032 100   100   040    Old_age  Always  -           15
      5 Reallocated_Sector_Ct   0x0033 200   200   140    Pre-fail Always  -           0
      7 Seek_Error_Rate         0x000b 200   200   051    Pre-fail Always  -           0
      9 Power_On_Hours          0x0032 091   091   000    Old_age  Always  -           6686
     10 Spin_Retry_Count        0x0013 100   253   051    Pre-fail Always  -           0
     11 Calibration_Retry_Count 0x0013 100   253   051    Pre-fail Always  -           0
     12 Power_Cycle_Count       0x0032 100   100   000    Old_age  Always  -           15
    194 Temperature_Celsius     0x0022 109   006   000    Old_age  Always  -           34
    196 Reallocated_Event_Count 0x0032 200   200   000    Old_age  Always  -           0
    197 Current_Pending_Sector  0x0012 200   200   000    Old_age  Always  -           0
    198 Offline_Uncorrectable   0x0012 200   200   000    Old_age  Always  -           0
    199 UDMA_CRC_Error_Count    0x000a 200   200   000    Old_age  Always  -           0
    200 Multi_Zone_Error_Rate   0x0009 200   200   051    Pre-fail Offline -           0


    So the raw value (the last column) should be "0" for every ID that counts errors; otherwise the drive may be on its way out.
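
    That rule of thumb can be automated. The Python sketch below parses rows in the smartctl -a attribute format and flags two things: error-counting attributes with a non-zero raw value, and any attribute whose normalized value has dropped to its threshold. Which IDs count as "error" attributes is an assumption made here (5, 196, 197, 198, 199), and the helper names are made up; raw values for some attributes, such as read or seek error rates, are vendor-specific and can be non-zero on a healthy drive, so they are left out of the watch list.

```python
# Assumed watch list: reallocated sectors/events, pending sectors,
# offline uncorrectable sectors, and UDMA CRC errors.
WATCHED_IDS = {5, 196, 197, 198, 199}


def parse_smart_attributes(table_text):
    """Parse attribute rows: ID NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW."""
    rows = []
    for line in table_text.splitlines():
        fields = line.split()
        if len(fields) < 10 or not fields[0].isdigit():
            continue  # skip the header line and anything that is not a data row
        rows.append({
            "id": int(fields[0]),
            "name": fields[1],
            "value": int(fields[3]),
            "thresh": int(fields[5]),
            "raw": fields[9],
        })
    return rows


def suspicious(rows):
    """Return names of attributes that look worth investigating."""
    flagged = []
    for row in rows:
        at_threshold = row["thresh"] > 0 and row["value"] <= row["thresh"]
        nonzero_errors = row["id"] in WATCHED_IDS and row["raw"] != "0"
        if at_threshold or nonzero_errors:
            flagged.append(row["name"])
    return flagged


# A few rows copied from the output posted above.
SAMPLE_TABLE = """
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
194 Temperature_Celsius 0x0022 109 006 000 Old_age Always - 34
197 Current_Pending_Sector 0x0012 200 200 000 Old_age Always - 0
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0
"""

print(suspicious(parse_smart_attributes(SAMPLE_TABLE)))
# prints []
```

    An empty list matches the healthy drive shown above. A non-zero UDMA_CRC_Error_Count tends to point at cabling or the interface rather than the platters, which is relevant to the DMA discussion earlier in the thread.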

  7. #7
    Member
    Join Date
    Sep 2004
    Location
    Cleveland, Ohio
    Posts
    378

    This just happened to a friend of mine this past week. He updated to FC3, and a few days later LogWatch started reporting kernel errors about the hard drive very similar to those. I told managed.com about it and they confirmed that the drive is at fault. I got my friend to back up the accounts and submit a ticket to managed.com. It has been 2 days now and there is still no replacement or any acknowledgement (we have also submitted 2 tickets now, and no one is ever on the live support).
