cPanel appears to have recently corrected the auto-detection for SATA drives. When this correction went out, we got a flood of support requests asking about SMART emails being received. Almost all of the requests were regarding errors that occurred in the past.
AndyReed posted details about old errors in a post here:
http://forums.cpanel.net/showthread.php?t=53613#4
But we're also noticing that reports are being generated because attribute 190 has an "In_the_past" value for WHEN_FAILED. This attribute is Airflow_Temperature_Cel, and it indicates when the drive's temperature exceeded a threshold. This is certainly good information to have if your drive is running hot, but after you install a fan and get it running cool again, the WHEN_FAILED value will continue to trip SMART email alerts.
I've already contacted Seagate and have confirmed that it is impossible to reset the attributes of the drive.
The smartd SMART daemon uses /etc/smartd.conf to control which options and drives should be monitored with SMART checks. Within this file, there is a -I option which allows you to ignore certain attributes in error reports. cPanel's smartcheck script does not use this file, so we can't omit attributes from the error alerts. Is there a sanctioned method of ignoring the "In_the_past" value for a given attribute in /scripts/smartcheck?
If not, we'll probably disable /scripts/smartcheck across all of our servers and continue to use the smartd service to monitor drives instead. It might be prudent to have cPanel simply set up /etc/smartd.conf and let the SMART daemon do the actual monitoring. You could even add smartd to tailwatchd for good measure.
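To illustrate the kind of filtering I'm asking about, here is a rough Python sketch. It is not how /scripts/smartcheck works; the function name failed_attributes and the ignore_past_ids parameter are made up for this example, and the sample text only imitates the attribute table that smartctl -A prints. The idea is to keep alerting on anything that is FAILING_NOW, but to skip "In_the_past" entries for attribute IDs the admin has chosen to ignore.

```python
# Hypothetical sketch: filter a "smartctl -A" style attribute table so that
# "In_the_past" failures for selected attribute IDs do not raise alerts.
# SAMPLE is illustrative data, not output captured from a real drive.

SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   114   099   006    Pre-fail  Always       -       77777
  5 Reallocated_Sector_Ct   0x0033   030   030   036    Pre-fail  Always   FAILING_NOW 1500
190 Airflow_Temperature_Cel 0x0022   058   040   045    Old_age   Always   In_the_past 42 (Lifetime Min/Max 25/60)
"""


def failed_attributes(smartctl_output, ignore_past_ids=frozenset()):
    """Return (id, name, when_failed) for every attribute worth alerting on.

    Attributes whose WHEN_FAILED column is "In_the_past" are skipped when
    their ID is listed in ignore_past_ids. Current failures are never skipped.
    """
    alerts = []
    for line in smartctl_output.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and have at least 10 columns;
        # this also skips the header row and any blank lines.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        attr_id = int(fields[0])
        name = fields[1]
        when_failed = fields[8]  # ninth column is WHEN_FAILED
        if when_failed == "-":
            continue  # never failed
        if when_failed == "In_the_past" and attr_id in ignore_past_ids:
            continue  # old failure the admin chose to ignore
        alerts.append((attr_id, name, when_failed))
    return alerts


if __name__ == "__main__":
    for attr_id, name, when_failed in failed_attributes(SAMPLE, {190}):
        print(f"ALERT: attribute {attr_id} ({name}) WHEN_FAILED={when_failed}")
```

In practice the text would come from running smartctl -A against the drive rather than from a hard-coded sample, and the list of ignored IDs would presumably live in a config file. Only past failures are suppressed, so a drive that overheats again would still be reported.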