10-18-2004, 01:30 AM
Rubas
Registered User
 
Join Date: Sep 2003
Posts: 142
Server crash every 2nd day @ same time :)

Situation:
It is a brand new sister server (with only one small difference from the other cPanel server -> it has the latest firmware for the RAID controller).

WHM 9.9.0 cPanel 9.9.2-S8
Redhat EL ES 3
2.4.21-20.ELsmp

Dual Xeon 2.8 GHz
4 GB RAM
RAID 5 with 4 SCSI drives (Adaptec 2910 -> aacraid)

Now the server runs rock stable BUT every 2nd day at 2:00 AM it crashes!

The logs show me nothing interesting - only once did I find an entry about a SCSI error at the very end.
I think it is a problem like the "smart check" issue with SCSI RAID systems - but with the smart check on, the system crashed every time I ran upcp!

Okay, it has to be an issue with a cron job, but every cron job runs at least daily!

/var/log
Quote:
Oct 18 02:00:00 cpanel03 CROND[5295]: (root) CMD (/usr/local/sim/sim -q >> /dev/null 2>&1)
sim runs every 5mins
Oct 18 02:00:00 cpanel03 CROND[5299]: (root) CMD (/usr/local/sbin/spri -q >> /dev/null 2>&1)
spri runs every 45mins
Oct 18 02:00:00 cpanel03 CROND[5301]: (root) CMD (/usr/local/cpanel/bin/dcpumon >/dev/null 2>&1)
dcpumon runs every 5mins

Oct 18 02:00:00 cpanel03 CROND[5310]: (root) CMD (/scripts/upcp)

Oct 18 02:00:00 cpanel03 CROND[5297]: (root) CMD (/usr/local/sbin/lsm -c >> /dev/null 2>&1)
lsm runs every 10mins
Oct 18 02:00:00 cpanel03 CROND[5304]: (root) CMD (/bin/rm /tmp/cpanel.TMP* >>/dev/null 2>&1)
the /tmp/cpanel.TMP* cleanup runs every 60mins
Oct 18 02:00:00 cpanel03 CROND[5308]: (root) CMD (/usr/local/cpanel/whostmgr/bin/dnsqueue > /dev/null 2>&1)
dnsqueue runs every 15mins
Oct 18 02:00:01 cpanel03 crontab[5842]: (root) LIST (root)
Oct 18 02:00:01 cpanel03 crontab[5844]: (root) LIST (root)
Oct 18 02:00:01 cpanel03 crontab[5845]: (root) LIST (root)
Oct 18 02:00:01 cpanel03 crontab[5846]: (root) REPLACE (root)
/var/message
Quote:
Oct 18 02:00:06 cpanel03 proftpd[5272]: cpanel03.xxxx (127.0.0.1[127.0.0.1]) - FTP login timed out, disconnected
Oct 18 02:00:06 cpanel03 proftpd[5272]: cpanel03.xxxx (127.0.0.1[127.0.0.1]) - FTP session closed.
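
The cron entries quoted above can also be cross-checked mechanically instead of by eye. The following is only a rough sketch, assuming the cron log uses the same syslog-style CROND lines as in the quote; the function names are made up for illustration. It groups the started commands by day and reports any command that appears on every crash day but on no other day.

```python
import re
from collections import defaultdict

# Matches syslog-style cron lines such as:
# Oct 18 02:00:00 cpanel03 CROND[5310]: (root) CMD (/scripts/upcp)
CRON_LINE = re.compile(
    r"^(?P<day>\w{3}\s+\d{1,2}) (?P<time>\d{2}:\d{2}:\d{2}) \S+ "
    r"CROND\[\d+\]: \((?P<user>[^)]+)\) CMD \((?P<cmd>.*)\)\s*$"
)


def commands_by_day(lines):
    """Map each day (e.g. 'Oct 18') to the set of commands cron started."""
    days = defaultdict(set)
    for line in lines:
        match = CRON_LINE.match(line)
        if match:  # crontab LIST/REPLACE lines and other noise are skipped
            days[match.group("day")].add(match.group("cmd"))
    return dict(days)


def only_on_crash_days(days, crash_days):
    """Commands seen on every listed crash day but on no other day."""
    crash_sets = [days[d] for d in crash_days if d in days]
    if not crash_sets:
        return set()
    other = set().union(*(c for d, c in days.items() if d not in crash_days))
    return set.intersection(*crash_sets) - other
```

Feeding it the lines of the cron log and the crash dates (for example `{"Oct 18"}`) would show whether cron itself starts anything different on those days. An empty result would point back at something inside one of the daily jobs rather than at the crontab.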

It has to be an issue with Oct 18 02:00:00 cpanel03 CROND[5310]: (root) CMD (/scripts/upcp)

But upcp runs daily!
I can run upcp 10 times without a problem, and the only difference when I start it by hand is this check:
Quote:
if (!$ishuman) {
    system("/scripts/cpbackup");
}
On the crash days the server stops working before it can make the backup.
But I also have no problem running /scripts/cpbackup 10 times.


Like I said, it only crashes every second day, and everything runs at least every day!
I can't explain it - first I thought of the "smartcheck issue" and disabled it with "touch /var/cpanel/disablesmartcheck"; next I thought it had to do with the cpanellogd issue (cpanellogd spawning thousands of processes (fix)).

Right now I have no idea.

The next thing I will do is split up /scripts/upcp to test my upcp theory.
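
One possible way to narrow this down without editing upcp itself is to run it through a small wrapper that records every output line before the machine goes down. This is only a sketch under assumptions: the function name and the log path in the comment are hypothetical, and it only helps if the step that crashes the box prints something first.

```python
import os
import subprocess
import time


def run_logged(cmd, log_path):
    """Run cmd and copy each output line to log_path with a timestamp.

    Every line is flushed and fsync'ed, so the last line written before
    a hard crash has the best chance of actually reaching the disk.
    """
    with open(log_path, "a") as log:
        proc = subprocess.Popen(
            cmd, stdout=subprocess.PIPE, stderr=subprocess.STDOUT, text=True
        )
        for line in proc.stdout:
            stamp = time.strftime("%Y-%m-%d %H:%M:%S")
            log.write(f"{stamp} {line.rstrip(chr(10))}\n")
            log.flush()
            os.fsync(log.fileno())
        return proc.wait()


# Hypothetical usage (log path is an assumption, not a cPanel convention):
# run_logged(["/scripts/upcp"], "/root/upcp-trace.log")
```

Since the quoted snippet shows that upcp behaves differently when it is not started by a human, the wrapper would have to be started from cron to reproduce the 2:00 AM conditions. How upcp decides that is not shown here, so whether the wrapper changes that decision is an open question.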

What does upcp do only every 2nd day and not every day?

Last edited by Rubas; 10-18-2004 at 01:35 AM.