WHM Service Status Oddity

H

HelloJeff

Guest
Hi,

Forgive me for not having read the WHM manual entirely before asking questions. I have, however, searched through the online WHM documentation and searched these forums and google, and have not yet come up with an answer. So, here goes:

Under "Service Status" in WHM, 2 services on a host are showing as "failed" with the flashing red light icons. After logging into the host, I see that both services appear to be running just fine. I even restarted one of them, and refreshed the Service Status page and there were no changes, still the 2 "failed" services.

The 2 services are:

exim (exim-4.50-1_cpanel_smtpctl_av_rewrite_mm2_mmmtrap_exiscan_md5pass) failed
ftpd failed

While reloading the Service Status page, I ran a "ps -ef | grep cpanel" on the host, and saw that whostmgr was being executed with the ./servup parameter:

root 14296 14294 0 18:13 ? 00:00:00 /usr/local/cpanel/whostmgr/bin/whostmgr ./servup

I, however, know little about what exactly whostmgr is looking for when checking services (is it attempting to establish connections to the services, or running ps auxw and grepping the output, or both/neither/dude you're way off track?)

Here is output from ps, lsof, and netstat regarding both exim and ftpd on the host:


EXIM
-------------------
# ps -ef|grep exim
mailnull 20203 1 0 12:41 ? 00:00:06 /usr/bin/perl /usr/local/cpanel/bin/eximstats
mailnull 3035 1 0 17:25 ? 00:00:01 /usr/sbin/exim -bd -q60m
mailnull 3201 1 0 17:25 ? 00:00:00 /usr/sbin/exim -tls-on-connect -bd -oX 465
mailnull 3938 3035 0 17:53 ? 00:00:00 /usr/sbin/exim -bd -q60m
mailnull 14720 3035 0 17:54 ? 00:00:00 /usr/sbin/exim -bd -q60m
(extra output intentionally omitted)


# lsof -i | grep -i exim | grep LISTEN
exim 3035 mailnull 3u IPv4 -2060592704 TCP *:smtp (LISTEN)
exim 3201 mailnull 3u IPv4 -2060592688 TCP *:smtps (LISTEN)


FTPD
-------------------

root 26562 1 0 17:15 ? 00:00:00 pure-ftpd (SERVER) root 26656 1 0 17:15 ? 00:00:00 /usr/sbin/pure-authd -s /var/run/ftpd.sock -r /usr/sbin/pureauth


# netstat -an|grep ":21"
tcp 0 0 0.0.0.0:21 0.0.0.0:* LISTEN


If I knew how exactly whostmgr was checking for properly running services, it might provide better insight into the issue as far as troubleshooting. There doesn't appear to be a manpage for it, and the online documentation as far as I can tell just shows you how to run it. Thanks for any assistance.


edit: the version in use is 10.2.0-STABLE_83

Also, running ./whostmgr ./servup I just see a "GET" to the webserver. I won't go into detail about if I used strace or not, because I'm not sure if that is a violation of policy or not. (Hey, at least I'm trying to answer the question myself :)

Just adding another update to say that an update to 10.2.0-CURRENT_131 was done, and the problem persists.
 
Last edited by a moderator:

aby

Well-Known Member
May 31, 2005
638
0
166
India
HelloJeff said:
Hi,

Forgive me for not having read the WHM manual entirely before asking questions. I have, however, searched through the online WHM documentation and searched these forums and google, and have not yet come up with an answer. So, here goes:

Under "Service Status" in WHM, 2 services on a host are showing as "failed" with the flashing red light icons. After logging into the host, I see that both services appear to be running just fine. I even restarted one of them, and refreshed the Service Status page and there were no changes, still the 2 "failed" services.

The 2 services are:

exim (exim-4.50-1_cpanel_smtpctl_av_rewrite_mm2_mmmtrap_exiscan_md5pass) failed
ftpd failed

While reloading the Service Status page, I ran a "ps -ef | grep cpanel" on the host, and saw that whostmgr was being executed with the ./servup parameter:

root 14296 14294 0 18:13 ? 00:00:00 /usr/local/cpanel/whostmgr/bin/whostmgr ./servup

I, however, know little about what exactly whostmgr is looking for when checking services (is it attempting to establish connections to the services, or running ps auxw and grepping the output, or both/neither/dude you're way off track?)

Here is output from ps, lsof, and netstat regarding both exim and ftpd on the host:


EXIM
-------------------
# ps -ef|grep exim
mailnull 20203 1 0 12:41 ? 00:00:06 /usr/bin/perl /usr/local/cpanel/bin/eximstats
mailnull 3035 1 0 17:25 ? 00:00:01 /usr/sbin/exim -bd -q60m
mailnull 3201 1 0 17:25 ? 00:00:00 /usr/sbin/exim -tls-on-connect -bd -oX 465
mailnull 3938 3035 0 17:53 ? 00:00:00 /usr/sbin/exim -bd -q60m
mailnull 14720 3035 0 17:54 ? 00:00:00 /usr/sbin/exim -bd -q60m
(extra output intentionally omitted)


# lsof -i | grep -i exim | grep LISTEN
exim 3035 mailnull 3u IPv4 -2060592704 TCP *:smtp (LISTEN)
exim 3201 mailnull 3u IPv4 -2060592688 TCP *:smtps (LISTEN)


FTPD
-------------------

root 26562 1 0 17:15 ? 00:00:00 pure-ftpd (SERVER) root 26656 1 0 17:15 ? 00:00:00 /usr/sbin/pure-authd -s /var/run/ftpd.sock -r /usr/sbin/pureauth


# netstat -an|grep ":21"
tcp 0 0 0.0.0.0:21 0.0.0.0:* LISTEN


If I knew how exactly whostmgr was checking for properly running services, it might provide better insight into the issue as far as troubleshooting. There doesn't appear to be a manpage for it, and the online documentation as far as I can tell just shows you how to run it. Thanks for any assistance.


edit: the version in use is 10.2.0-STABLE_83

Also, running ./whostmgr ./servup I just see a "GET" to the webserver. I won't go into detail about if I used strace or not, because I'm not sure if that is a violation of policy or not. (Hey, at least I'm trying to answer the question myself :)

Just adding another update to say that an update to 10.2.0-CURRENT_131 was done, and the problem persists.

Try /etc/init.d/chkservd start

----
Aby :)
 
H

HelloJeff

Guest
Thank you aby! I didn't verify the 2 services were still red before running chkservd, but after starting it, then checking the Service Status, everything is green, so I'm almost (100%) positive that did it. I'm reading this now so I can understand the process better: http://codenode.com/chkservdexplained

Thanks again.