upcp hanging on "checking nameservers"

MaraBlue

Well-Known Member
May 3, 2005
332
2
168
Carmichael, CA
cPanel Access Level
Root Administrator
Woke up this morning to intermittent outages, after upcp ran last night. Cpanel seems to have made changes to DNS/named. I should add the only sites/email that is up have backup DNS. Nothing was changed with our DNS / nameservers, except whatever cpanel's upcp did overnight.

I've run /scripts/upcp --force twice now, and each time it hangs on "checking nameservers and repairing nameserver config" forever. I kill that upcp process (to save anyone else from looking up the command...it's been years since I had to kill anything:
Code:
ps auxf | grep upcp
kill -9 [insert master process id here gotten from above command]
I understand there was a similar problem in late April. Last night's upcp email was very light, so I'm confused as to what could have changed.

After the second time I killed the upcp (when it hung on "repairing nameserver config", I attempted to rebuild named:

Code:
[email protected] [/etc]# mv /etc/named.conf /etc/named.old
[email protected] [/etc]# /scripts/rebuildnamedconf
named/
named/named.zero
named/localdomain.zone
named/named.broadcast
named/named.ip6.local
named/named.local
named/localhost.zone
Controls section not found, adding ...
Adding controls clause ...
And *again* it hung...

Which is where I'm at now. I'll add more to this as I know more, but I thought maybe if others are having the same issues (RELEASE tree, was automatic updates, but this is the LAST time I'll do that), maybe you could shed some light on what else I could check or how you fixed it.

The next thing I'll try is hand-editting named.conf, etc.

/scripts/fixeverything *also* hangs on "Repairing Nameserver Config....", but for the LIFE ON ME I don't see any changes! ??????
 
Last edited:

MaraBlue

Well-Known Member
May 3, 2005
332
2
168
Carmichael, CA
cPanel Access Level
Root Administrator
More that I've tried, nothing is fixing this

restored /etc/named.conf, httpd.conf
restarted both bind and httpd

Code:
/scripts/rebuildnamedconf
fails....and just hangs...


from last night's upcp:
Updating DNS Server...NSD is not the configured local nameserver.
Use /scripts/setupnameserver to change this setting.

Ran that, BIND restarted successfully (finally!)

Tried to add an A record for the hostname (again!) through WHM and got this:
Code:
Bind reloading on biscuit using rndc zone: [helloworldwebdesign.com] Error reloading bind on biscuit: rndc: connect failed: connection refused
Restarted BIND again from WHM
Code:
Attempting to restart named  	
Waiting for named to restart.... . . . . . . . . . . finished.

named status

named has failed, please contact the sysadmin (result was "named is not running"). Jun 29 13:55:36 biscuit named: named shutdown failed
Code:
[email protected] [/scripts]# service named start
named: already [email protected] [/scripts]# 

[email protected] [/scripts]# service named stop
Stopping named:                                            [FAILED]
[email protected] [/scripts]# service named start
named: already [email protected] [/scripts]# 

[email protected] [/scripts]# service named start
[email protected] [/scripts]# ./restartsrv_named
When I try to find if named is running or not:

Code:
[email protected] [/scripts]# ps auxf | grep named
root      2895  0.0  0.0  6628  568 pts/0    T    12:46   0:00          |           \_ more named.old
root      4579  0.0  0.3  7968 3340 pts/1    T    13:38   0:00          |           |       \_ /usr/bin/perl /scripts/restartsrv_named
root      4628  0.0  0.0  5272 1008 pts/1    T    13:41   0:00          |           \_ /bin/sh /sbin/service named status
root      4631  0.0  0.0     0    0 pts/1    Z    13:41   0:00          |           |   \_ [named] <defunct>
root      5253  0.0  0.0  6456  632 pts/1    S+   14:00   0:00          |           \_ grep named
root      3476  0.0  0.3  9652 3336 pts/2    T    13:04   0:00                          \_ /usr/bin/perl /scripts/restartsrv_named
When I attempt to restart named in WHM, I get:

Code:
named has failed, please contact the sysadmin (result was "named is not running").
When I start named from SSH, I get:

Code:
# service named start
named: already running
 
Last edited:

bradandersen

Active Member
Oct 6, 2003
42
0
156
Me too

I'm having the same problem. It looks like upcp/sysup hangs on the 'rndc status' request. Not sure what the problem is, but it is a pain.

Brad
 

bradandersen

Active Member
Oct 6, 2003
42
0
156
Resolved:

Originally Posted by thehostinghut
I had to do:
WHM:
Nameserver Setup <==== I ran this again and it started working!!!
 

eagle

Well-Known Member
Jan 17, 2003
139
0
166
Similar problem, on the release tree:

upcp hanging on

Downloading needed headers
No actions to take
...Done
Updating system packages...
Kill process and email arrives. upcp force did not help.

WHM 11.23.2 cPanel 11.23.4-R26138
CENTOS Enterprise 3.9 i686 on standard - WHM X v3.1.0
 

Snowman30

Well-Known Member
PartnerNOC
Apr 7, 2002
679
0
316
cPanel Access Level
DataCenter Provider
we have had 3 servers all running WHM 11.23.2 cPanel 11.23.3-C25461 do the same thing overnight

is there a solution to this yet?
 

Snowman30

Well-Known Member
PartnerNOC
Apr 7, 2002
679
0
316
cPanel Access Level
DataCenter Provider
ok we found a temporarly soultion

ps aux | grep named

kill -9 all the processes

then service named start

and it comes back online
 

eagle

Well-Known Member
Jan 17, 2003
139
0
166
Uhm, restarting named for upcp problem? Are you in the wrong thread ;) ?

We use other resolvers, so it can't be a non-resolving problem on the server itself (and the script should bail out I think if it was).
 

Snowman30

Well-Known Member
PartnerNOC
Apr 7, 2002
679
0
316
cPanel Access Level
DataCenter Provider
Uhm, restarting named for upcp problem? Are you in the wrong thread ;) ?

We use other resolvers, so it can't be a non-resolving problem on the server itself (and the script should bail out I think if it was).
well this has occured on 7 of our servers now and regardless of whether its a upcp or a bind issue if you have bind installed on the server regardless of whether you use it or not upcp hangs at the point where it checks Bind

give it a try
 

eagle

Well-Known Member
Jan 17, 2003
139
0
166
Alright, we will. The problem occurs intermittently, so we have to wait for it.

If it is, it will be yum checking for updates, as far as I can see.