[Zope] Need help : Zope servers hanging.

Tony McDonald tony.mcdonald@ncl.ac.uk
Mon, 31 Jan 2000 09:25:04 +0000


Hi all,
I need some help here. Over the past few days two different Zope 
servers have gone into a 'hanging' state, where they stop replying to 
further requests. When the first hang happened I didn't capture a 
'top' listing; today I managed to get one. The process causing the 
problem is 12482. From previous messages, I believe that the python 
process can go up to 100% CPU in this state. That isn't happening 
here: it is sitting at about 25%. The hang started when I asked the 
Zope server to make a MySQL query. The MySQL server itself is running 
fine, and I can get to it from a command-line interface.

Both servers are Zope 2.1.2 source distributions running under Solaris 2.6 (SunOS 5.6).

This server is running three different Zope sites, with Apache as the 
front end (i.e. I'm using pcgi to get to my servers). I can't reach the 
hung site via the pcgi route (i.e. a RewriteRule in Apache), nor 
directly through ZServer.

I also can't get to it from the monitor connection (telnet localhost 8099).

I can't let this situation continue, as these are live sites. At the 
moment I have to restart the server whenever this happens.
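
Since the hung process still exists but never answers, the hang can in principle be detected from outside by sending a request and waiting for any reply. The following is only a minimal sketch, assuming a modern Python 3 interpreter (not the Python these Zope 2.1.2 servers run on); the function name `server_responds`, the default timeout and the request bytes are hypothetical choices for illustration, not part of Zope or pcgi.

```python
import socket


def server_responds(host, port, timeout=10.0,
                    request=b"GET / HTTP/1.0\r\n\r\n"):
    """Return True if the server sends back at least one byte in time.

    A hung server may still accept the TCP connection (the kernel queues
    it), so connecting alone proves nothing; we wait for actual data.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.settimeout(timeout)
            sock.sendall(request)
            # An empty read means the peer closed without answering.
            return len(sock.recv(1)) > 0
    except OSError:
        # Refused connections, resets and timeouts all count as "no reply".
        return False
```

Run periodically (from cron, say) against whatever port the ZServer HTTP listener is bound to, a False result would be the cue to restart the server by hand or by script.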

Process list:

load averages:  1.39,  1.11,  0.63    08:51:59
262 processes: 258 sleeping, 2 zombie, 2 on cpu
CPU states: 74.6% idle, 25.0% user,  0.4% kernel,  0.0% iowait,  0.0% swap
Memory: 512M real, 25M free, 560M swap in use, 736M swap free

  PID USERNAME THR PRI NICE  SIZE   RES STATE   TIME    CPU COMMAND
12482 nnle       9  -5    0   38M   20M cpu/0 267:57 24.94% python
23032 nnle       1  23    0 1992K 1456K cpu/2   0:00  0.32% top
 6736 nnle       8  33    0   12M 9360K sleep   3:55  0.00% roxen
15072 nnle       8  33    0   14M   11M sleep   0:59  0.00% python
 1848 nnle       7  33    0   10M 7728K sleep   0:40  0.00% python
15071 nnle       4 -25    0 4240K 1304K sleep   0:00  0.00% python
 1847 nnle       4 -25    0 4240K  856K sleep   0:00  0.00% python
  656 nnle       1 -25    0  928K  512K sleep   0:00  0.00% start
12481 nnle       4  -5    0 4240K  856K sleep   0:00  0.00% python
18305 nnle       1  23    0 2056K 1832K sleep   0:00  0.00% tcsh
19302 nnle       1  23    0 2000K 1040K sleep   0:00  0.00% tcsh
19481 nnle       1  33    0 1000K  672K sleep   0:00  0.00% grep


The only other data I have is that the pcgi-wrapper for this site shows 
up in the process list quite a few times:
  nobody 23059  4659  0 08:54:28 ?        0:00 /home/nnle/MED_DUR_NOTTS/pcgi/pcgi-wrapper /home/nnle/MED_DUR_NOTTS/Zope.cgi
  nobody 23135  4716  0 09:03:18 ?        0:00 /home/nnle/MED_DUR_NOTTS/pcgi/pcgi-wrapper /home/nnle/MED_DUR_NOTTS/Zope.cgi
  nobody 23069  4574  0 08:57:03 ?        0:00 /home/nnle/MED_DUR_NOTTS/pcgi/pcgi-wrapper /home/nnle/MED_DUR_NOTTS/Zope.cgi
  nobody 23068  4753  0 08:56:52 ?        0:00 /home/nnle/MED_DUR_NOTTS/pcgi/pcgi-wrapper /home/nnle/MED_DUR_NOTTS/Zope.cgi
  nobody 23144  4694  0 09:04:11 ?        0:00 /home/nnle/MED_DUR_NOTTS/pcgi/pcgi-wrapper /home/nnle/MED_DUR_NOTTS/Zope.cgi
    nnle 12481     1  0   Jan 21 ?        0:00 /usr/local/bin/python /home/nnle/MED_DUR_NOTTS/z2.py
  nobody 23064  4737  0 08:55:59 ?        0:00 /home/nnle/MED_DUR_NOTTS/pcgi/pcgi-wrapper /home/nnle/MED_DUR_NOTTS/Zope.cgi
    nnle 12482 12481 25   Jan 21 ?       280:44 /usr/local/bin/python /home/nnle/MED_DUR_NOTTS/z2.py
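
To keep an eye on how many pcgi-wrapper processes pile up per site, a listing like the one above can be tallied mechanically. This is only a rough sketch, assuming modern Python 3 and assuming the text comes from something like `ps -ef`; the helper name `count_pcgi_wrappers` is hypothetical. It works on whitespace-separated tokens, so it does not matter whether the mail client has wrapped the long command lines.

```python
from collections import Counter


def count_pcgi_wrappers(ps_text):
    """Count pcgi-wrapper processes per target script in ps output.

    The token that follows a path ending in '/pcgi-wrapper' is taken to
    be the script the wrapper was started for (e.g. .../Zope.cgi).
    """
    tokens = ps_text.split()
    counts = Counter()
    for i, tok in enumerate(tokens[:-1]):
        if tok.endswith("/pcgi-wrapper"):
            counts[tokens[i + 1]] += 1
    return counts
```

A count that keeps growing for one Zope.cgi while the others stay flat would point at the site whose server process has stopped answering.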


*any* help at all on this would be really appreciated.
Tone

------
Dr Tony McDonald,  FMCC, Networked Learning Environments Project 
http://nle.ncl.ac.uk/
The Medical School, Newcastle University Tel: +44 191 222 5888
Fingerprint: 3450 876D FA41 B926 D3DD  F8C3 F2D0 C3B9 8B38 18A2