[Zope] zope timeouts

Michele Marcionelli michele.marcionelli at math.ethz.ch
Tue Jul 20 11:00:17 EDT 2010


Ciao Jens

thank you for your answer. Now since I'm new with "gdb" I have some questions.
I did following. I created a file called "gdb_batch" with following content:

    info threads
    thread 1
    bt
    thread 2
    bt
    thread 3
    bt
    thread 4
    bt
    thread 5
    bt

and run the "gdb" command like this

    gdb python $(cat Z2.pid) --batch -x gdb_batch  > gdb_bt.log 2>&1

Can you maybe help me to "decode" the output (see attachment)?

For instance: what does this mean?

(gdb) info threads
  5 Thread 0xb712bb90 (LWP 7532)  0x005fd410 in __kernel_vsyscall ()
  4 Thread 0xb672ab90 (LWP 7533)  lookdict_string (mp=0xb78c4714, key=0xb7c19db8, hash=1293002269) at Objects/dictobject.c:331
  3 Thread 0xb5d29b90 (LWP 7534)  0x005fd410 in __kernel_vsyscall ()
  2 Thread 0xb5328b90 (LWP 7535)  0x005fd410 in __kernel_vsyscall ()
* 1 Thread 0xb7fc16c0 (LWP 7174)  0x005fd410 in __kernel_vsyscall ()

What does mean the "*"? And what is the "difference" between the threads 1,2,3,5 (with the __kernel_vsyscall) and the 4?

What should I look for?

Thank you,
Michele

-------------- next part --------------
A non-text attachment was scrubbed...
Name: gdb_bt.log
Type: application/octet-stream
Size: 56064 bytes
Desc: not available
Url : http://mail.zope.org/pipermail/zope/attachments/20100720/ae794869/attachment.obj 
-------------- next part --------------



On Jul 20, 2010, at 14:54 , Jens Vagelpohl wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> Hi Michele,
> 
> I would consider the coincience with strace "reviving" the process a red
> herring. When strace runs it does not influence the process you are
> watching with it, except for telling you what OS-level calls are made.
> 
> The most common reason for such timeouts are problems with software that
> tries to use other network resources during the running request to talk
> to services like RDBMS systems, LDAP, etc. If these background
> connections hang for some reason you'll end up with a hanging Zope
> thread. Enough of them and Zope itself appears to hang.
> 
> When your process hangs you should be able to attach to it using "gdb"
> and printing a stack trace for each thread. That may give you a big clue
> where the problem is.
> 
> jens
> 
> 
> On 7/20/10 13:53 , Michele Marcionelli wrote:
>> Hello
>> 
>> I'm running an old version of Zope which is required by our corporate design using also an older version of the CMS Silva:
>> 
>>    Linux Distribution: Red Hat Enterprise Linux Server release 5.5
>>    Linux Kernel: 2.6.18-194.8.1.el5PAE
>>    Zope: 2.8.9.1
>>    Python: 2.3.6 (with several modules, like: ldap, pil, mysql, pyxml, ...)
>>    Silva: 1.5.13
>> 
>> My problem ist that my zope server hat timeouts on displaying pages after a while that it has been started (currently this "while" is approx. 15 minutes). The only thing that helps is restarting zope.
>> 
>> By accident I discovered about two month ago, that running an "strace -f" on the zope process, "reactivated" it... so I could live with this hack until yesterday.
>> 
>> Now I'm running "strace" permanently (with all the disadvantages) and zope works "fine" for about 1 to 1.5 hours.
>> 
>> Here is a short recurring output of "strace":
>> 
>>> [pid 13408] <... futex resumed> )       = 0
>>> [pid 13406] futex(0xbcc12c0, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
>>> [pid 13405] futex(0xbcc12c0, FUTEX_WAKE_PRIVATE, 1 <unfinished ...>
>>> [pid 13408] futex(0xbcc12c0, FUTEX_WAKE_PRIVATE, 1 <unfinished ...>
>>> [pid 13407] <... futex resumed> )       = 0
>>> [pid 13405] <... futex resumed> )       = 1
>>> [pid 13408] <... futex resumed> )       = 1
>>> [pid 13405] futex(0xbcc12c0, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
>>> [pid 13408] futex(0xbcc12c0, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
>>> [pid 13406] <... futex resumed> )       = 0
>>> [pid 13406] futex(0xbcc12c0, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
>>> [pid 13407] futex(0xbcc12c0, FUTEX_WAKE_PRIVATE, 1) = 1
>>> [pid 13405] <... futex resumed> )       = 0
>>> [pid 13407] futex(0xbcc12c0, FUTEX_WAKE_PRIVATE, 1 <unfinished ...>
>>> [pid 13405] futex(0xbcc12c0, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
>>> [pid 13408] <... futex resumed> )       = 0
>>> [pid 13407] <... futex resumed> )       = 1
>>> [pid 13407] futex(0xbcc12c0, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
>>> [pid 13405] <... futex resumed> )       = -1 EAGAIN (Resource temporarily unavailable)
>> 
>> 
>> I need definitively help... Any idea?
>> 
>> Thx,
>> Michele


-- 
ETH Z?rich
Michele Marcionelli
IT Support Gruppe D-MATH
HG G 32.1
R?mistrasse 101
CH?8092 Z?rich

michele.marcionelli at math.ethz.ch
http://www.math.ethz.ch/~michele

+41 44 632 6193 Telefon
+41 44 632 1354 Fax



More information about the Zope mailing list