
Wave of crashes with 2.10



Hi,

Today we had a nice wave of crashes in France (maybe elsewhere too):
our three 2.10 servers (two hubs and one leaf) crashed within less than
a minute of each other.


irc-test.club-internet.fr  -  hub.grolier.net  - irc.enst.fr - ( eu34 ) 
  
irc-test is Solaris 2.5.1 on SPARC, hub.grolier.net is Solaris 2.6 on x86,
and Samuel can give you the enst configuration (ipc, if I recall correctly).

My two servers were compiled with -g, so I have two nice core
files ...

On irc-test:

GDB 4.16 (sparc-sun-solaris2.5.1), Copyright 1996 Free Software Foundation, Inc...
Core was generated by `/opt/irc-kali/irc/ircnet/ircd -c /opt/irc-kali/irc/ircnet/ircd.conf -d /opt/irc'.
Program terminated with signal 11, Segmentation fault.
Reading symbols from /usr/lib/libz.so.1.0.4...done.
Reading symbols from /usr/lib/libm.so.1...done.
Reading symbols from /usr/lib/libnsl.so.1...done.
Reading symbols from /usr/lib/libsocket.so.1...done.
Reading symbols from /usr/lib/libc.so.1...done.
Reading symbols from /usr/lib/libdl.so.1...done.
Reading symbols from /usr/lib/libintl.so.1...done.
Reading symbols from /usr/lib/libmp.so.1...done.
Reading symbols from /usr/lib/libw.so.1...done.
Reading symbols from /usr/platform/SUNW,Ultra-Enterprise/lib/libc_psr.so.1...done.
#0  0x2f5e4 in close_chid (id=0x158 "") at ../ircd/s_id.c:114
114         if (timeofday - last > 900 || id[0] == current)
(gdb) bt
#0  0x2f5e4 in close_chid (id=0x158 "") at ../ircd/s_id.c:114
#1  0x2f754 in collect_chid () at ../ircd/s_id.c:167
#2  0x1f450 in collect_channel_garbage (now=907843938) at ../ircd/channel.c:3122
#3  0x22308 in io_loop (delay=361472) at ../ircd/ircd.c:1029
#4  0x222a0 in main (argc=0, argv=0xeffffc74) at ../ircd/ircd.c:1002
(gdb) up
#1  0x2f754 in collect_chid () at ../ircd/s_id.c:167
167                 if (close_chid((*chptr)->chname) == 0)
(gdb) up 
#2  0x1f450 in collect_channel_garbage (now=907843938) at ../ircd/channel.c:3122
3122            collect_chid();
(gdb) up
#3  0x22308 in io_loop (delay=361472) at ../ircd/ircd.c:1029
1029                    nextgarbage = collect_channel_garbage(timeofday);
(gdb) up
#4  0x222a0 in main (argc=0, argv=0xeffffc74) at ../ircd/ircd.c:1002
1002                    delay = io_loop(delay);
(gdb) list
997     #ifdef USE_SYSLOG
998             syslog(LOG_NOTICE, "Server Ready");
999     #endif
1000            timeofday = time(NULL);
1001            while (1)
1002                    delay = io_loop(delay);
1003    }
1004    
1005    
1006    time_t  io_loop(delay)

On hub.grolier.net:
Program terminated with signal 11, Segmentation Fault.
Reading symbols from /usr/lib/libm.so.1...done.
Reading symbols from /usr/lib/libnsl.so.1...done.
Reading symbols from /usr/lib/libsocket.so.1...done.
Reading symbols from /usr/lib/libc.so.1...done.
Reading symbols from /usr/lib/libdl.so.1...done.
Reading symbols from /usr/lib/libmp.so.2...done.
#0  collect_channel_garbage (now=907842925) at ../ircd/channel.c:3126
3126                    if (chptr->users == 0)
(gdb) bt
#0  collect_channel_garbage (now=907842925) at ../ircd/channel.c:3126
#1  0x8059ed8 in io_loop (delay=134955828) at ../ircd/ircd.c:1029
#2  0x8059e98 in main (argc=0, argv=0x8047c20) at ../ircd/ircd.c:1002
(gdb) up
Initial frame selected; you cannot go up.
(gdb) list
997     #ifdef USE_SYSLOG
998             syslog(LOG_NOTICE, "Server Ready");
999     #endif
1000            timeofday = time(NULL);
1001            while (1)
1002                    delay = io_loop(delay);
1003    }
1004    
1005    
1006    time_t  io_loop(delay)

I hope this helps. If you need any particular operation run on the cores,
I can do it (I'm no gdb expert ;))
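
As a starting point, these are the kinds of gdb commands that could be run
on the two cores to show the pointer values involved. This is only a sketch:
the variable name chptr is taken from the source lines printed in the
backtraces above, and whether it is in scope in a given frame is an
assumption until gdb confirms it.

```
# irc-test core: select collect_chid() (frame #1), which passed id=0x158 down
frame 1
print chptr
print *chptr
info locals

# hub.grolier.net core: the crash is in frame #0 at "chptr->users"
frame 0
print chptr
info locals
```

If print *chptr fails with a memory access error, that would suggest the
channel pointer itself is already invalid at that point.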


Gael
----------------------------------------------------------------------- 
Gael MARTINEZ                                Mailto:mgc@xxxxxxxxxxxxxxx
Grolier Interactive Europe OG / Club-Internet 
NOC - Server Center Operations Manager

Stay Zen .... UniX can do it.