[asterisk-users] 1.4.20.1 hang -- extra info + gdb hangs
Steve Totaro
stotaro at totarotechnologies.com
Wed Jun 11 06:33:50 CDT 2008
On Wed, Jun 11, 2008 at 6:23 AM, Ex Vito <ex.vitorino at gmail.com> wrote:
> Here is an update,
>
> 1. Reviewed 'core show locks' with the help of russellb @ #asterisk-devs
> last friday
>
> 2. Recommended recompilling asterisk with DONT_OPTIMIZE and
> getting a stack trace with:
> # gdb /usr/sbin/asterisk $(pidof asterisk)
> (gdb) set pagination off
> (gdb) thread apply all bt
>
> We did reinstall asterisk with the new compile flags back then and just
> experienced another hang now (weekend, monday and tuesday
> were very low activity days).
>
> Unfortunatelly, gdb seems to hang on startup, after what seems to be a
> thread list. It never gets to the "reading symbols from..." steps. As such,
> no gdb prompt -> no stack trace ! :-/
>
> ps shows gdb process as <defunct> and, as such, it responds to no signals;
> asterisk seems to not respond to signals as well... (maybe that's why gdb
> hangs... I really do not know how gdb works in regards to attaching itself
> to a running process)
>
> Again we have a 'core show locks' + 'core show threads' output from asterisk
> which we have no skills to read...
>
> Lastly, asterisk log displays 12x...
>
> [Jun 11 09:41:07] ERROR[4837] chan_sip.c: SIP transaction failed:
> 588233f5261d52ac621587ca327b5083 at 192.168.161.40
> [Jun 11 09:41:07] ERROR[4837] chan_sip.c: We could NOT get the channel
> lock for SIP/000e08de4cbe-097555c8!
>
> ...then...
>
> [Jun 11 09:41:19] WARNING[4837] chan_sip.c: Maximum retries exceeded
> on transmission 588233f5261d52ac621587ca327b5083 at 192.168.161.40 for
> seqno 102 (Critical Request)
>
> ...and finally about 1200 of these:
>
> [Jun 11 09:42:59] WARNING[4842] chan_iax2.c: Max retries exceeded to
> host 192.168.166.40 on IAX2/private-13779 (type = 6, subclass = 11,
> ts=40022, seqno=10)
>
> ...with several "combinations" of:
> - the number inside WARNING[xxx] -> 13 different
> - the host IP: 192.168.166.40 and 192.168.170.40
> - the iax channel -> 12 different
>
>
> Till today, our gut feelings were:
>
> 1. The TC400B installation / usage change
> (idea: asterisk responds to no signals because it is waiting in
> kernel space,
> maybe something's wrong with zaptel, wctc4xxp, our HW ?)
>
> 2. The activation of a voicemail account with MWI
>
> We now have an extra possibility:
>
> - This system exchanges IAX calls with several other systems
> - The hanging one is running asterisk 1.4.20.1, but all the others
> are running 1.4.19
> - The changelog from 1.4.19 -> 1.4.20.1 includes several chan_iax
> fixes --> could the absense of such fixes in this system's iax peers
> be leading it to hang ?
>
> Possibility:
> 3. Upgrade all peers to 1.4.20.1
>
>
> Again, if anyone can chime in with their contribution, thanks in advance.
>
> Question of the day: why on earth does gdb hang ?! (our guess: because
> asterisk does not respond to signals... now why ?!)
>
>
> Cheers,
> --
> exvito
>
Try switching from IAX to SIP.
Thanks,
Steve T
More information about the asterisk-users
mailing list