[asterisk-bugs] [JIRA] (ASTERISK-25911) chan_iax2: IAX Max Retries - hung IAX channels in Ring state - cannot clear channels until Asterisk restart
Richard Mudgett (JIRA)
noreply at issues.asterisk.org
Fri Apr 21 09:33:58 CDT 2017
[ https://issues.asterisk.org/jira/browse/ASTERISK-25911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=236627#comment-236627 ]
Richard Mudgett edited comment on ASTERISK-25911 at 4/21/17 9:33 AM:
---------------------------------------------------------------------
Okay. So we are still seeing this issue. Today we got it again:
{noformat}
[2017-04-21 11:52:41] WARNING[3509] chan_iax2.c: Max retries exceeded to host 185.161.127.73 on IAX2/osl1-voip-cluster01-upstream04-9829 (type = 6, subclass = 11, ts=387427, seqno=104)
[2017-04-21 11:52:41] WARNING[3504] chan_iax2.c: Max retries exceeded to host 185.161.127.72 on IAX2/osl1-voip-cluster01-upstream03-15532 (type = 6, subclass = 11, ts=180441, seqno=49)
[2017-04-21 11:52:41] WARNING[3500] chan_iax2.c: Max retries exceeded to host 185.161.127.73 on IAX2/osl1-voip-cluster01-upstream04-16195 (type = 6, subclass = 11, ts=1246488, seqno=71)
[2017-04-21 11:52:41] WARNING[29600] chan_iax2.c: Max retries exceeded to host 185.161.127.72 on IAX2/osl1-voip-cluster01-upstream03-7394 (type = 6, subclass = 2, ts=1959345, seqno=49)
[2017-04-21 11:52:41] WARNING[29358] chan_iax2.c: Max retries exceeded to host 185.161.127.72 on IAX2/osl1-voip-cluster01-upstream03-2294 (type = 6, subclass = 11, ts=230881, seqno=34)
[2017-04-21 11:52:41] WARNING[28199] chan_iax2.c: Max retries exceeded to host 185.161.127.73 on IAX2/osl1-voip-cluster01-upstream04-16195 (type = 6, subclass = 2, ts=1247489, seqno=72)
[2017-04-21 11:52:41] WARNING[26616] chan_iax2.c: Max retries exceeded to host 185.161.127.72 on IAX2/osl1-voip-cluster01-upstream03-7394 (type = 6, subclass = 11, ts=1958345, seqno=48)
{noformat}
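A minimal sketch, not part of the original comment, for tallying warnings like the ones above by peer and by frame kind. The frame type and subclass names are taken from RFC 5456 (frame type 6 is an IAX control frame; subclass 2 is PING and 11 is LAGRQ). The names `summarize`, `frame_name`, `LINE_RE`, and `SAMPLE` are hypothetical helpers introduced here for illustration, and the regular expression assumes the exact message layout shown in this log excerpt.

```python
import re
from collections import Counter

# Names from RFC 5456. Only the values relevant to this log are listed;
# anything else is reported numerically.
FRAME_TYPES = {6: "IAX"}
IAX_SUBCLASSES = {2: "PING", 3: "PONG", 4: "ACK", 5: "HANGUP", 11: "LAGRQ", 12: "LAGRP"}

# Assumes the warning layout seen in the excerpt above.
LINE_RE = re.compile(
    r"Max retries exceeded to host (?P<host>\S+) on (?P<chan>\S+) "
    r"\(type = (?P<type>\d+), subclass = (?P<sub>\d+), "
    r"ts=(?P<ts>\d+), seqno=(?P<seq>\d+)\)"
)

# The seven warnings quoted in this comment.
SAMPLE = """\
[2017-04-21 11:52:41] WARNING[3509] chan_iax2.c: Max retries exceeded to host 185.161.127.73 on IAX2/osl1-voip-cluster01-upstream04-9829 (type = 6, subclass = 11, ts=387427, seqno=104)
[2017-04-21 11:52:41] WARNING[3504] chan_iax2.c: Max retries exceeded to host 185.161.127.72 on IAX2/osl1-voip-cluster01-upstream03-15532 (type = 6, subclass = 11, ts=180441, seqno=49)
[2017-04-21 11:52:41] WARNING[3500] chan_iax2.c: Max retries exceeded to host 185.161.127.73 on IAX2/osl1-voip-cluster01-upstream04-16195 (type = 6, subclass = 11, ts=1246488, seqno=71)
[2017-04-21 11:52:41] WARNING[29600] chan_iax2.c: Max retries exceeded to host 185.161.127.72 on IAX2/osl1-voip-cluster01-upstream03-7394 (type = 6, subclass = 2, ts=1959345, seqno=49)
[2017-04-21 11:52:41] WARNING[29358] chan_iax2.c: Max retries exceeded to host 185.161.127.72 on IAX2/osl1-voip-cluster01-upstream03-2294 (type = 6, subclass = 11, ts=230881, seqno=34)
[2017-04-21 11:52:41] WARNING[28199] chan_iax2.c: Max retries exceeded to host 185.161.127.73 on IAX2/osl1-voip-cluster01-upstream04-16195 (type = 6, subclass = 2, ts=1247489, seqno=72)
[2017-04-21 11:52:41] WARNING[26616] chan_iax2.c: Max retries exceeded to host 185.161.127.72 on IAX2/osl1-voip-cluster01-upstream03-7394 (type = 6, subclass = 11, ts=1958345, seqno=48)
""".splitlines()


def frame_name(ftype, subclass):
    """Return a readable 'TYPE/SUBCLASS' label, falling back to numbers."""
    type_name = FRAME_TYPES.get(ftype, f"type={ftype}")
    if ftype == 6:
        sub_name = IAX_SUBCLASSES.get(subclass, f"subclass={subclass}")
    else:
        sub_name = f"subclass={subclass}"
    return f"{type_name}/{sub_name}"


def summarize(lines):
    """Count 'Max retries exceeded' warnings per peer host and per frame kind."""
    by_host = Counter()
    by_frame = Counter()
    for line in lines:
        match = LINE_RE.search(line)
        if match is None:
            continue  # not a max-retries warning
        by_host[match.group("host")] += 1
        by_frame[frame_name(int(match.group("type")), int(match.group("sub")))] += 1
    return by_host, by_frame


if __name__ == "__main__":
    hosts, frames = summarize(SAMPLE)
    for host, count in hosts.most_common():
        print(host, count)
    for frame, count in frames.most_common():
        print(frame, count)
```

Every warning in this excerpt is for a PING or LAGRQ frame, which suggests the retransmissions that gave up were keepalive and lag-measurement frames rather than call setup; the same function can be pointed at a full log file by passing it the open file object.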
What could cause this to happen suddenly? All servers are connected through a single switch. The network is dead simple and just "works".
When it happens, the Asterisk server becomes unstable and still needs a restart.
Is it really such a bad idea to interconnect Asterisk servers over IAX?
was (Author: woopstar): same text as the comment above, before the {noformat} markers were added around the log excerpt.
> chan_iax2: IAX Max Retries - hung IAX channels in Ring state - cannot clear channels until Asterisk restart
> -----------------------------------------------------------------------------------------------------------
>
> Key: ASTERISK-25911
> URL: https://issues.asterisk.org/jira/browse/ASTERISK-25911
> Project: Asterisk
> Issue Type: Bug
> Security Level: None
> Components: Channels/chan_iax2
> Affects Versions: 13.7.2, 13.9.0, 13.9.1
> Environment: Ubuntu server
> Reporter: Andreas Krüger
> Assignee: Unassigned
> Attachments: 01-08-2016-backtrace-threads.txt, 01-08-2016-core-show-channels-infos.txt, 01-08-2016-core-show-channels.txt, 01-08-2016-core-show-locks.txt, 01-08-2016-iax2-show-channels.txt, 01-08-2016-iax2-show-netstats.txt, 2016-06-15-backtrace-threads.txt, 2016-06-15-core-show-channels-infos.txt, 2016-06-15-core-show-channels.txt, 2016-06-15-core-show-locks.txt, 2016-06-15-iax2-show-channels.txt, 2016-06-15-iax2-show-netstats.txt, 2016-06-16-backtrace-threads.txt, 2016-06-16-core-show-channels-infos.txt, 2016-06-16-core-show-channels.txt, 2016-06-16-core-show-locks.txt, 2016-06-16-iax2-show-channels.txt, 2016-06-16-iax2-show-netstats.txt, 2016-06-17-backtrace-threads.txt, 2016-06-17-core-show-channels-infos.txt, 2016-06-17-core-show-channels.txt, 2016-06-17-core-show-locks.txt, 2016-06-17-iax2-show-channels.txt, 2016-06-17-iax2-show-netstats.txt, 2016-08-26-backtrace-threads.txt, 2016-08-26-core-show-channels-infos.txt, 2016-08-26-core-show-channels.txt, 2016-08-26-core-show-locks.txt, 2016-08-26-iax2-show-channels.txt, 2016-08-26-iax2-show-netstats.txt, 23-08-2016-backtrace-threads.txt, 23-08-2016-cli-output-full.txt, 23-08-2016-core-show-channels-infos.txt, 23-08-2016-core-show-channels.txt, 23-08-2016-core-show-locks.txt, 23-08-2016-iax2-show-channels.txt, 23-08-2016-iax2-show-netstats.txt, backtrace-threads.txt, core-show-channels-infos.txt, core-show-channels.txt, debug_log_25911_odn1-voip-cluster02-asterisk01, debug_log_25911_odn1-voip-cluster02-upstream01, iax2-show-channels.txt, iax2-show-netstats.txt, iax.conf, upload (1).png
>
>
> Hi there,
> We ran into a problem: when there is some load, but not high load, on some of our Asterisk servers, we suddenly see an IAX max retries error in the console.
> When this happens, everything stops working and we cannot get Asterisk to work again unless we restart the service (not the server).
> I tried to start Asterisk through GDB, but since Asterisk never crashes, GDB has nothing to show about the problem.
> I've also set up a monitoring tool to check for network glitches, and it has not recorded any.
> I've also tried increasing the max retries in chan_iax2.c and recompiling Asterisk, as I've read on some forums that this should resolve the issue, but it did not help either.
> {code}
> sed -i "s/static int max_retries = 4;/static int max_retries = 12;/" channels/chan_iax2.c
> {code}
> I've attached the console output we see. These messages just keep popping up and do not seem to end. To me this looks like some cleanup in chan_iax2.c is not working when max retries is reached. The error we're facing happens on this line:
> https://github.com/asterisk/asterisk/blob/13.7/channels/chan_iax2.c#L3572
> I could use some advice on how to debug this problem further and resolve it, because when this error happens, Asterisk does not work at all until it is restarted.
> The problem is intermittent and I have a hard time reproducing it, but we see it when the load increases. Doing 10k calls within 7 hours seems to make it happen.
> I looked into the code and see that it uses a reference to a callno, which to me looks like a counter that increases. Could we be seeing some sort of race condition, or could the callno run out of scope?
--
This message was sent by Atlassian JIRA
(v6.2#6252)