[asterisk-users] Asterisk 1.4 reliability problems
Steve Totaro
stotaro at totarotechnologies.com
Tue Mar 18 06:40:10 CDT 2008
On Tue, Mar 18, 2008 at 5:40 AM, Ben Willcox
<ben.willcox at british-gymnastics.org> wrote:
> Hello All,
>
> We have been experiencing some ongoing reliability problems with
> Asterisk for quite some time, and I am trying to find out if anyone else
> has experienced the same problems.
>
> We are running asterisk 1.4.17~dfsg-2+b1 on Debian Lenny, with a Digium
> PRI card, and have approximately 120 sip peers, mostly Snom 360s, with a
> few Grandstream GXP2000 and a handful of Handytone 486 units.
>
> The symptoms, when they occur, are as follows:
>
> -The inability to receive incoming calls to our ISDN PRI (callers get a
> busy tone); this starts off intermittent but becomes permanent.
>
> -Asterisk cli commands work once, but then no longer return any data
> until disconnecting and reconnecting to the cli, i.e. sip show peers,
> show channels etc.
>
> -Internal SIP calls stop working
>
> -Calls remain stuck in queues, the queue members do not ring, and show
> as Busy when issuing a 'queue show' command.
>
>
> We've actually had this sort of problem for many months now, which
> originally started when we were running Asterisk 1.2 on Gentoo. We have
> done a large amount of fault finding and testing, which has involved a
> replacement ISDN card, a reinstall on completely different server
> hardware, and changing to Asterisk 1.4 on Debian Lenny.
>
> I believe there may be two separate issues here - we did track down one
> problem to our cacti and nagios monitoring scripts, which were
> connecting and disconnecting to the manager interface several times per
> minute, which eventually caused asterisk to give the above symptoms,
> although in addition to the above, asterisk would consume 100% cpu on
> the box, and eventually need a hard-reboot of the server. I posted about
> this to the list a few weeks ago, and it was confirmed that this could
> cause such a problem. After stopping these services the problems were
> much reduced.
>
> However, we have now completely disabled the manager interface
> (enabled=no in manager.conf), and yesterday the problem occurred again -
> a restart of asterisk got everything going again.
>
> So really I'm at a loss as to where to go from here. A colleague of mine
> also has the same problem at his site running Asterisk 1.4 on Debian
> Lenny; he has never used the manager interface, and has completely
> different server hardware and ISDN card, so I wonder if it's a Debian
> specific problem?
>
> One option is to try reverting back to Asterisk 1.2, but that isn't
> really a long-term solution. We also had major problems with 1.2 with
> our Snom 360 phones, as with any Snom firmware > 6.2.2 there was a
> serious problem whereby on hangup the channels were not cleared down,
> meaning we had many outgoing ISDN calls held open for many hours until
> we realised the problem. This problem does not occur in Asterisk 1.4,
> although we have many log messages such as:
>
> chan_sip.c: Remote host can't match request BYE to call <callid>
>
> so I don't know if this is anything to worry about?
>
> Any help would be gratefully received!
>
> Thanks,
> Ben
I have seen this when banging on the AMI, but you have eliminated that.
Why not try a different OS such as CentOS for now? That would be my next step.
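
On the "can't match request BYE" messages, it may help to see how many there are and whether they cluster on particular calls before deciding if they matter. A minimal Python sketch follows, assuming only that each log line contains the text quoted above followed by the call-id. The rest of the line format is not parsed, and the function name is made up for illustration.

```python
import re
from collections import Counter

# Substring taken from the message quoted above; the rest of each log line
# (timestamp, level, source line number) is assumed and deliberately ignored.
BYE_WARNING = re.compile(r"Remote host can't match request BYE to call\s+(\S+)")


def tally_bye_warnings(lines):
    """Return a Counter mapping call-id -> number of unmatched-BYE warnings."""
    counts = Counter()
    for line in lines:
        match = BYE_WARNING.search(line)
        if match:
            # Drop quotes or a trailing full stop that may surround the call-id.
            counts[match.group(1).strip("'\".")] += 1
    return counts
```

To use it, pass an open log file (the path depends on your logger.conf, so treat any path as an assumption) and look at counts.most_common(10) to see whether a handful of calls account for most of the warnings.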
Thanks,
Steve Totaro