[asterisk-dev] possible zaptel problem with SMP and RAID1
François Delawarde
fdelawarde at wirelessmundi.com
Mon Jul 9 06:53:05 CDT 2007
Hi again,
Tzafrir Cohen wrote:
> Hi
>
> On Mon, Jul 09, 2007 at 12:29:30PM +0200, François Delawarde wrote:
>
>> Hello,
>>
>> I thought this mail would be more appropriate in this mailing list, if
>> not sorry about it.
>>
>> I've been having interrupt problems since I'm trying to use analog
>> zaptel hardware (mainly openvox A400 and OPVXA1200) on two dual core
>> machines (AMD64 X2, different motherboards and network cards) with
>> software RAID1 in two SATA drives. These problems didn't occur on my
>> previous setups without any RAID.
>>
>
> What version of Zaptel do you use?
>
> Is it patched in any way?
>
> OPVXA1200 uses its own driver, originally based on wctdm.
>
Zaptel 1.4.3 with 1 line hookstate patch from bug 0008290 (adapted from
1.2.10 to 1.4.3)
I also tried non-patched Zaptels from 1.2 and 1.4 series.
>> The problem appears to happen randomly, a few times per minute (or
>> sometimes per 5 minutes), zttest utility drops to 60-90%, saying that I
>> had too many interrupts (showing lines like "8192 samples in 7212 sample
>> interval"). Along with that come an audible "bip" and some rare times a
>> small cut in conversation, or a small bit of echo during a very short
>> time. I'll add that a higher disk load (running dbench) appears to
>> increase a little the frequence of those problems (but not totally sure).
>>
>> zttool show no missed interrupts with watchdog option enable before
>> compilation. No shared interrupts. No IDE drives (related to possible
>> DMA problem). No frame buffer, console only server. Tried with all
>> PREEMPT kernel options, all HZ options, with and without IRQ balance,
>> trying SMP afinity to switch interrupts to another core, all without
>> result, except for PREEMPT options that makes zttest constantly report
>> 99.975586% instead of 100% when there are no problems.
>>
>> I'm no kernel expert, but since the only pattern I found in all tests
>> seemed to be related to RAID, I was wondering if spinlocks disabling
>> interrupts like RAID drivers seem to be doing in SMP configuration would
>> be the cause of delaying of zaptel interrupts, leading to the kind of
>> problems I have. Any idea on that?
>>
>
> First off, better preemption should generally help you. You need timely
> response (be that at the price of some throughput performance).
>
That's what I originally thought and thus tried those options to see if
it could resolve my problem. Right now, running on 2.6.21.6 with "Low
latency Desktop", and HZ=1000, without success.
Any idea?
>
>> For info, the problem occurred on these combinations of setups:
>> - OS: Debian etch (tested on sarge)
>> - Processors: two different AMD64 X2, one of each is in AM2 socket.
>> - Partitions: ext3 on RAID1 (tested with ext3 on LVM on RAID1 and ext3
>> on Encrypted LVM on RAID1)
>> - Custom kernel 2.6.21.6 with IMQ and Layer 7 (tested with 2.6.18 and
>> with/without these two patches, also tested with XEN kernel with
>> horrible, but expected results).
>> - zaptel 1.4.3 (tried 1.2 series, and 1.4 since 1.4.1).
>> - a few services: DNS, DHCP, Samba, PHP/MySQL interface, astmanproxy
>> (tested without any).
>>
>> Worked well on:
>> - OS: Debian etch
>> - AMD64 Sempron
>> - Kernel 2.6.18 with IMQ and Layer 7
>> - No RAID
>> - zaptel 1.4 series
>> - same services as above
>>
More information about the asterisk-dev
mailing list