[Asterisk-Users] NMI issues...
Michael Loftis
mloftis at wgops.com
Thu Jan 20 18:05:42 MST 2005
I dont' think it's the digium hardware fault. Why? I've got probably 8-10
machines like this and they all get occasional NMIs with the 'uhhuh'
message. It's defiently kernel related...hell these boards don't work
under anything older than 2.4.27.
--On Tuesday, December 21, 2004 00:42 -0600 "I put the Who? in Mishehu"
<mishehu at shavedgoats.net> wrote:
> I have read thru what other users have tried in this list when they have
> experienced seemingly similar issues to what I have, without success. I
> suspect there might be an issue regarding both the X100P and TDM04b cards
> being used in an Intel SE7525GP2 motherboard, as I had to even wait for a
> BIOS update from Intel in order to utilize my 3ware 9500 SATA raid
> controller on it (3ware Kb -->
> http://www.3ware.com/kb/article.aspx?id=12435 ). The following is a list
> of the configuration and the issues I have encountered. Any help would be
> greatly appreciated.
>
> System A:
> Intel SE7525GP2 motherboard. Has BIOS revision 06, BMC 2.40 and FRUSDR
> 1.40 2x2.8GHz Xeon processors w/800Mhz FSB, actively cooled retail box set
> 1GB of ECC registered DDR333 RAM, 2 DIMMs
> 3ware 9500 SATA Raid, controlling a 3-disk raid5 array
> 550 Watt 24-pin ATX power supply
>
> System B:
> Asus A7N8X-E Deluxe
> AMD Athlon XP 3200 (barton core)
> 512 MB RAM
> ATI 9600SE OEM card
> 200GB SATA drive
>
> Process is as follows:
> 1. On system A until further notice: The Intel SE7525GP2 conflicted with
> the firmware on the 3ware 9500 SATA RAID controller until BIOS 06 was
> released.
> 2. After having been flashed, Slackware 10.0 was then installed on it.
> It currently runs stock kernel 2.6.9, udev 042, and the most recent
> hotplug from 2004-09-23.
> 3. Installed a digium-supplied X101P into the machine into a 32bit PCI
> slot. 4. Downloaded zaptel 1.0.3, compiled against kernel 2.6.9. Upon
> modprobe wcfxo, NMI's are immediately reported (syslog to follow). The
> instructions from README.udev were followed and parameters inserted into
> the rules file for udev. Problem occurs regardless of whether or not
> hotplug inserts the modules or the are manually inserted by me. Repeated
> modprobe -r wcfxo; modprobe wcfxo produces no different result as well. 5.
> Attempted all combinations of moving the card to other compatible slots,
> CVS zaptel from 2004-12-17, and the TDM04b card. Snippet of syslog as
> follows:
>
> Dec 19 15:33:17 nisui kernel: Freshmaker version: 71
> Dec 19 15:33:17 nisui kernel: Freshmaker passed register test
> Dec 19 15:33:18 nisui kernel: Module 0: Installed -- AUTO FXO (FCC mode)
> Dec 19 15:33:18 nisui kernel: Uhhuh. NMI received for unknown reason 31 on
> CPU 0.
> Dec 19 15:33:18 nisui kernel: Dazed and confused, but trying to continue
> Dec 19 15:33:18 nisui kernel: Do you have a strange power saving mode
> enabled?
> Dec 19 15:33:18 nisui kernel: Module 1: Installed -- AUTO FXO (FCC mode)
> Dec 19 15:33:18 nisui kernel: Module 2: Installed -- AUTO FXO (FCC mode)
> Dec 19 15:33:19 nisui kernel: Module 3: Installed -- AUTO FXO (FCC mode)
> Dec 19 15:33:19 nisui kernel: Found a Wildcard TDM: Wildcard TDM400P REV
> E/F (4 modules)
> Dec 19 15:33:19 nisui kernel: Uhhuh. NMI received for unknown reason 31 on
> CPU 0.
> Dec 19 15:33:19 nisui kernel: Dazed and confused, but trying to continue
> Dec 19 15:33:19 nisui kernel: Do you have a strange power saving mode
> enabled?
> Dec 19 15:33:19 nisui kernel: Uhhuh. NMI received for unknown reason 21 on
> CPU 0.
>
> The NMI's will continue to report reasons 21 & 31 until the module is
> removed from the kernel.
>
> 6. Attempted several kernel reconfigurations, including ones that
> stripped out USB support and all SMP support. Even on this machine with a
> uniprocessor kernel, NMI was still generated.
> 7. Took the X100P and installed it in System B. System B only reports:
> Notice: Configuration file is /etc/zaptel.conf
> line 143: Unable to open master device '/dev/zap/ctl'
> Whenever I check, /dev/zap/ctl does exist. No NMI is generated on this
> machine, though I do not know if it is capible of doing so. Otherwise, it
> appears to operate normally.
> 8. Tried asterisklivecd that is maintained by somebody in Italy.
> Actually got a freshmaker error on that trial, though it appeared to be
> somewhat outdated.
>
> Just to anticipate questions, whenever the TDM04b was tested, the power
> connector was in fact connected. An Intel Etherexpress Pro 100 ethernet
> card was tested on Machine A to verify that there was no problem with the
> slots. The eepro100 initializes and operates perfectly. Attached to this
> email is a copy of the kernel's configuration, and a list of currently
> loaded modules is as follows:
>
> Module Size Used by
> md5 4992 1
> ipv6 260352 16
> hw_random 6548 0
> pciehp 97540 0
> shpchp 101124 0
> pci_hotplug 13060 2 pciehp,shpchp
> e1000 86788 0
> evdev 10368 0
>
> And output from /proc/interrupts immediately after loading the wctdm
> module: CPU0 CPU1 CPU2 CPU3
> 0: 3791068 10 10 10 IO-APIC-edge timer 1:
> 7 1 0 0 IO-APIC-edge i8042 9:
> 0 0 0 0 IO-APIC-level acpi
> 12: 57 0 0 1 IO-APIC-edge i8042 14:
> 0 1 0 0 IO-APIC-edge ide0 169:
> 4222 0 1 0 IO-APIC-level eth0 185:
> 0 0 0 0 IO-APIC-level ehci_hcd 209:
> 22767 0 1 0 IO-APIC-level 3w-9xxx 217:
> 1063 0 0 1 IO-APIC-level wctdm NMI:
> 63 0 0 0
> LOC: 3791179 3791182 3791180 3791179
> ERR: 0
> MIS: 0
>
> Thanks in advance,
>
> mishehu
--
GPG/PGP --> 0xE736BD7E 5144 6A2D 977A 6651 DFBE 1462 E351 88B9 E736 BD7E
More information about the asterisk-users
mailing list