Redhat 7.2 kernel-smp-2.4.9-7 failure on 2550

Woodcock, Steve SWoodcock at scholastic.co.uk
Mon Nov 5 07:42:00 CST 2001


Our 2550 fails to boot with the latest SMP kernel errata package
(kernel-smp-2.4.9-7). It prints:

  AAC: NMI ISR: NMI_DMA_0_ERROR

and then hangs. This happens just after the "Press 'I' to enter
interactive startup" message.

The non-smp kernel (kernel-2.4.9-7) boots fine.

I'm not sure how to proceed. I compared the dmesg output of the
2.4.7-10smp kernel (which we were running before the upgrade) with that
of the 2.4.9-7 non-smp kernel, which boots, and there are some
suspicious messages about shared and conflicting IRQs (for example
"IRQ routing conflict for 02:04.0, have irq 10, want irq 9" in the
output below). Could this be the problem?
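
For anyone wanting to repeat the comparison, here is a rough sketch of
one way to do it: keep only the lines of each dmesg capture that mention
IRQs, then diff the two lists. The function names, the "irq" match
pattern and the file names in the usage note are my own assumptions,
not part of any existing tool, and the match is deliberately loose so
it may pick up lines that are not relevant.

```python
import difflib
import re
import sys

# Assumption: any line containing "irq" (any case) counts as IRQ-related.
IRQ_PATTERN = re.compile(r"irq", re.IGNORECASE)


def irq_lines(dmesg_text):
    """Return the stripped lines of a dmesg capture that mention IRQs."""
    return [
        line.strip()
        for line in dmesg_text.splitlines()
        if IRQ_PATTERN.search(line)
    ]


def diff_irq_lines(old_text, new_text, old_name="old", new_name="new"):
    """Return a unified diff (list of lines) of the IRQ-related messages."""
    return list(
        difflib.unified_diff(
            irq_lines(old_text),
            irq_lines(new_text),
            fromfile=old_name,
            tofile=new_name,
            lineterm="",
        )
    )


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python irqdiff.py dmesg-old.txt dmesg-new.txt
    # (script and file names are hypothetical)
    with open(sys.argv[1]) as old_file, open(sys.argv[2]) as new_file:
        for line in diff_irq_lines(
            old_file.read(), new_file.read(), sys.argv[1], sys.argv[2]
        ):
            print(line)
```

Each capture can be saved with "dmesg > file" right after booting the
kernel in question. Lines prefixed with "+" in the result appear only
in the second capture, which is where a new sharing or routing-conflict
message would show up.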

Apart from that, the aacraid driver has changed from the "Sep 6 2001"
version to "Oct 18 2001".

Any clues would be appreciated.

Regards, Steve Woodcock

Here's the dmesg output for the 2.4.9-7 non-smp kernel:

Linux version 2.4.9-7 (bhcompile at stripples.devel.redhat.com) (gcc version 2.96 20000731 (Red Hat Linux 7.1 2.96-98)) #1 Thu Oct 18 13:47:25 EDT 2001
BIOS-provided physical RAM map:
 BIOS-e820: 0000000000000000 - 00000000000a0000 (usable)
 BIOS-e820: 0000000000100000 - 000000003fff0000 (usable)
 BIOS-e820: 000000003fff0000 - 000000003fffec00 (ACPI data)
 BIOS-e820: 000000003fffec00 - 000000003ffff000 (reserved)
 BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved)
 BIOS-e820: 00000000fee00000 - 00000000fee10000 (reserved)
 BIOS-e820: 00000000fff80000 - 0000000100000000 (reserved)
Scanning bios EBDA for MXT signature
127MB HIGHMEM available.
On node 0 totalpages: 262128
zone(0): 4096 pages.
zone(1): 225280 pages.
zone(2): 32752 pages.
Kernel command line: ro root=/dev/sda2
Initializing CPU#0
Detected 997.531 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1992.29 BogoMIPS
Memory: 1026696k/1048512k available (1721k kernel code, 20412k reserved, 91k data, 224k init, 131008k highmem)
Dentry-cache hash table entries: 131072 (order: 8, 1048576 bytes)
Inode-cache hash table entries: 65536 (order: 7, 524288 bytes)
Mount-cache hash table entries: 16384 (order: 5, 131072 bytes)
Buffer-cache hash table entries: 65536 (order: 6, 262144 bytes)
Page-cache hash table entries: 262144 (order: 9, 2097152 bytes)
CPU: Before vendor init, caps: 0383fbff 00000000 00000000, vendor = 0
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU: After vendor init, caps: 0383fbff 00000000 00000000 00000000
CPU:     After generic, caps: 0383fbff 00000000 00000000 00000000
CPU:             Common caps: 0383fbff 00000000 00000000 00000000
CPU: Intel Pentium III (Coppermine) stepping 0a
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.40 (20010327) Richard Gooch (rgooch at atnf.csiro.au)
mtrr: detected mtrr type: Intel
PCI: PCI BIOS revision 2.10 entry at 0xfc73e, last bus=3
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Discovered primary peer bus 01 [IRQ]
Unknown bridge resource 2: assuming transparent
PCI: Discovered primary peer bus 02 [IRQ]
PCI: Using IRQ router ServerWorks [1166/0200] at 00:0f.0
isapnp: Scanning for PnP cards...
isapnp: No Plug & Play device found
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
apm: BIOS not found.
mxt_scan_bios: enter
Starting kswapd v1.8
allocated 64 pages and 64 bhs reserved for the highmem bounces
VFS: Diskquotas version dquot_6.5.0 initialized
pty: 2048 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS MULTIPORT SHARE_IRQ SERIAL_PCI ISAPNP enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
Real Time Clock Driver v1.10e
block: queued sectors max/low 681530kB/550458kB, 2048 slots per queue
RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz PCI bus speed for PIO modes; override with idebus=xx
ServerWorks OSB4: IDE controller on PCI bus 00 dev 79
ServerWorks OSB4: chipset revision 0
ServerWorks OSB4: not 100% native mode: will probe irqs later
    ide0: BM-DMA at 0x08b0-0x08b7, BIOS settings: hda:DMA, hdb:pio
    ide1: BM-DMA at 0x08b8-0x08bf, BIOS settings: hdc:pio, hdd:pio
hda: TEAC CD-ROM CD-224E, ATAPI CD/DVD-ROM drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide-floppy driver 0.97.sv
Floppy drive(s): fd0 is 1.44M
FDC 0 is a National Semiconductor PC87306
ide-floppy driver 0.97.sv
md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27
md: Autodetecting RAID arrays.
md: autorun ...
md: ... autorun DONE.
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
IP: routing cache hash table of 8192 buckets, 64Kbytes
TCP: Hash tables configured (established 262144 bind 65536)
Linux IP multicast router 0.06 plus PIM-SM
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
RAMDISK: Compressed image found at block 0
Freeing initrd memory: 470k freed
VFS: Mounted root (ext2 filesystem).
SCSI subsystem driver Revision: 1.00
aacraid raid driver version, Oct 18 2001
PCI: Found IRQ 5 for device 02:02.1
PCI: Sharing IRQ 5 with 03:04.0
PCI: Found IRQ 5 for device 02:02.1
PCI: Sharing IRQ 5 with 03:04.0
percraid device detected
Device mapped to virtual address 0xf882e000
percraid:0 device initialization successful
percraid:0 AacHba_ClassDriverInit complete
PCI: Found IRQ 5 for device 02:02.1
PCI: Sharing IRQ 5 with 03:04.0
scsi0 : percraid
  Vendor: DELL      Model: PERCRAID RAID5    Rev: 0001
  Type:   Direct-Access                      ANSI SCSI revision: 02
  Vendor: DELL      Model: PERCRAID RAID5    Rev: 0001
  Type:   Direct-Access                      ANSI SCSI revision: 02
Attached scsi removable disk sda at scsi0, channel 0, id 0, lun 0
Attached scsi removable disk sdb at scsi0, channel 0, id 1, lun 0
SCSI device sda: 8388608 512-byte hdwr sectors (4295 MB)
sda: Write Protect is off
Partition check:
 sda: sda1 sda2 sda3
SCSI device sdb: 62700288 512-byte hdwr sectors (32103 MB)
sdb: Write Protect is off
 sdb: sdb1 < sdb5 sdb6 >
PCI: Found IRQ 11 for device 03:04.1
(scsi1) <Adaptec AIC-7899 Ultra 160/m SCSI host adapter> found at PCI 3/4/1
(scsi1) Wide Channel B, SCSI ID=7, 32/255 SCBs
(scsi1) Downloading sequencer code... 396 instructions downloaded
scsi1 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.2.4/5.2.0
       <Adaptec AIC-7899 Ultra 160/m SCSI host adapter>
blk: queue f773da18, I/O limit 4095Mb (mask 0xffffffff)
  Vendor: ARCHIVE   Model: Python 06408-XXX  Rev: 8130
  Type:   Sequential-Access                  ANSI SCSI revision: 03
blk: queue f7733818, I/O limit 4095Mb (mask 0xffffffff)
Journalled Block Device driver loaded
kjournald starting.  Commit interval 5 seconds
EXT3-fs: mounted filesystem with ordered data mode.
Freeing unused kernel memory: 224k freed
spurious 8259A interrupt: IRQ7.
Adding Swap: 2097136k swap-space (priority -1)
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on sd(8,2), internal journal
kjournald starting.  Commit interval 5 seconds
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on sd(8,21), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting.  Commit interval 5 seconds
EXT3 FS 2.4-0.9.11, 3 Oct 2001 on sd(8,22), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
st: Version 20010812, bufsize 32768, wrt 30720, max init. bufs 4, s/g segs 16
Attached scsi tape st0 at scsi1, channel 0, id 6, lun 0
0x378: FIFO is 16 bytes
0x378: writeIntrThreshold is 8
0x378: readIntrThreshold is 8
0x378: PWord is 8 bits
0x378: Interrupts are ISA-Pulses
0x378: ECP port cfgA=0x14 cfgB=0x40
0x378: ECP settings irq=<none or set by other means> dma=<none or set by other means>
parport0: PC-style at 0x378 (0x778) [PCSPP,TRISTATE,COMPAT,ECP]
parport0: irq 7 detected
parport0: cpp_daisy: aa5500ff(08)
parport0: assign_addrs: aa5500ff(08)
parport0: cpp_daisy: aa5500ff(08)
parport0: assign_addrs: aa5500ff(08)
ip_conntrack (8191 buckets, 65528 max)
eepro100.c:v1.09j-t 9/29/99 Donald Becker http://cesdis.gsfc.nasa.gov/linux/drivers/eepro100.html
eepro100.c: $Revision: 1.36 $ 2000/11/17 Modified by Andrey V. Savochkin <saw at saw.sw.com.sg> and others
PCI: Found IRQ 9 for device 02:04.0
IRQ routing conflict for 02:04.0, have irq 10, want irq 9
eth0: Intel Corporation 82557 [Ethernet Pro 100], 00:06:5B:1A:49:5B, IRQ 10.
  Receiver lock-up bug exists -- enabling work-around.
  Board assembly 07195d-000, Physical connectors present: RJ45
  Primary interface chip i82555 PHY #1.
  General self-test: passed.
  Serial sub-system self-test: passed.
  Internal registers self-test: passed.
  ROM checksum self-test: passed (0x04f4518b).
  Receiver lock-up workaround activated.
AAC:Batte