Drac crash my servers!

Joost Waversveld joost at waversveld.nl
Tue May 30 15:32:59 CDT 2006


We experienced the same with two 1850's, but our ERA/O modules still 
works (fortunately!)

It's not on all of our 1850's (yet!)....

This is strange....

Claudio De Luca wrote:
> Hello,
> 
> i have 18 Dell PowerEdge1850 and today every server is gone down with a 
> interval of 5 minutes, on the server i had the error HDF Irq timeout 
> status=0x80, we have reboot also all the server and now all the drac 
> card are died, someone have the same problem?
> 
> Log of my two server
> 
> May 30 18:36:47 mexes kernel: ide2: reset timed-out, status=0x80
> May 30 18:36:47 mexes kernel: hdf: status timeout: status=0x80 { Busy }
> May 30 18:36:47 mexes kernel: hdf: status timeout: 
> error=0x80LastFailedSense 0x08
> May 30 18:36:47 mexes kernel: hdf: drive not ready for command
> May 30 18:36:52 mexes kernel: hdf: status timeout: status=0x80 { Busy }
> May 30 18:36:52 mexes kernel: hdf: status timeout: 
> error=0x80LastFailedSense 0x08
> May 30 18:36:52 mexes kernel: hdf: drive not ready for command
> May 30 18:37:22 mexes kernel: hdf: ATAPI reset timed-out, status=0x80
> May 30 18:37:48 mexes kernel: audit(1149007068.589:15512): avc: denied { 
> getattr } for pid=6639 comm="mysqld" name="mysql.sock" dev=sda2 
> ino=363050 sconte
> xt=rootystem_r:mysqld_t tcontext=rootbject_r:var_lib_t tclass=sock_file
> May 30 18:37:52 mexes kernel: ide2: reset timed-out, status=0x80
> May 30 18:37:52 mexes kernel: cdrom: This disc doesn't have any tracks I 
> recognize!
> May 30 18:22:36 mexes syslogd 1.4.1: restart.
> May 30 18:22:36 mexes syslog: syslogd startup succeeded
> May 30 18:22:36 mexes kernel: klogd 1.4.1, log source = /proc/kmsg started.
> May 30 18:22:36 mexes kernel: Linux version 2.6.9-34.ELsmp 
> (bhcompile at hs20-bc1-7.build.redhat.com) (gcc version 3.4.5 20051201 (Red 
> Hat 3.4.5-2)) #1 SMP Fri
> Feb 24 16:54:53 EST 2006
> May 30 18:22:36 mexes kernel: BIOS-provided physical RAM map:
> May 30 18:22:36 mexes kernel: BIOS-e820: 0000000000000000 - 
> 00000000000a0000 (usable)
> May 30 18:22:36 mexes kernel: BIOS-e820: 0000000000100000 - 
> 000000007ffc0000 (usable)
> May 30 18:22:36 mexes kernel: BIOS-e820: 000000007ffc0000 - 
> 000000007ffcfc00 (ACPI data)
> May 30 18:22:36 mexes kernel: BIOS-e820: 000000007ffcfc00 - 
> 000000007ffff000 (reserved)
> May 30 18:22:36 mexes kernel: BIOS-e820: 00000000e0000000 - 
> 00000000fec90000 (reserved)
> May 30 18:22:36 mexes kernel: BIOS-e820: 00000000fed00000 - 
> 00000000fed00400 (reserved)
> May 30 18:22:36 mexes kernel: BIOS-e820: 00000000fee00000 - 
> 00000000fee10000 (reserved)
> May 30 18:22:36 mexes kernel: BIOS-e820: 00000000ffb00000 - 
> 0000000100000000 (reserved)
> May 30 18:22:36 mexes kernel: 1151MB HIGHMEM available.
> May 30 18:22:36 mexes kernel: 896MB LOWMEM available.
> May 30 18:22:36 mexes kernel: found SMP MP-table at 000fe710
> May 30 18:22:36 mexes syslog: klogd startup succeeded
> May 30 18:22:36 mexes kernel: NX (Execute Disable) protection: active
> May 30 18:22:36 mexes kernel: DMI 2.3 present.
> May 30 18:22:36 mexes kernel: Using APIC driver default
> May 30 18:22:36 mexes kernel: ACPI: PM-Timer IO Port: 0x808
> May 30 18:22:36 mexes kernel: ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] 
> enabled)
> May 30 18:22:36 mexes kernel: Processor #0 15:4 APIC version 20
> May 30 18:22:36 mexes kernel: ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] 
> enabled)
> May 30 18:22:36 mexes kernel: Processor #1 15:4 APIC version 20
> May 30 18:22:36 mexes kernel: ACPI: LAPIC (acpi_id[0x03] lapic_id[0x06] 
> disabled)
> May 30 18:22:36 mexes kernel: ACPI: LAPIC (acpi_id[0x04] lapic_id[0x07] 
> disabled)
> May 30 18:22:36 mexes kernel: ACPI: LAPIC_NMI (acpi_id[0x01] high edge 
> lint[0x1])
> May 30 18:22:36 mexes kernel: ACPI: LAPIC_NMI (acpi_id[0x02] high edge 
> lint[0x1])
> May 30 18:22:36 mexes kernel: ACPI: LAPIC_NMI (acpi_id[0x03] high edge 
> lint[0x1])
> May 30 18:22:36 mexes kernel: ACPI: LAPIC_NMI (acpi_id[0x04] high edge 
> lint[0x1])
> May 30 18:22:36 mexes kernel: Enabling APIC mode: Flat. Using 0 I/O APICs
> May 30 18:22:36 mexes kernel: ACPI: IOAPIC (id[0x02] address[0xfec00000] 
> gsi_base[0])
> May 30 18:22:36 mexes kernel: IOAPIC[0]: apic_id 2, version 32, address 
> 0xfec00000, GSI 0-23
> May 30 18:22:36 mexes kernel: ACPI: IOAPIC (id[0x03] address[0xfec80000] 
> gsi_base[32])
> May 30 18:22:36 mexes kernel: IOAPIC[1]: apic_id 3, version 32, address 
> 0xfec80000, GSI 32-55
> May 30 18:22:36 mexes kernel: ACPI: IOAPIC (id[0x04] address[0xfec83000] 
> gsi_base[64])
> May 30 18:22:36 mexes kernel: IOAPIC[2]: apic_id 4, version 32, address 
> 0xfec83000, GSI 64-87
> May 30 18:22:36 mexes kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 0 
> global_irq 2 dfl dfl)
> May 30 18:22:36 mexes kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 9 
> global_irq 9 high level)
> May 30 18:22:36 mexes kernel: ACPI: HPET id: 0xffffffff base: 0xfed00000
> May 30 18:22:36 mexes kernel: Using ACPI (MADT) for SMP configuration 
> information
> May 30 18:22:36 mexes kernel: Built 1 zonelists
> May 30 18:22:36 mexes kernel: Kernel command line: ro root=LABEL=/
> May 30 18:22:36 mexes kernel: Initializing CPU#0
> May 30 18:22:36 mexes kernel: CPU 0 irqstacks, hard=c03ea000 soft=c03ca000
> 
> 
> ------------------------------------------------------------------------
> 
> _______________________________________________
> Linux-PowerEdge mailing list
> Linux-PowerEdge at dell.com
> http://lists.us.dell.com/mailman/listinfo/linux-poweredge
> Please read the FAQ at http://lists.us.dell.com/faq



More information about the Linux-PowerEdge mailing list