Hard lockups on PE 6600 (SOLVED - maybe) [also PE2x50 lockups and pe2605 console hang]

Ben Russo ben at umialumni.com
Mon Nov 11 09:47:00 CST 2002


I have a Dell 2650 running RedHat Linux 7.2
It has an on board DELL factory RAID controller
and 3 72GB Disks.

I am using the RAID monitoring scripts in a cronjob
that I found a link to on Matt's site.  

But they are not reporting
any problems...  for example:

[root at app1 RAID]# cat raid.check.commands
open afa0
logfile start raid.current.config
container list
disk list
logfile end
exit
[root at app1 RAID]# cat raid.current.config
File raid.current.config receiving all output.

AFA0>
COMMAND: container list
Executing: container list
Num          Total  Oth Chunk          Scsi   Partition
Label Type   Size   Ctr Size   Usage   B:ID:L Offset:Size
----- ------ ------ --- ------ ------- ------ -------------
 0    Mirror 33.8GB            Open    0:00:0 64.0KB:33.8GB
 /dev/sda             Root-Mirror-d0,1 0:01:0 64.0KB:33.8GB


AFA0>
COMMAND: disk list
Executing: disk list

B:ID:L  Device Type     Blocks    Bytes/Block Usage            Shared
Rate
------  --------------  --------- ----------- ---------------- ------
----
0:00:0   Disk            71132959  512         Initialized      NO    
160
0:01:0   Disk            71132959  512         Initialized      NO    
160
0:02:0   Disk            71132959  512         Initialized      NO    
160

AFA0>
COMMAND: logfile end
Executing: logfile end



However this morning I found this in the syslog file:

MON NOV 11 05:43:14
kern  Alert  
kernel: AAC:ID(0:00:0); Error Event [command:0x28]

MON NOV 11 05:43:14  
kern  Alert  
kernel: AAC:ID(0:00:0); Medium Error [k:0x3,c:0x11,q:0x0]

MON NOV 11 05:43:14
kern  Alert  
kernel: AAC:ID(0:00:0); Unrecovered Read Error

MON NOV 11 05:43:18  
kern  Alert  
kernel: AAC:ID(0:00:0); Error Event [command:0x28]

MON NOV 11 05:43:18
kern  Alert  
kernel: AAC:ID(0:00:0); Medium Error [k:0x3,c:0x11,q:0x0]

MON NOV 11 05:43:18
kern  Alert
kernel: AAC:ID(0:00:0) Medium Error, LBN Range 38366208:38366335

MON NOV 11 05:43:18
kern  Alert  
kernel: AAC:ID(0:00:0) Starting BBR sequence

MON NOV 11 05:43:18
kern  Alert
kernel: AAC:ID(0:00:0); Unrecovered Read Error



Do I have to worry about these?  What is a BBR Sequence?
Or is this just a bad block on the disk that has been remapped
and I can ignore it so long as it is a one time event?

-Ben.




More information about the Linux-PowerEdge mailing list