Disk problems?

jason andrade jason at rtfmconsult.com
Thu Jan 23 07:22:00 CST 2003


On Thu, 23 Jan 2003, Andy Stubbs wrote:

> I've been running a database on one of our machines - a PE2500 with a
> Perc3/Di RAID controller.

a nice machine.

> I have 2 containers - 1 mirrored contained (0:0:0 and 0:1:0) and one raid
> 5 container (1:3:0 1:4:0 and 1:5:0) for the DB data.
>
> However, there is an entry in the system log for the day before:
> Jan 20 03:49:10 db1 kernel: aacraid:Enclosure 0 - Temperature 168, over
> threshold 120

this happens a fair bit - coincidentally on our 2550 box (we've never
seen it on a 2450 or 2400) and i've been told that it's a pretty spurious
error (of course it's annoying because then how can you tell which errors
on overtemperature are meaningful.. from the box itself)

> Jan 22 16:59:44 db1 kernel: aacraid:ID(1:03:0); Error Event [command:0x37]
> Jan 22 16:59:44 db1 kernel: aacraid:ID(1:03:0); Recovered Error [k:0x1,c:0x1c,q:0x2]
> Jan 22 16:59:44 db1 kernel: aacraid:ID(1:03:0); Grown Defect List Not Found
> Jan 22 16:59:46 db1 kernel: aacraid:ID(1:04:0); Error Event [command:0x37]
> Jan 22 16:59:46 db1 kernel: aacraid:ID(1:04:0); Recovered Error [k:0x1,c:0x1c,q:0x2]
> Jan 22 16:59:46 db1 kernel: aacraid:ID(1:04:0); Grown Defect List Not Found
> Jan 22 16:59:48 db1 kernel: aacraid:ID(1:05:0); Error Event [command:0x37]
> Jan 22 16:59:48 db1 kernel: aacraid:ID(1:05:0); Recovered Error [k:0x1,c:0x1c,q:0x2]
> Jan 22 16:59:48 db1 kernel: aacraid:ID(1:05:0); Grown Defect List Not Found
>
> This looks like Bad News[tm].

yes and no.  do you have any other admins who might have been using afacli
at that time ?  it is a bit of a strange error in that it affects all of
the disks in that second container (only).

> AFA0> container list
> Executing: container list
> Num          Total  Oth Chunk          Scsi   Partition
> Label Type   Size   Ctr Size   Usage   B:ID:L Offset:Size
> - ----- ------ ------ --- ------ ------- ------ -------------
>  0    Mirror 16.9GB            Open    0:00:0 64.0KB:16.9GB
>  /dev/sda             db1_os           0:01:0 64.0KB:16.9GB
>
>  1    RAID-5 67.7GB        8KB Open    1:03:0 64.0KB:33.8GB
>  /dev/sdb             db1_db           1:04:0 64.0KB:33.8GB
>                                        1:05:0 64.0KB:33.8GB
>
>
> Can anybody shed some light on this? Or at least give me a couple of
> pointers as to how I can better diagnose the problem?

the above looks good to be.  you can also use the following commands:

disk list  (to see the disks in the system and if any have problems)
enclosure show status

regards,

-jason




More information about the Linux-PowerEdge mailing list