2650 - PERC 3/Di and failed drive

Craig White craigwhite at azapple.com
Sat Apr 28 22:03:56 CDT 2007


I spent about 2.5 hours on the phone w/ Dell tech support (plus about 4
more hours sandwiched around the call) and hit a dead end.

I have a 5-disk RAID 5 array (2650 - PERC 3/Di). One drive failed and I
can't get it back online. In the PERC Array Config Utility, the 0:0
drive reports 'Missing Member'.

afacli shows...

AFA0> container list
Executing: container list
Num          Total  Oth Chunk          Scsi   Partition
Label Type   Size   Ctr Size   Usage   B:ID:L Offset:Size
----- ------ ------ --- ------ ------- ------ -------------
 0    RAID-5  273GB       32KB Open    ?:??:?  - Missing -
 /dev/sda                              0:01:0 64.0KB:68.3GB
                                       0:02:0 64.0KB:68.3GB
                                       0:03:0 64.0KB:68.3GB
                                       0:04:0 64.0KB:68.3GB


AFA0> disk list
Executing: disk list

B:ID:L  Device Type     Blocks    Bytes/Block Usage            Shared Rate
------  --------------  --------- ----------- ---------------- ------ ----
0:00:0   Disk            143374650 512         Initialized      NO     160
0:01:0   Disk            143374650 512         Initialized      NO     160
0:02:0   Disk            143374650 512         Initialized      NO     160
0:03:0   Disk            143374650 512         Initialized      NO     160
0:04:0   Disk            143374650 512         Initialized      NO     160

AFA0> disk show space
Executing: disk show space

Scsi B:ID:L Usage      Size
----------- ---------- -------------
  0:00:0     Free      64.0KB:68.3GB
  0:01:0     Container 64.0KB:68.3GB
  0:01:0     Free      68.3GB:7.50KB
  0:02:0     Container 64.0KB:68.3GB
  0:02:0     Free      68.3GB:7.50KB
  0:03:0     Container 64.0KB:68.3GB
  0:03:0     Free      68.3GB:7.50KB
  0:04:0     Container 64.0KB:68.3GB
  0:04:0     Free      68.3GB:7.50KB

AFA0> container show failover
Executing: container show failover

Container Scsi B:ID:L
--------- ----------------------------------
  0       0:00:0
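
As a rough sanity check on the sizes reported above, the numbers look
consistent with a five-member RAID 5 set in which one member's worth of
space goes to parity. The sketch below is plain arithmetic, not anything
afacli prints; it assumes that afacli's "GB" means 2**30 bytes, and the
function names are made up for illustration.

```python
# Sanity check of the sizes afacli reports.
# Assumption: "GB" in the afacli output means 2**30 bytes.

BLOCKS_PER_DISK = 143374650   # "Blocks" column from "disk list"
BYTES_PER_BLOCK = 512         # "Bytes/Block" column from "disk list"
MEMBER_SIZE_GB = 68.3         # per-disk partition size from "container list"
MEMBERS = 5                   # 5-disk RAID 5, counting the missing 0:00:0


def disk_size_gb(blocks, block_size):
    """Raw disk capacity in units of 2**30 bytes."""
    return blocks * block_size / 2**30


def raid5_usable_gb(member_size_gb, members):
    """RAID 5 usable space: one member's worth of capacity holds parity."""
    return (members - 1) * member_size_gb


if __name__ == "__main__":
    # Raw disk size, slightly larger than the 68.3GB partition on each disk.
    print(round(disk_size_gb(BLOCKS_PER_DISK, BYTES_PER_BLOCK), 1))
    # Usable array size, to compare with the 273GB shown for container 0.
    print(round(raid5_usable_gb(MEMBER_SIZE_GB, MEMBERS), 1))
```

So the 273GB container matches four data members of 68.3GB each, which
is what a healthy 5-disk RAID 5 would report; the array geometry itself
looks intact and only the 0:00:0 member is absent.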

I went through a number of exercises, including ejecting the disk and
re-inserting it, and shutting down, removing the disk, starting up,
shutting down, inserting the disk and starting up again, but the disk
never shows any interest in coming online again.

The 'container show failover' output shows the 'missing' drive set as a
failover drive. That is the result of a series of instructions from Dell
tech support, which I believe went like this...

disk blink (0,0,0) 15
enclosure prepare slot 0 0
# at which point I would eject, wait 10 secs and reinsert drive
container list
disk show space
container rescan # which always fails...seems that command isn't supported here
disk remove dead_partitions (0,0,0)
container set failover 0 (0,0,0)

That leaves me in the state you see above: the 'spare' drive is never
assigned as the 'missing member' of the RAID, and thus my array remains
'degraded'.

The Dell technician suggested I copy everything off to a hard drive and
reset the array, but it's rather crappy that the first drive belch I get
on this system takes the RAID array out.

Anyone have suggestions?

Craig



More information about the Linux-PowerEdge mailing list