we've had very good results in our systems with drive failures failing to take down the raid.
The one time we did have a real problem it turned out that a fire in the ventilator unit right above our rack had resulted in coating the inside of the raid with soot. We brought it back, cleaned it up, and put it in the lab where it's still working fine. We /could/ put it back into production, but as a matter of policy once there's one failure the hardware just doesn't go back into production, no matter what.