RAID Technology
RAID Technology
What is RAID
• Redundant Array of Independent (Inexpensive)
Disks
• Appear to the operating system as a single
logical drive.
• Data are distributed over the stations
• Redundant capacity is used for parity allowing
for data repair
Levels of RAID
• 6 levels of RAID (0-5) have been accepted by
industry
• Other kinds have been proposed in literature
• Level 2 and 4 are not commercially available,
they are included for clarity
RAID 0
• All data (user and system) are distributed over
the disks so that there is a reasonable chance for
parallelism
• Disk is logically a set of strips (blocks, sectors,
…). Strips are numbered and assigned
consecutively to the disks (see picture.)
RAID 0:
RAID 0:
• Offers the best performance from all RAID
configurations.
• High data transfer rates are reached if
– Separate controllers and caches are used for
each disk.
• If response time is important (transactions) more
I/O requests can be handled in parallel
RAID 0
• RAID-0 is recommended for noncritical data
that
– requires high-speed reads and writes,
– is low cost, and
– is used in applications such as video or
image editing.
RAID 1
• RAID 1 does not use parity, it simply mirrors the
data to obtain reliability
• Plus:
– Reading request can be served by any of the two disks
containing the requested data (minimum search time)
– Writing request can be performed in parallel to the two
disks
– Recovery from error is easy, just copy the data from the
correct disk
RAID 1
RAID 1
• RAID 1 can reach high transfer rates and fast
response times if most of the requests are reading
requests. In case most requests are writing requests,
RAID 1 is not much faster than RAID 0.
• Price for disks is doubled
• Best for transaction-oriented, high-availability
environments and applications requiring high fault
tolerance, such as accounting or payroll.
RAID 2
• Uses striping across disks, with some disks storing
error checking and correcting (ECC) information.
• RAID 2 also uses a dedicated Hamming code parity,
a linear form of ECC, to correct single bit error.
• Synchronized disks, each I/O operation is
performed in a parallel way
• Controller can correct without additional delay
RAID 2
RAID 3
• Is still expensive, only used in case many
frequent errors can be expected
• RAID 3 needs only one, for one parity disks
• In case one disk crashes, the data can still be
reconstructed
• RAID 3 have high data transfer times, but perform
only one I/O at the time so that response times in
transaction oriented environments are not so good
RAID 3
RAID 3
• Most useful for environments where large
blocks of data would be read or written,
such as with image or video processing.
• Not well suited for transaction-oriented
applications.
RAID 4
• Larger strips and one parity disk
• Blocks are kept on one disk, allowing for
parallel access by multiple I/O requests
• Writing penalty: when a block is written,
the parity disk must be adjusted
• Parity disk may be a bottleneck
• Good response times, less good transfer
rates
RAID 4
RAID 5
• Distribution of the parity strip to avoid the bottle
neck.
• The array's architecture enables read and write
operations to span multiple drives. This results
in performance better than that of a single drive
• Compared with other RAID systems, RAID-5
offers the best protection for the least cost
RAID 5
RAID 5
• Because some requests can be serviced
concurrently, RAID-5 provides the best read
throughput of all the parity models and gives
acceptable throughput on write operations