Solaris RAID Implementation

In 1987, Patterson, Gibson and Katz at the University of California Berkeley, published a paper entitled "A Case for Redundant Arrays of Inexpensive Disks (RAID)" . This paper described various types of disk arrays, referred to by the acronym RAID. The basic idea of RAID was to combine multiple small, inexpensive disk drives into an array of disk drives which yields performance exceeding that of a Single Large Expensive Drive (SLED). Additionally, this array of drives appears to the computer as a single logical storage unit or drive.

The Mean Time Between Failure (MTBF) of the array will be equal to the MTBF of an individual drive, divided by the number of drives in the array. Because of this, the MTBF of an array of drives would be too low for many application requirements. However, disk arrays can be made fault-tolerant by redundantly storing information in various ways.

Five types of array architectures, RAID-1 through RAID-5, were defined by the Berkeley paper, each providing disk fault-tolerance and each offering different trade-offs in features and performance. In addition to these five redundant array architectures, it has become popular to refer to a non-redundant array of disk drives as a RAID-0 array.

The basic idea behind RAID is that the array of drives appears to the computer as a single logical storage unit or volume.

RAID is a dominant enterprise configuration of disks in Solaris. Primary reasons to use RAID include:

There are six levels of RAID as well as a non-redundant array of independent disks (RAID 0). There are at least three different practically used RAID configurations that are often called levels 0, 1, 5. The Solaris Volume Manager software uses logical volumes (sets of disk slices), to implement RAID 0, RAID 1, and RAID 5:

RAID Level 0 -- "striping/concatenation". RAID Level 0 is not redundant, hence does not truly fit the "RAID" acronym. In level 0, data is split across drives, resulting in higher data throughput. Since no redundant information is stored, performance is very good, but the failure of any disk in the array results in data loss. RAID level 0, has the lowest cost of any RAID organization because it does not employ redundancy at all. This scheme offers good performance since it never needs to update redundant information. Surprisingly, it does not have the best performance in iether read or write oprations. Redundancy schemes that duplicate data, such as mirroring, can perform better on reads by selectively scheduling requests on the disk with the shortest expected seek and rotational delays. Without, redundancy, any single disk failure will result in data-loss. Non-redundant disk arrays are widely used in environments where performance and capacity, rather than reliability, are the primary concerns. The size of a data block, which is known as the "stripe width", varies with the implementation. When it comes time to read back data stored in RAID level 0, all disks can be read in parallel.
RAID Level 1 -- "mirroring". Mirroring remains in enterprise environment popular due to its simplicity and high level of data availability. It uses twice as many disks as a non-redundant disk array. Those two disk care called submirrors (it can be also more then two submirros, but this implmentation is used very unfrequently). Whenever data is written to a submirror A the same data is also written to submirror B, so that there are always two copies of the information. When data is read, it can be retrieved from the disk with the shorter seek and rotational delays. If a disk fails, the other copy is used to service requests. Mirroring is frequently used in database applications where availability and transaction time are more important than storage efficiency. The cost of disk storage per megabyte doubles in RAIL level 1.
RAID Level 5 -- provides fault tolerance by distributing parity information across some or all of an array's member disk drives. RAID-5 is a good choice in multi-user environments which are not write performance sensitive. However, at least three, and more typically five drives are required for RAID-5 arrays. Reads substantially outperfor small writes. Level 5 is often used with write-back caching to reduce the asymmetry. The block-interleaved distributed-parity eliminates the parity disk bottleneck by distributing the parity information uniformly over all of the disks. Another advantage to distributing the parity information is that it also distributes data over all of the disks rather than over all but one. This allows all disks to participate in servicing read operations in contrast to redundancy schemes with dedicated parity disks in which the parity disk cannot participate in servicing read requests. Block-interleaved distributed-parity disk array have the best small read, large write performance of any redundancy disk array. Small write requests are inefficient compared with mirroring due the need to perform read-modify-write operations to update parity. This is the major performance weakness of RAID level 5 disk arrays.

Hardware RAID

The hardware based system manages the RAID subsystem independently from the host and presents to the host only a single disk per RAID array. This way the host doesn't have to be aware of the RAID subsystems(s).

Software RAID

Under Solaris both SVM and Veritas Volume Manager offer RAID-0/1 and 5. Special and pretty complex driver is needed to implement software RAID solution. This is more error prone and less compatible then hardware based solutions, especially Fiber Channel based, but it is cheaper.

Just like any other application, software-based arrays occupy host system memory, consume CPU cycles and are operating system dependent. By contending with other applications that are running concurrently for host CPU cycles and memory, software-based arrays degrade overall server performance. Also, unlike hardware-based arrays, the performance of a software-based array is directly dependent on server CPU performance and load.

Except for the array functionality, hardware-based RAID schemes have very little in common with software-based implementations. Since the host CPU can execute user applications while the array adapter's processor simultaneously executes the array functions, the result is true hardware multi-tasking. Hardware arrays also do not occupy any host system memory, nor are they operating system dependent.

Hardware arrays are also highly fault tolerant. Since the array logic is based in hardware, software is NOT required to boot. Some software arrays, however, will fail to boot if the boot drive in the array fails. For example, an array implemented in software can only be functional when the array software has been read from the disks and is memory-resident. What happens if the server can't load the array software because the disk that contains the fault tolerant software has failed? Software-based implementations commonly require a separate boot drive, which may be included or not in the array.

NEWS CONTENTS

News

Softpanorama May the source be with you, but remember the KISS principle ;-)	Home	Switchboard	Unix Administration	Red Hat	TCP/IP Networks	Neoliberalism	Toxic Managers
	(slightly skeptical) Educational society promoting "Back to basics" movement against IT overcomplexity and bastardization of classic Unix

Old News ;-)	Books/Certification books	Certification	Recommended Links	Reference	Selected Blueprints	Selected man pages	RAID Levels
FAQs	Mirroring Root Filesystem	RAID 0 volumes (striping)	RAID 1 volumes (mirroring)	RAID 5 volumes	RAID 0+1	Humor	Etc

Top Visited <p>Your browser does not support iframes.</p>					Switchboard
					Latest
					Past week
					Past month

RAID Level	Min. Num of Drives	Description	Strengths	Weaknesses
Raid 0	2	Data striping without redundancy	Highest performance	No data protection; One drive fails, all data is lost
Raid 1	2	Disk mirroring	Very high performance; Very high data protection; Very minimal penalty on write performance	High redundancy cost overhead; Because all data is duplicated, twice the storage capacity is required
Raid 2	Not Used In LAN	No practical use	Previously used for RAM error environments correction (known as Hamming Code ) and in disk drives before the use of embedded error correction	No practical use; Same performance can be achieved by RAID 3 at lower cost
Raid 3	3	Byte-level data striping with dedicated parity drive	Excellent performance for large, sequential data requests	Not well-suited for transaction-oriented network applications; Single parity drive does not support multiple, simultaneous read and write requests
Raid 4	3 (not widely used	Block-level data striping with dedicated parity drive	Data striping supports multiple simultaneous read requests	Write requests suffer from same single parity-drive bottleneck as RAID 3; RAID 5 offers equal data protection and better performance at same cost
Raid 5	3	Block-level data striping with distributed parity	Best cost/performance for transaction-oriented networks; Very high performance, very high data protection; Supports multiple simultaneous reads and writes; Can also be optimized for large, sequential requests	Write performance is slower than RAID 0 or RAID 1
Raid 0/1	4	Combination of RAID 0 (data striping) and RAID 1 (mirroring)	Highest performance, highest data protection (can tolerate multiple drive failures)	High redundancy cost overhead; Because all data is duplicated, twice the storage capacity is required; Requires minimum of four drives

Solaris RAID Implementation

Hardware RAID

Software RAID

NEWS CONTENTS

News

SVM specifics

* concat/stripe logical devices (metadevices)

** State database replicas

Recommended Links

Reference

RAID Level

Min. Num of Drives

Description

Strengths

Weaknesses

Raid 0

Raid 1

Raid 2

Raid 3

Raid 4

Raid 5

Raid 0/1

Etc