What's Raid?

So what exactly is RAID? Nope, it's not the bug spray I'm talking about here. It is a technique that was developed to provide speed, reliability, and increased storage capacity using multiple disks, rather than single disk solutions. RAID basically takes multiple hard drives and allows them to be used as one large hard drive with benefits depending on the scheme or level of RAID being used. Depending on your needs, there are many different RAID variations and implementations available with prices ranging from less than $100 to over $25,000. Of course, the better the RAID implementation, the more expensive it's probably going to be. There is really no one best RAID implementation. Some implementations are better than others depending on the actual application.

 

Levels of Raid

RAID 0 has the best performance and capacity, but the lowest availability (no fault tolerance). If one drive fails, the entire array would fail because part of the data is missing without any ways in recovering them.
RAID 1 has the highest availability but lowest capacity, it requires using twice the number of drives . Performance is roughly the same as when using a single drive, although in some cases the dual write may be somewhat slower.
RAID 0+1 offers some performance improvements by combining striping and mirroring, but capacity is low since the mirror requires a duplicate set of drives.
RAID 3 has high performance and middle capacity, but the availability is lower when comparedtoRAID 1
RAID 5 has moderate benefits in all three areas. Read performance can be as fast as RAID 0, but write performance is slower, since the parity information must be calculated and written along with the data. Capacity is higher than for RAID 1 but it does not use striping, since the array use additional space for parity information, Availability is high with RAID 5 because of fault tolerance?If a drive fails, the missing data is recalculated from remaining operation drives.

 

DISK MIRRORING


DATA STRIPPING

 

Conclusion

So what have we learned here? Well we've learned that RAID is not just a bug spray. RAID is a good solution for companies or individuals craving more transfer performance, redundancy, and storage capacity in their data storage systems. There are many levels of RAID, which range from very simple and cheap to extremely complex and expensive. The benefits of having RAID in your system are obvious, however, RAID is not for everyone. Performance freaks like myself will always like what RAID has to offer, but the price tag of the better RAID implementations is still a hurdle to overcome. I tried to cover as much as I possibly could about what RAID is, it's benefits, and the various implementations. I tried not to focus on any specific company or hardware implementation, but looked at RAID in general. I hope you've enjoyed reading this article and maybe even learned something along the way. This isn't the definitive guide to RAID, but I hope it's helped you understand RAID more.

 

What is Seria ATA (SATA) ?

Basics

Serial ATA (SATA) is the next-generation interface standard for low-cost direct-attached storage in desktop PC, workstation, and entry-level server environments. As a serial technology (bits transmitted in a single stream, rather than along parallel paths) SATA eliminates the restrictions on performance, reliability, and scalability that are inherent in today?s parallel ATA (IDE) standard. Because SATA cost-effectively enables RAID protection, is easily scalable, and has a high performance roadmap, it will become the dominant direct-attach storage interface for budget-conscious users.

Storage Interface Evolution

When it was introduced nearly twenty years ago, parallel ATA, also known as IDE, provided a simple low-cost storage interface standard that met the performance and flexibility needs of desktop PCs. Concurrently, the more robust and costly SCSI interface evolved to fulfill the higher performance, reliability, and scalability requirements of enterprise-class applications. However, as CPU capabilities and the complexity of applications and data types accelerate, technology restrictions limit the future applicability of parallel interfaces. To better meet future processing needs, both ATA and SCSI are moving to more flexible and capable serial technology. Over time, SATA for the low-end and Serial-Attached SCSI (SAS) for the high-end will become the industry standard storage interfaces.

Parallel vs. Serial Interfaces
Interface Technology Transfer Rate Cabling Connectivity Connectivity
Current
Planned
ATA/IDE - Parallel - 133 MB/s - At max today - Wide ribbon
- 40-pin
- 18-inch length
- 2 drives per channel
- Master/slave relationship
- Shared bandwidth among drives
SATA - Serial - 150 MB/s - 600 MB/s - Thin, round ribbon
- 4-pin
- 1-meter length
- Single drive per channel
- Point-to-point connection
- Full bandwidth per drive
SCSI - Parallel - 320 MB/s - None planned - Wide, round ribbon
- 68-pin
- 12.5 meter (LVD) length
- Up to 15 devices per channel
SAS - Serial - 300 MB/s - 1200 MB/s - Thin, round ribbon
- 6-meter length
- 128 devices
- Expanders allow up to 16,000 devices

Why SATA?

SATA has been developed as a backward compatible, evolutionary replacement for ATA. Employing a serial technology version of the ATA design, SATA offers compelling technology, performance, and usability benefits for data-intensive applications in direct-attached storage environments. Within the next three years SATA will replace ATA/IDE as the low-cost interface-of-choice.

Serial ATA
Features Benefits
- High performance roadmap
(1.5 to 6.0 gigabits/sec)
- Scalable performance growth
- Lowest-cost per megabyte - Wide market appeal
- Command optimization - Makes SATA RAID more practical
- Point-to-point connections - Greater data reliability
- Full backward compatibility - Easier, faster, cheaper migration
- Single thin 1-meter cable - Greater flexibility; space savings
- Backplane connection Hot-plug/hot-swap flexibility - Hot-plug/hot-swap flexibility

When to Choose SATA

The low-cost/high-benefit nature of SATA makes it an ideal fit for budget-conscious desktop, high-end workstation, and entry-level server users whose application needs require high performance without either the additional robustness, or the external connectivity features of SCSI technologies.

Interface Cost Optimal Data Type Storage Environment Application Environments
ATA/IDE

 

 

- Low - Reference data: low frequency access, sequential data; e.g., file sharing, email, web, backup, archive - Internal DAS - Desktop PCs
SATA - Desktop PCs, workstations,
entry-level servers
SCSI - Moderate - High-frequency transactional & random access data; e.g., database, online purchases, OLTP, CRM - Internal DAS & External NAS/SAN - Mission critical enterprise servers, networked storage
SAS - Mission critical enterprise servers, large-scale networked storage

Glossary
ATA "Advanced Technology Attachment," a storage interface designed over 15 years ago and now the de facto I/O standard for desktop PCs. Though adequate for low data-demand applications, the combination of increased CPU capabilities, greater application through-put demands, and faster, more capable hard drives, severely limits the future usefulness of this interface.
Command Optimization Commands to a device are queued for immediate execution, without having to wait for responses, increasing performance and making RAID more practical.
CRM "Customer Resource Management" software.
IDE "Integrated Device Electronics," the current low-cost storage interface standard for desktop and portable PCs, synonymous with ATA.
NAS "Network Attached Storage," a storage design that connects a server to externally enclosed hard drives via a local area network.
OLTP "Online Transaction Processing" database.
Parallel Technology A design that allows a device (hard drive) to receive multiple bits of information at the same time. Parallel interfaces use short, wide cables carrying multiple signals, and pose inherent design limitations on data transfer speed and multiple device connections.
Point-to-Point Direct connection between the backplane and the storage device, allowing for the high-performance, full utilization of bandwidth.
SAN "Storage Area Network," a storage design that connects all the storage devices on a network with all the servers on a network for enhanced reliability and performance.
SAS "Serial Attached SCSI," the serial imple-mentation of the SCSI standard, providing greater flexibility, performance, reliability, and connectivity.
SATA "Serial ATA" is an evolutionary replacement for the Parallel ATA physical storage interface.
SCSI "Small Computer System Interface," the predominant storage I/O technology for high-reliability, high-performance server applications.
Serial Technology A design that allows data to be sent one bit at a time. Serial interfaces use thin cables, and are capable of faster speeds, greater reliability, and more flexibility in attaching multiple drives.

 


What's Hot-Swap?

The term "Hot Swap" refers to the common practice of either inserting, or removing SCSI disk drives in an operating bus, typically used in RAID subsystems or JBOD (just of a bunch of disks) environments. The ability to "Hot Swap" a disk drive is beneficial to customers. It allows them to remove potentially defective drives from the system, or upgrade capacity without having the inconvenience and expense of taking the entire system down to replace the drive. The ANSI documents cover this function under the chapter heading "Removal and Insertion of SCSI devices". Four distinct levels of functionality are defined in Table A.

The term "Hot Swap" is not actually defined in the ANSI standards, or the draft standards under development. It is interpreted as "the very restrictive Level 4 Removal and Insertion of disk drives." To avoid confusion, the two ter ms are linked together as "Level 4 Hot Swap."


The main difference between Level 4 and the easier levels is that the bus is allowed to operate (move data or operate in any legal SCSI bus phase). Since inserting a disk into any powered bus will result in some level of electrical transients, it is necessary to insure that those transients do not interfere with, or corrupt the control of data signals present on the bus.

Defining transfer rate
ATA-33: 33 MB/ sec.
ATA-66: 66 MB/ sec.
ATA-100: 100 MB/ sec.

 

 

What is SCSI? - Small Computer System Interface

SCSI (pronounced "scuzzy") stands for Small Computer System Interface, the technology that allows you to connect various internal and external devices to your PC or PC server. This connection is made using a SCSI card that fits inside your computer.

The Types of SCSI
Type Speed Hard drive/peripheral connections
Ultra320 SCSI
(16-bit Wide)
320 MByte/sec State-of-the-art hard drives
Ultra160 SCSI
(16-bit Wide)
160 MByte/sec Hard drives
Ultra2 SCSI
(16-bit Wide)
80 MByte/sec Hard drives
Ultra Wide SCSI
(16-bit Wide)
40 MByte/sec Hard drives and tape drives
Ultra SCSI
(8-bit Narrow)
20 MByte/sec CD-R, CD-RW, tape, removable storage (Jaz), and DVD drives
SCSI-2, Fast SCSI
(8-bit Narrow)
10 MByte/sec Scanners, Zip drives, and CD-ROM

The Benefits of SCSIPerformance

  • Supports up to 320 MByte/sec transfer rates per channel with Ultra320 SCSI
  • Connects high-performance devices such as hard disk drives, CD-RWs, and other high-speed peripherals to your PC

Connectivity

  • Connectivity for internal and external SCSI devices
  • Single SCSI card can connect up to 15 devices per channel

Compatibility

  • Accommodates previous generations of the same technology
  • SCSI allows older peripherals to co-exist with the latest technology without hampering speed or performance

Reliability

  • SCSI has traditionally been the most reliable choice for IT professionals with regard to data integrity, component failure, and product quality
  • SCSI presence as an I/O choice over many years attests to its continued execution

 

The Newest SCSI Features

Features added with Ultra320 SCSI:

  • 320 MByte/sec performance per channelPacket Protocol and its reduction in command overhead allows increased speed without bandwidth issuesQuick Arbitration Select (QAS) increases bus utilization by streamlining release and re-use of the bus by the various peripherals
  • Cyclic Redundancy Check (CRC) for all SCSI bus phases. CRC improves data integrity by detecting data integrity errors. Previous versions of SCSI only checked the data phase.

Server Technology Comparison
SATAFibre Channel SCSI
Best suited for Entry-level to mid-range serversServer-to-server, campus networks Mid-range to enterprise servers
Advantages Performance of first-generation Serial ATA products:

150 MByte/sec
Expected low cost, but still an emerging technology Performance: 200 MByte/sec

Hard drive reliability

Highest hard disk drive expandability

Performance:
320 MByte/sec per channel

High hard drive and peripheral reliability
Connectivity to the largest variety of peripherals

Expandability

Single-User Technology Comparison
USBATA SCSI
Best suited for Basic desktopBasic desktop Performance desktop/workstation
Advantages No added cost*

Easy, external connectivity for simple devices like joysticks, keyboards, mice, and entry-level scanners No added cost*

Industry-standard interface for connecting internal devices such as hard disk drives and CD-ROMs

Highest performance

Highest device reliability Connection to the largest variety of peripherals

Expandability

*ATA and USB connections are standard on all new WindowsR computer systems.

Glossary

Basic Input/Output System (BIOS): A motherboard BIOS controls the basic functions of the computer such as the keyboard, monitor, etc. The BIOS on a SCSI card is used to control SCSI hard disk drives and perform the hard disk boot function.

Bus Mastering: The ability to process SCSI commands on the SCSI card due to its built-in processor without using the system?s CPU.

Direct Memory Access (DMA): The fastest method of data transfer available for multitasking operating systems. Data is transferred from SCSI devices to system memory (RAM) via the SCSI card without using the system?s CPU.

Daisy Chain: A cable configuration in which internal and external SCSI devices such as hard drives, CDs, scanners, and tape drives are connected in a series to the SCSI card.

Input/Output (I/O): An operation, program, or device that enters data into or extracts data from a computer.

Low Voltage Differential (LVD): Introduced with Ultra2 SCSI, and further enhanced with Ultra160 and Ultra320 SCSI, LVD technology enables data transfer to 320 MByte/sec and supports cable lengths to 12 meters.

Quick Arbitration Select (QAS): An Ultra320 SCSI feature that improves the control release between SCSI devices, reducing command overhead.

Packet Protocol/Packetized SCSI: This Ultra320 SCSI feature speeds the transfer of data, command, and status packets over earlier generations of SCSI.

SCSI Bus: A host adapter and one or more SCSI peripherals connected by cables in a linear chain configuration. The host adapter may exist anywhere on the chain, allowing connection of both internal and external SCSI devices. A system may have more than one SCSI bus by using multiple host adapters.

SCSI ID: A unique identification number used for each device on the SCSI chain.

Termination: A feature that stops the data signal at the beginning and the end of the SCSI bus. The first and last devices on the SCSI bus must be terminated.

What's Fiber?

A high-speed, high-bandwidth serial protocol for channels and networks that interconnect over twisted-pair wires, coaxial cable or fiber optic cable. The "fabric" topology of Fiber Channel offers up to 16 million ports with cable lengths of up to 10 kilometers. SCSI will use the lower cost "Arbitrated Loop" topology (FC-AL) of Fiber Channel. FC-AL using fiber optic media offers speeds of up to 100 MBytes/sec and up to 127 ports all connected in serial with up to 25 meters between ports. Fiber Channel on copper wiring is available in several versions from 12.5 MBytes/sec with up to 100 meters of cable to 100 MBytes/sec with up to 25 meters of cable. Does not require ID switches or terminators. The FC-AL loop may be connected to a Fiber Channel "fabric" for connection to other nodes. SCSI on FC-AL will be expensive and will require some changes to software as well as hardware.

Fibre Channel SCSI

This refers to products with fibre channel physical and protocol layers using the SCSI command set. The Fibre Channel interface is completely different from parallel SCSI in that it is a serial interface, meaning command and data information is transmitted on one signal stream organized into packets. The fibre may be either a copper coaxial cable or a fiber optic cable. The signal on the first implementation of fibre channel uses a 1 GHz rate, thereby achieving 100 Mbytes/sec over the cable. Fibre channel also implements increased software control of configuration and pushes the total device count on the bus to 126 IDs, as opposed to only 8 or 16 on a parallel bus.

Data Transfer Defined:
SCSI I & II: 4 MB/ sec.
Fast SCSI: 10 MB/ sec.
Ultra SCSI & Fast Wide SCSI: 20 MB/ sec.
SCSI III (or Ultra Wide SCSI): 40 MB/ sec.
Wide ULTRA II SCSI (LVD): 80 MB/ sec.
ULTRA III & 160: 160MB/ sec.

Connector Interfaces
IDE: uses 40 pin connector
SCSI I: uses 25 pin connector
SCSI II: uses 50 pin connector
SCSI III: uses 68 pin connector
SCA or SCA2: uses 80 pin connector

 



How much memory do I need?

How can you tell when a server requires more memory? Quite often, the users of the network are good indicators. If network-related activity such as email, shared applications, or printing slows down, they'll probably let their Network Administrator know. Here are a few proactive strategies that can be used to gauge whether or not a server has sufficient memory:

  • Monitor server disk activity. If disk swapping is detected, it is usually a result of inadequate memory.

  • Most servers have a utility that monitors CPU, memory, and disk utilization. Review this at peak usage times to measure the highest spikes in demand.
Once it's determined that a server does need more memory, there are many factors to consider when deciding on how much is enough:

What functions does the server perform (application, communication, remote access, email, Web, file, multimedia, print, database)?

    Some servers hold a large amount of information in memory at once, while others process information sequentially. For example, a typical large database server does a lot of data processing; with more memory, such a server would likely run much faster because more of the records it needs for searches and queries could be held in memory - that is, "at the ready." On the other hand, compared to a database server, a typical file server can perform efficiently with less memory because its primary job is simply to transfer information rather than to process it.
What operating system does the server use?

    Each server operating system manages memory differently. For example, a network operating system (NOS) such as the Novell operating system handles information much differently than an application-oriented system such as Windows NT. Windows NT's richer interface requires more memory, while the traditional Novell functions of file and print serving require less memory.
How many users access the server at one time?

    Most servers are designed and configured to support a certain number of users at one time. Recent tests show that this number is directly proportional to the amount of memory in the server. As soon as the number of users exceeds maximum capacity, the server resorts to using hard disk space as virtual memory, and performance drops sharply. In recent studies with Windows NT, additional memory allowed an application server to increase by several times the number of users supported while maintaining the same level of performance.
What kind and how many processors are installed on the server?

    Memory and processors affect server performance differently, but they work hand in hand. Adding memory allows more information to be handled at one time, while adding processors allows the information to be processed faster. So, if you add processing power to a system, additional memory will enable the processors to perform at their full potential.
How critical is the server's response time?

    In some servers, such as Web or e-commerce servers, response time directly affects the customer experience and hence revenue. In these cases, some IT Managers choose to install more memory than they think they would ever need in order to accommodate surprise surges in use. Because server configurations involve so many variables, it's difficult to make precise recommendations with regard to memory. The following chart shows two server upgrade scenarios.

SERVER MEMORY MAP

WINDOWS® 2000 SERVER
Designed to help businesses of all sizes run better, Windows 2000 Server offers a manageable, reliable and internet-ready solution for today's growing enterprises. For optimal performance, consider adding more memory to take advantage of Windows 2000 Server's robust feature set. Windows 2000 Server is internet-ready and promises to run today's and tomorrow's applications better.

Baseline: 128MB
Optimal: 256MB - 1GB


Application Server Houses one or more applications to be accessed over a wide user base 256MB - 4GB
Directory ServerCentral Management of network resources 128MB - 1GB
Print ServerDistributes print jobs to appropriate printers 128MB - 512MB
Communication ServerManages a variety of communications such as PBX, Voicemail, Email, and VPN 512MB - 2GB
Web ServerInternet and intranet solutions 512MB - 2GB
Database ServerManages simple to complex databases of varying sizes 256MB - 4GB

 

Microsoft Windows ServerTM 2003
Memory Minimum Recommended Maximum
Standard 128MB 256MB 4GB
Enterprise 128MB 256MB
  • 32 GB for x86-based computers
  • 512 GB for Itanium-based computers*
  • Datacenter 512MB 1GB
  • 64 GB for x86-based computers
  • 512 GB for Itanium-based computers*
  • Web 128MB 256MB 2GB

    * Important: Per Microsoft, the 64-bit versions of Windows Server 2003, Enterprise Edition and Windows Server 2003, Datacenter Edition are only compatible with 64-bit Intel Itanium-based systems. They cannot be successfully installed on 32-bit systems.


    LINUX®
    Linux is a reliable, cost-effective alternative to traditional UNIX servers. Depending on the distribution, the Linux server platform features a variety of utilities, applications, and services.

    Baseline: 64MB - 128MB
    Optimal: 256MB - 1GB


    Application ServerHouses one or more applications to be accessed over a wide user base 64MB - 4GB
    Directory ServerCentral Management of network resources 128MB - 1GB
    Print ServerDistributes print jobs to appropriate printers 128MB - 512MB
    Communication ServerManages a variety of communications such as PBX, Voicemail, Email, and VPN 512MB - 2GB
    Web ServerInternet and intranet solutions 512MB - 2GB
    Database ServerManages simple to complex databases of varying sizes 256MB - 4GB



    * Please Note: These figures reflect work done in a typical server environment. Higher-end workstation tasks may require up to 4GB. Naturally, a chart such as this evolves as memory needs and trends change. Over time, developers of software and operating systems will continue to add features and functionality to their products. This will continue to drive the demand for more memory. More complex character sets, like Kanji, may require more memory than the standard Roman based (English) character sets.

     

    System Design Computers
    More questions, please contact our tech support:
    sdcom@sdcom.com

    SDCom.com, System Design Computers 1985 - 2004, All Rights Reserved