Getting Serial: The Ultimate Linux Box S-ATA Disk Subsystem
One of the big advantages of Linux is you don't need all that much buck to get maximum bang. With the advent of serial ATA disk drives and controllers, I thought we might undertake this approach in hardware as well. Street prices on comparable RAID controllers--I compared the 3Ware Escalade 8500-4, which we use here, to an Adaptec 2400--seem to be fairly close. The new Ultra 320 controllers were double those prices. Likewise with the disk drives; 73GB-class SCSI drives seemed to run at least double their serial ATA brethren, and the speed didn't appear to matter. So let's see how much bang we can get for half the bucks.
The folks at Monarch Computer Systems sent me a serial ATA testbed system: a nice Dual Athlon 2800+ system with 2GB of RAM running Red Hat 9, the 3Ware Escalade 8500-4 drive and three Seagate 80GB S-ATA-150 drives in RAID 5. It also has an ATI FireGL X1 you'll hear about in a subsequent article.
The 3Ware 8500-4 is short as RAID cards go; it's only slightly longer than the end of the 64-bit PCI slot that holds it. The geography of the card is different as well; instead of the connectors for the drive cables going on the top edge of the card (as they would for parallel ATA), they go on the end of the card, away from the bracket--the one closest to the drives. This makes routing the cables easier. Of course, the cables themselves are much smaller than regular parallel ATA cables, not much different in size from a piece of CAT5 network cable. The drives themselves look similar to their parallel cousins, except on the business end. There it looks a lot like the parallel and power connectors have each shrunk into something resembling the dongle slot for a PCMCIA Ethernet card. You can hear a slight but definite clunk as the cable achieves its desired position in either drive or card slot. There aren't any worries about whether the cable is all the way in, as there can be with parallel drives. On the other hand, whatever is holding the connector in doesn't hold it so well that removing it is difficult. Monarch had installed adapters so the standard ATX Molex connectors could power the drives.
3Ware has been quite responsive to the Linux community over the last few years. They have made sure the latest stable drivers end up in the mainstream kernel; the Monarch box was running a Red Hat standard 2.4.20 kernel. In addition, both Web-based and scriptable command-line monitoring tools are downloadable from the 3Ware web site.
And when the rubber meets the road, the serial ATA system holds its own. I compared tiobench numbers between it and the Dell Precision system I reviewed for print recently, which had an Ultra 320 RAID controller and drives, as a benchmark. I was surprised to find the SCSI system was beaten in a few areas, despite being designed to be twice as fast. Serial ATA sequential reads were the biggest factor; while on a single thread, the SCSI drives turned in 95MB/s to S-ATA's 54. When you ramped the thread count up, S-ATA decayed gracefully to 29MB/s through 8 threads, while the SCSI system plummeted immediately to the low 20s at 2 threads and stayed there. On random reads, the S-ATA machine started slower--1.0MB/s vs. 1.5MB/s--but as the thread count increased, it came out slightly ahead, 2.4 to 2.1MB/s. On random reads, more threads means statistically less distance to move for each read; your performance actually goes up. Writing was where the performance difference represented by the dollar cost showed up; the SCSI system wrote a consistent 33-35MB/s across the thread counts, where the S-ATA wrote 10-13MB/s. It was a similar story for random, where the SCSI turned in numbers in the 5MB/s range, whereas S-ATA ran 0.46MB/s.
A few words about configuration are in order here: neither of these systems is completely optimal. The Dell SCSI RAID box is RAID 0, meaning writing is slow in an unaccelerated environment. Conversely, the Monarch S-ATA box has RAID 5 on three drives, when my experience says four drives is the minimum optimal configuration for 3Ware RAID 5 systems. I expect that if we had both systems in full-out RAID 5 configurations, the S-ATA system would give SCSI a thorough thrashing on read, while putting up respectable numbers on write. Again, this is an educated guess; there's no substitute for testing a system against the real world (or as close as you can get without risking your nest egg).
The results are fairly clear: serial ATA is some fairly serious bang for not a lot of bucks. Prices from Froogle, Google's merchandise-finding page, showed the Seagate drives at $145 a pop and the 3Ware 8500-4 at $350 or so. And with 160MB and larger S-ATA disks around the corner and 300MB/s S-ATA controllers on the roadmap, serial-ATA RAID seems to be well on its way to becoming the best way to get a lot of cheap, fast storage in one Ultimate Linux Box.
The roadmap from here to the final Ultimate Linux Box looks like this: next time, we look at the ATI FireGL graphics card, followed by things concerned with sound (I have not forgotten!). I believe the final choice of motherboard and processor will be a welcome surprise, pointing the way to the future of Linux.
Realizing the promise of Apache® Hadoop® requires the effective deployment of compute, memory, storage and networking to achieve optimal results. With its flexibility and multitude of options, it is easy to over or under provision the server infrastructure, resulting in poor performance and high TCO. Join us for an in depth, technical discussion with industry experts from leading Hadoop and server companies who will provide insights into the key considerations for designing and deploying an optimal Hadoop cluster.
Sponsored by AMD
Built-in forensics, incident response, and security with Red Hat Enterprise Linux 6
Every security policy provides guidance and requirements for ensuring adequate protection of information and data, as well as high-level technical and administrative security requirements for a system in a given environment. Traditionally, providing security for a system focuses on the confidentiality of the information on it. However, protecting the data integrity and system and data availability is just as important. For example, when processing United States intelligence information, there are three attributes that require protection: confidentiality, integrity, and availability.
Learn more about catching the bad guy in this free white paper.
Sponsored by DLT Solutions
| Designing Electronics with Linux | May 22, 2013 |
| Dynamic DNS—an Object Lesson in Problem Solving | May 21, 2013 |
| Using Salt Stack and Vagrant for Drupal Development | May 20, 2013 |
| Making Linux and Android Get Along (It's Not as Hard as It Sounds) | May 16, 2013 |
| Drupal Is a Framework: Why Everyone Needs to Understand This | May 15, 2013 |
| Home, My Backup Data Center | May 13, 2013 |
- I once had a better way I
36 min 30 sec ago - Not only you I too assumed
53 min 53 sec ago - another very interesting
2 hours 46 min ago - Reply to comment | Linux Journal
4 hours 40 min ago - Reply to comment | Linux Journal
11 hours 34 min ago - Reply to comment | Linux Journal
11 hours 50 min ago - Favorite (and easily brute-forced) pw's
13 hours 41 min ago - Have you tried Boxen? It's a
19 hours 33 min ago - seo services in india
1 day 5 min ago - For KDE install kio-mtp
1 day 5 min ago
Enter to Win an Adafruit Pi Cobbler Breakout Kit for Raspberry Pi

It's Raspberry Pi month at Linux Journal. Each week in May, Adafruit will be giving away a Pi-related prize to a lucky, randomly drawn LJ reader. Winners will be announced weekly.
Fill out the fields below to enter to win this week's prize-- a Pi Cobbler Breakout Kit for Raspberry Pi.
Congratulations to our winners so far:
- 5-8-13, Pi Starter Pack: Jack Davis
- 5-15-13, Pi Model B 512MB RAM: Patrick Dunn
- 5-21-13, Prototyping Pi Plate Kit: Philip Kirby
- Next winner announced on 5-27-13!
Featured Jobs
| Linux Systems Administrator | Houston and Austin, Texas | Host Gator |
| Senior Perl Developer | Austin, Texas | Host Gator |
| Technical Support Rep | Houston and Austin, Texas | Host Gator |
| UX Designer | Austin, Texas | Host Gator |
| Web & UI Developer (JavaScript & j Query) | Austin, Texas | Host Gator |
Free Webinar: Hadoop
How to Build an Optimal Hadoop Cluster to Store and Maintain Unlimited Amounts of Data Using Microservers
Realizing the promise of Apache® Hadoop® requires the effective deployment of compute, memory, storage and networking to achieve optimal results. With its flexibility and multitude of options, it is easy to over or under provision the server infrastructure, resulting in poor performance and high TCO. Join us for an in depth, technical discussion with industry experts from leading Hadoop and server companies who will provide insights into the key considerations for designing and deploying an optimal Hadoop cluster.
Some of key questions to be discussed are:
- What is the “typical” Hadoop cluster and what should be installed on the different machine types?
- Why should you consider the typical workload patterns when making your hardware decisions?
- Are all microservers created equal for Hadoop deployments?
- How do I plan for expansion if I require more compute, memory, storage or networking?



Comments
Re: Getting Serial: The Ultimate Linux Box S-ATA Disk Subsystem
As long, of course, that you don't have to do any random writes. So that just about rules it out for file/print, web, database, multi-user application, mail server, anything write intensive, etc., as compared to scsi performance.
That's one limited Ultimate Linux Box.
Maybe it's time for new subcategories under Ultimate GNU/Linux Box:
RAID levels
He really should've been clearer on the difference between the RAID levels. RAID 5 usually performs very poorly on writes compared to RAID 0.
Its pretty silly to compare systems running different RAID levels actually.
Also - why do you suggest 4 + drives for a RAID 5 array on 3ware cards? Any specific problems, or does it simply perform better?
What drives do you use with your Escalade?
I'm using Western Digital 40g and 120g drives with a 3ware Escalade 8500-8 and having serious problems. The disks are connected via Highpoint SATA converters.
The controller constantly reports bad verifies and runs an array rebuild. It's not happy.
So... what disks are you using?
Oh be quiet: His comparison was useless
The comparison was worthless: The SATA system was a raid5.. so with random writes every write required two random reads, and two random writes. Furthermore, on three disks there is no room for writes to operating concurently in raid5 (because every operation will touch two drives). The scsi system was RAID0 so was able to run multiple commands at once.
The article was interesting but it wasn't a comparison. A real, apples to apples, comparison is obviously in order.
Re: Oh be quiet: His comparison was useless
Absolutly Write! Can't compare SCSI Raid 0 v SATA Raid 5. I bet a SATA Raid 0 array v SCSI Raid 5 would reverse the numbers!
I agree a test needs to be done, but not such a flawed test.
Re: Oh be quiet: His comparison was useless
absolutely. completely. 0 vs. 5 is the major issue here, not scsi vs. ata.