Biomedical Research and Linux
The boom in biomedical research over the past decade has occurred as a result of increased demands for better health care and greater understanding of the biomolecular mechanisms responsible for many diseases. Fueling many of the scientific advances in medicine have been the corresponding developments in computational processing power and speed. In addition to the large-scale data collection and processing required to understand complex biological and physiological processes, our ability to analyze and model medical phenomena at microscopic and macroscopic levels is reliant upon the systems used for these computationally intensive jobs. Linux systems are receiving recognition as ideal workhorses for performing many of these tasks.
To better understand the emerging biomedical applications of Linux, it is first necessary to examine the typical computational requirements and resources of many biomedical research centers. The popular perception of biology and medicine is that they are wet and gushy sciences, involving little or no quantitative or numerical analysis. In fact, this couldn't be farther from the truth. There is a concerted effort among scientists to bring the biomedical sciences into the realm of other quantitative disciplines. This effort is illustrated by the recent emergence or increasing prominence of fields such as bioinformatics and genomic-sequence databases, biomolecular modeling and computer-aided drug design, diagnostic medical imaging, epidemiology and biostatistical analyses of patient populations, as well as medical informatics and patient record databases.
The systems used by researchers in these fields span a spectrum of different platforms. One typically finds an abundant assortment of Intel and Power PC-based systems running the Windows and Mac operating systems for performing basic tasks such as text processing, routine data analysis, e-mail and Internet access. In addition, the recent trend among manufacturers has been to interface a broad range of biomedical research equipment with PCs for the ease of GUI instrument control and data collection. The choice of which platform should be employed for such interfaces is usually the decision of the equipment manufacturer and not the researcher.
Laboratories may also contain a few Unix workstations which are used to process large jobs, drive complex instrumentation or function as servers. Typically these machines are dedicated to specific tasks and are unavailable for general use.
One unfortunate aspect of this heterogeneous situation is that it creates headaches for system administrators. Such a wide variety of systems performing a multitude of different tasks and operated by a large number of users with a wide range of computer abilities, frequently spells disaster.
Linux serves as an ideal system for biomedical research laboratories, where there is an abundance of PCs, strict limits on the allocation of research funds and a need for significant processing power in conjunction with straightforward and flexible system customization. The stability of Linux, its cost-effectiveness and large user community make it attractive to both novice and experienced system administrators confined by tight research computing budgets. In addition, the ability of the various incarnations of Linux to run on a wide variety of platforms provides a uniform OS and networking capabilities for the mixture of different machines already in existence at many biomedical research centers.
Utilization of Linux systems is emerging within the Biomolecular Structure Group here at Harvard Medical School. Intel-based systems running Red Hat Linux have recently been implemented for simultaneous control and data collection from x-ray diffraction instruments (Figure 1). Individual data sets obtained with these instruments are about 100MB in size and are used to understand the atomic structure of biomolecules and for drug-design studies at the molecular level. Although the source code for driving and collecting data on the x-ray instrument had to be recompiled for Linux and the kernel was rebuilt, the result was a very stable system for performing this specialized and complex task. In this instance, the Linux system has successfully replaced a more expensive Silicon Graphics workstation.
Granted, the above application of Linux is rather specific, but Linux is also gaining more widespread usage among individual researchers. Installation of Linux on PCs in conjunction with either the Windows or Mac operating systems gives users the choice of the appropriate system with which to complete a job, while sitting at the same machine. For example, in a laboratory with an abundance of Macintoshes, the recently developed MkLinux for Power PC has proven to be beneficial in providing greater access to remote Unix systems and applications. More individuals are now able to use popular GUIs for genomic databases, bioinformatics applications, and biomolecular visualization software located on remote servers, while taking advantage of the multitasking capabilities of Linux (Figure 2). In addition, through implementation of a simple Tcl/Tk script, general users have the ability to reboot the machine back into Mac OS with a click of a button. In some cases, individuals are using the Linux systems as nothing more than X terminals. However, this use alone is valuable, since it provides increased access to applications located on a limited number of servers without having to purchase additional X terminals or X terminal emulation software.
Linux is bound to have a more prominent role in biomedical research computing in the future. The recent installations of Linux systems described above have already been recognized as highly successful and even superior to other systems, in terms of both price and performance. As the computing demands of biomedical research increase, scientists, system administrators, software developers and instrument manufacturers will all be considering Linux.
|Designing Electronics with Linux||May 22, 2013|
|Dynamic DNS—an Object Lesson in Problem Solving||May 21, 2013|
|Using Salt Stack and Vagrant for Drupal Development||May 20, 2013|
|Making Linux and Android Get Along (It's Not as Hard as It Sounds)||May 16, 2013|
|Drupal Is a Framework: Why Everyone Needs to Understand This||May 15, 2013|
|Home, My Backup Data Center||May 13, 2013|
- Nice article, thanks for the
4 min 3 sec ago
- I once had a better way I
5 hours 50 min ago
- Not only you I too assumed
6 hours 7 min ago
- another very interesting
8 hours 27 sec ago
- Reply to comment | Linux Journal
9 hours 53 min ago
- Reply to comment | Linux Journal
16 hours 47 min ago
- Reply to comment | Linux Journal
17 hours 4 min ago
- Favorite (and easily brute-forced) pw's
18 hours 55 min ago
- Have you tried Boxen? It's a
1 day 47 min ago
- seo services in india
1 day 5 hours ago
Enter to Win an Adafruit Pi Cobbler Breakout Kit for Raspberry Pi
It's Raspberry Pi month at Linux Journal. Each week in May, Adafruit will be giving away a Pi-related prize to a lucky, randomly drawn LJ reader. Winners will be announced weekly.
Fill out the fields below to enter to win this week's prize-- a Pi Cobbler Breakout Kit for Raspberry Pi.
Congratulations to our winners so far:
- 5-8-13, Pi Starter Pack: Jack Davis
- 5-15-13, Pi Model B 512MB RAM: Patrick Dunn
- 5-21-13, Prototyping Pi Plate Kit: Philip Kirby
- Next winner announced on 5-27-13!
Free Webinar: Hadoop
How to Build an Optimal Hadoop Cluster to Store and Maintain Unlimited Amounts of Data Using Microservers
Realizing the promise of Apache® Hadoop® requires the effective deployment of compute, memory, storage and networking to achieve optimal results. With its flexibility and multitude of options, it is easy to over or under provision the server infrastructure, resulting in poor performance and high TCO. Join us for an in depth, technical discussion with industry experts from leading Hadoop and server companies who will provide insights into the key considerations for designing and deploying an optimal Hadoop cluster.
Some of key questions to be discussed are:
- What is the “typical” Hadoop cluster and what should be installed on the different machine types?
- Why should you consider the typical workload patterns when making your hardware decisions?
- Are all microservers created equal for Hadoop deployments?
- How do I plan for expansion if I require more compute, memory, storage or networking?