LJ March 2010 - Troubleshooting Part 1

Troubleshooting Part 1 - great article, looking forward to the series.

I’m a 20+ year IT veteran and a 10+ year linux user but I’m still trying to get my head around this ‘Load Average’ concept.

Currently contracting to a large Australian telco on capacity planning their media assets and have been looking into load average more closely. Between us we might be able to get to the bottom of it. (I’m aware you have limited space to explain a topic some could write a thesis on but here goes anyway…)

Re your statement;
‘If I have a load of 1, the cpu is busy enough that 1 process is having to wait for cpu time’.

I have been using the philosophy that load average corresponds to the number of cpu’s, so on a single cpu system a load average of 1 equals a system running at 100%. It would need to exceed a load average of 1 in order to have waiting cpu processes. I also don’t believe the load average corresponds directly to the number of waiting processes – a load average of 2 does not mean 2 waiting processes.

The other issue comes from virtualisation technologies. One of the drivers here is to increase utilisation of the large number of physical servers that have low cpu utilisation. So does a load average of 1 (or 4 for a quad cpu system) mean high load or optimal performance? Is it good to run a system at 100% cpu for extended periods? My experience says no, but virtualisation goals say yes.

Comments welcome.

Nice point about the Linux File Cache, lost count of the number of times I’ve had to explain this one.

AlanG

References
http://www.linuxjournal.com/article/9001

http://www.teamquest.com/resources/gunther/display/5/

Webcast
How to Build an Optimal Hadoop Cluster to Store and Maintain Unlimited Amounts of Data Using Microservers

Realizing the promise of Apache® Hadoop® requires the effective deployment of compute, memory, storage and networking to achieve optimal results. With its flexibility and multitude of options, it is easy to over or under provision the server infrastructure, resulting in poor performance and high TCO. Join us for an in depth, technical discussion with industry experts from leading Hadoop and server companies who will provide insights into the key considerations for designing and deploying an optimal Hadoop cluster.

Learn More

Sponsored by AMD

White Paper
Red Hat White Paper: Using an Open Source Framework to Catch the Bad Guy

Built-in forensics, incident response, and security with Red Hat Enterprise Linux 6

Every security policy provides guidance and requirements for ensuring adequate protection of information and data, as well as high-level technical and administrative security requirements for a system in a given environment. Traditionally, providing security for a system focuses on the confidentiality of the information on it. However, protecting the data integrity and system and data availability is just as important. For example, when processing United States intelligence information, there are three attributes that require protection: confidentiality, integrity, and availability.

Learn more about catching the bad guy in this free white paper.

Learn More

Sponsored by DLT Solutions