Resources for “Compression Tools Compared”

Resources for the print article.
______________________

Comments

rzip looked promising

Anonymous's picture

rzip looked promising. I started hacking on it here and there, mainly in main.c and stream.c, and removed its libbz2 dependency so that I can use it as a preprocessor before gzip.

I threw urandom data at this hacked variant and could not decompress what had been compressed at levels -0 and -1. Then I ran the original rzip-2.0 against the same urandom data, and it shows the same bug.

So the hint is: don't use version 2.0 (from rzip.samba.org or Debian's 2.0-2) with compression levels -0 and -1 for anything important without first checking that you can decompress the result. Other levels were fine with the same data, but I did not test them heavily.

Still, in 10 minutes it can cut 6538537833 B of mail+spam down to 4466144683 B at compression level 0 (this data decompresses cleanly) here, and that is the hacked version without bzip2, using only rzip's own part. The hacked variant without bzip2 at level -9 is terribly slow, though the system was swapping/thrashing/mmapping (comparing large chunks of data) heavily.
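The "check that you can decompress" advice can be automated with a small round-trip test. The sketch below is only an illustration: Python's standard library has no rzip binding, so it uses the gzip, bz2 and lzma modules as stand-ins, and the helper names `round_trips` and `check_levels` are made up for this example. For rzip itself the same idea would mean compressing a file, decompressing it, and comparing the result with the original.

```python
import bz2
import gzip
import lzma
import os


def round_trips(compress, decompress, data):
    """Return True if decompress(compress(data)) reproduces data exactly."""
    try:
        return decompress(compress(data)) == data
    except Exception:
        # A decoder error counts as a failed round trip.
        return False


def check_levels(data):
    """Test each stand-in codec (not rzip) and map a label to the result."""
    results = {}
    for level in (1, 9):
        results[f"gzip -{level}"] = round_trips(
            lambda d: gzip.compress(d, compresslevel=level),
            gzip.decompress,
            data,
        )
        results[f"bzip2 -{level}"] = round_trips(
            lambda d: bz2.compress(d, compresslevel=level),
            bz2.decompress,
            data,
        )
    results["xz"] = round_trips(lzma.compress, lzma.decompress, data)
    return results


if __name__ == "__main__":
    # Random bytes, similar in spirit to the urandom test described above.
    sample = os.urandom(1 << 20)
    for name, ok in check_levels(sample).items():
        print(f"{name}: {'ok' if ok else 'FAILED'}")
```

Random data is a useful test input here because it is essentially incompressible, which tends to exercise the code paths a compressor uses when it cannot shrink its input.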
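For reference, the byte counts quoted above can be turned into a compression ratio and a rough throughput figure. This is a quick sketch that takes "10 minutes" as exactly 600 seconds, which is an assumption, since the time given is only approximate.

```python
original_bytes = 6538537833    # mail+spam input size from the comment
compressed_bytes = 4466144683  # output size at compression level 0
elapsed_seconds = 10 * 60      # assumed: "10 minutes" taken literally

ratio = compressed_bytes / original_bytes
saved = 1 - ratio
throughput_mb_s = original_bytes / elapsed_seconds / 1e6

print(f"compressed size: {ratio:.1%} of original")
print(f"space saved: {saved:.1%}")
print(f"input throughput: {throughput_mb_s:.1f} MB/s")
```

So the rzip-only stage leaves about 68% of the original size, saving a little under a third, at roughly 11 MB of input per second.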

LZMA 0.03-1 source

Charles Hewson's picture

If anyone finds a working lzma, please post the URL and/or email me. I have found versions 0.03 and 0.04 on the sites above, but neither works as documented in the article. I have tried 2.4.26 and 2.6.11 kernels. The -x -s26 switches simply return the word "error" and do nothing. Without switches, 0.03 encodes but will not decode. I never got 0.04 to encode.

Oops - Forgot to Include Source Location ..

Graham K Jenkins's picture

The version I used was LZMA 4.27 on Fedora Core 4.

You can find it at: http://sourceforge.net/projects/sevenzip

Ciao ..

Working lzma

Graham K Jenkins's picture

I have compiled and used it on Fedora Core, using both 32-bit Intel and 64-bit AMD machines.

Hope this helps.
