Developer(s) | Julian Seward |
---|---|
Initial release | July 18, 1996 |
Stable release |
1.0.6 / September 20, 2010
|
Operating system | Cross-platform |
Type | Data compression |
License | BSD-like license |
Website | bzip |
Filename extension | .bz2 |
---|---|
Internet media type | application/x-bzip2 |
Type code | Bzp2 |
Uniform Type Identifier (UTI) | public.archive.bzip2 |
Magic number | BZh |
Developed by | Julian Seward |
Type of format | Data compression |
Open format? | Yes |
bzip2 is a free and open-source file compression program that uses the Burrows–Wheeler algorithm. It only compresses single files and is not a file archiver. It is developed and maintained by Julian Seward. Seward made the first public release of bzip2, version 0.15, in July 1996. The compressor's stability and popularity grew over the next several years, and Seward released version 1.0 in late 2000.
bzip2 compresses most files more effectively than the older LZW (.Z) and Deflate (.zip and .gz) compression algorithms, but is considerably slower. LZMA is generally more space-efficient than bzip2 at the expense of even slower compression speed, while having much faster decompression.
bzip2 compresses data in blocks of size between 100 and 900 kB and uses the Burrows–Wheeler transform to convert frequently-recurring character sequences into strings of identical letters. It then applies move-to-front transform and Huffman coding. bzip2's ancestor bzip used arithmetic coding instead of Huffman. The change was made because of a software patent restriction.
bzip2 performance is asymmetric, as decompression is relatively fast. Motivated by the large CPU time required for compression, a modified version was created in 2003 called pbzip2 that supported multi-threading, giving almost linear speed improvements on multi-CPU and multi-core computers. As of May 2010[update], this functionality has not been incorporated into the main project.