Compression algorithms

Prepress files are often huge so it is no more than logical that data are often compressed. There are quite a few compression algorithms that can be used for both text and images. A basic knowledge about how the different algorithms work can be worthwhile.

These pages give an overview of the various compression algorithms that are used in the prepress industry. It is by no means a complete overview of all available algorithms.

The following types of compression are documented in more detail:

I have not yet documented:

  • CCITT group 3 & 4
  • JBIG

Types of compression algorithms

The above algorithms can be divided into two distinct categories: they are either lossless or lossy.

  • Lossless algorithms do not change the content of a file. If you compress a file and then decompress it, it has not changed. The following algorithms are lossless:
    • CCITT group 3 & 4 compression
    • Flate/deflate compression
    • Huffman compression
    • LZW compression
    • RLE compression
  • Lossy algorithms achieve better compression ratio’s by selectively getting rid of some of the information in the file. Such algorithms can be used for images or sound files but not for text or program data. The following algorithms are lossy:
    • JPEG compression

Which algorithm is best?

Unfortunately there is no fixed answer to that question. It all depends on the type of file that has to be compressed as well as the actual content of the file and the question whether you are willing to accept a lossy algorithm for that particular file. As a general rule, here is what most people in prepress use or find appropriate:

  • text: often not worth compressing, sometimes RLE is used
  • line-art images : LZW
  • line-art and screened data: CCITT group 4
  • grayscale images: LZW for files that do not contain too much detail
  • color images : JPEG if lossy compression is acceptable
  • vector drawings: not supported by applications

Add a Comment