Lossy compression


Lossy compression

A lossy compression method is one where compressing data and then decompressing it retrieves data that may well be different from the original, but is close enough to be useful in some way. Lossy compression is most commonly used to compress multimedia data (audio, video, still images), especially in applications such as streaming media and internet telephony. By contrast, lossless compression is required for text and data files, such as bank records, text articles, etc. Lossy compression formats suffer from generation loss: repeatedly compressing and decompressing the file will cause it to progressively lose quality.This is in contrast with lossless data compression.

Information-theoretical foundations for lossy data compression are provided by rate-distortion theory. Much like the use of probability in optimal coding theory, rate-distortion theory heavily draws on Bayesian estimation and decision theory in order to model perceptual distortion and even aesthetic judgment.

Types

There are two basic lossy compression schemes:

* In "lossy transform codecs", samples of picture or sound are taken, chopped into small segments, transformed into a new basis space, and quantized. The resulting quantized values are then entropy coded.

* In "lossy predictive codecs", previous and/or subsequent decoded data is used to predict the current sound sample or image frame. The error between the predicted data and the real data, together with any extra information needed to reproduce the prediction, is then quantized and coded.

In some systems the two techniques are combined, with transform codecs being used to compress the error signals generated by the predictive stage.

Lossy versus lossless

The advantage of lossy methods over lossless methods is that in some cases a lossy method can produce a much smaller compressed file than any known lossless method, while still meeting the requirements of the application.

Lossy methods are most often used for compressing sound, images or videos. This is because these types of data are intended for human interpretation where the mind can easily "fill in the blanks" or see past very minor errors or inconsistencies – ideally lossy compression is transparent (imperceptible), which can be verified via an ABX test.

Transparency

When a user acquires a lossily compressed file, (for example, to reduce download time) the retrieved file can be quite different from the original at the bit level while being indistinguishable to the human ear or eye for most practical purposes. Many compression methods focus on the idiosyncrasies of human physiology, taking into account, for instance, that the human eye can see only certain wavelengths of light. The psychoacoustic model describes how sound can be highly compressed without degrading perceived quality. Flaws caused by lossy compression that are noticeable to the human eye or ear are known as compression artifacts.

Compression ratio

The compression ratio (that is, the size of the compressed file compared to that of the uncompressed file) of lossy video codecs is nearly always far superior to that of the audio and still-image equivalents.

* Video can be compressed immensely (e.g. 300:1) with little visible quality loss;Fact|date=September 2008
* Audio can often be compressed at 10:1 with imperceptible loss of quality;Fact|date=September 2008
* Still images are often lossily compressed at 10:1, as with audio, but the quality loss is more noticeable, especially on closer inspection.Fact|date=September 2008

Transcoding and Editing

An important caveat about lossy compression is that converting (formally, transcoding) or editing lossily compressed files causes digital generation loss from the re-encoding. This can be avoided by only producing lossy files from (lossless) originals, and only editing (copies of) original files, such as images in raw image format instead of JPEG.

Lossless editing

Some lossless editing of lossily compressed files is possible, which consists of modifying the compressed data directly, without decoding and re-encoding.

JPEG

The primary programs for lossless editing of JPEGs are jpegtran, and the derived exiftran (which also preserves EXIF information), and [http://sylvana.net/jpegcrop/ Jpegcrop] (which provides a Windows interface).

These allow one to
* crop,
* rotate, flip, and flop, or
* convert to grayscale (by dropping the chrominance channel).

[http://sylvana.net/jpegcrop/jpegjoin/ JPEGjoin] allows one to join different JPEG images (which have the same encoding), without re-encoding. (See also: [http://sylvana.net/jpegcrop/jpegtran/ New jpegtran features] .)

One can also make some changes to the compression without re-encoding:
* optimize the compression (so it takes less space),
* convert between progressive and non-progressive encoding,

There is also the freeware Windows-only IrfanView, which has some lossless JPEG operations in its JPG_TRANSFORM plugin.

MP3

;Splitting and joining: Mp3splt and Mp3wrap (or AlbumWrap) allow one to split an MP3 file into pieces or join them loselessly. These are analogous to split and cat. [Though the wrap programs do more, encoding the divisions between the original files.] ;Gain: Various Replay Gain programs such as MP3gain allow one to modify the gain (overall volume) of MP3 files losslessly.

Metadata

One can generally modify or remove metadata, such as ID3 tags, Vorbis comments, or EXIF information, without modifying the underlying media.

Downsampling / compressed representation scalability

One may wish to downsample or otherwise decrease the resolution of the represented source signal and the quantity of data used for its compressed representation without re-encoding, as in bitrate peeling, but this functionality is not supported in all designs, as not all codecs encode data in a form that allows less important detail to simply be dropped.

Some well known designs that have this capability include JPEG 2000 for still images and H.264/MPEG-4 AVC based Scalable Video Coding for video. Actually such schemes have also been standardized for older designs as well, such as JPEG images with progressive encoding, and MPEG-2 and MPEG-4 Part 2 video, although those prior schemes had limited success in terms of adoption into real-world common usage.

Without this capacity, which is often the case in practice, to produce a representation with lower resolution or lower fidelity than a given one, one needs to start with the original source signal and encode, or start with a compressed representation and then decompress and re-encode it (transcoding), thought this latter tends to cause digital generation loss.

On a related point, some audio formats feature a combination of a lossy format and a lossless correction; this allows stripping the correction to easily obtain a lossy file, though whether the lossy portion itself can be further stripped is a separate question. Such formats include MPEG-4 SLS (Scalable to Lossless), WavPack, and OptimFROG DualStream.

Methods

Graphics

Image

* Cartesian Perceptual Compression: Also known as CPC
* DjVu
* Fractal compression
* HAM, hardware compression of color information used in Amiga computers
* ICER, used by the Mars Rovers: related to JPEG 2000 in its use of wavelets
* JPEG
* JPEG 2000, JPEG's successor format that uses wavelets.
* JBIG2
* PGF, Progressive Graphics File (lossless or lossy compression)
* Wavelet compression
* S3TC texture compression for 3D computer graphics hardware

Video

*H.261
*H.263
*H.264/MPEG-4 AVC
*MNG (supports JPEG sprites)
*Motion JPEG
*MPEG-1 Part 2
*MPEG-2 Part 2
*MPEG-4 Part 2
*Ogg Theora (noted for its lack of patent restrictions)
*Sorenson video codec
*VC-1

Audio

Music

*AAC
*ADPCM
*ATRAC
*Dolby AC-3
*MP2
*MP3
*Musepack
*Ogg Vorbis (noted for its lack of patent restrictions)
*WMA

Speech

* CELP
* G.711
* G.726
* Harmonic and Individual Lines and Noise (HILN)
* AMR (used by GSM cell carriers, such as T-Mobile)
* Speex (noted for its lack of patent restrictions)

Other data

Researchers have (semi-seriously) performed lossy compression on text by either using a thesaurus to substitute short words for long ones, or generative text techniques [cite web|title=Semantic and Generative Models for Lossy Text Compression |url=http://compression.ru/download/articles/text/witten_1994cj_lossy_text_compression.pdf |author=I. H. WITTEN, et al. |publisher=The Computer Journal |accessdate=2007-10-13 |format=PDF] , although these sometimes fall into the related category of lossy data conversion.

ee also

* Data compression
* Lossless data compression

* Compression artifact
* Rate–distortion theory

* List of codecs

* Lenna

Notes

External links

* [http://www.bobulous.org.uk/misc/lossy_audio_2006.html Lossy audio formats] , comparing the speed and compression strength of five lossy audio formats.
* [http://membled.com/work/apps/lossy_png/ lossy PNG image compression (research)]
* [http://www.websiteoptimization.com/speed/tweak/lossy/ using lossy GIF/PNG compression for the web (article)]


Wikimedia Foundation. 2010.

Look at other dictionaries:

  • Lossy compression — Compression Com*pres sion, n. [L. compressio: cf. F. compression.] 1. The act of compressing, or state of being compressed. Compression of thought. Johnson. [1913 Webster] 2. (Computers) reduction of the space required for storage (of binary… …   The Collaborative International Dictionary of English

  • lossy compression — los sy com*pres sion, n. (Computers) The compression of binary data into a form which, when it is re expanded, has most, but not all, of the original information. It is used primarily for compression of images and sounds, and is designed to… …   The Collaborative International Dictionary of English

  • Lossy Compression — Lossy Compression,   Datenreduktion …   Universal-Lexikon

  • lossy compression —    Any data compression method that compresses a file by discarding any data that the compression mechanism decides is not needed. Original data is lost when the file is decompressed. Lossy compression methods may be used for shrinking audio or… …   Dictionary of networking

  • lossy compression —    A process that reduces the storage space needed for an image file. If a digital image that has undergone lossy compression is decompressed, it will differ from the image before it was compressed (though this difference may be difficult for the …   Glossary of Art Terms

  • lossy compression — nuostolingasis glaudinimas statusas T sritis informatika apibrėžtis Duomenų ↑glaudinimo būdas, kai juos suspaudžiant prarandama dalis informacijos. Iš taip suglaudintų duomenų nebegalima visiškai tiksliai atkurti pirminio jų pavidalo. Tačiau… …   Enciklopedinis kompiuterijos žodynas

  • lossy compression — compression of graphical data in which some of the information is lost and the quality of the picture is damaged although the compression is very efficient …   English contemporary dictionary


Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”

We are using cookies for the best presentation of our site. Continuing to use this site, you agree with this.