FFmpeg/doc/rate_distortion.txt

A Quick Description Of Rate Distortion Theory.

We want to encode a video, picture or piece of music optimally. What does
"optimally" really mean? It means that we want to get the best quality at a
given filesize OR we want to get the smallest filesize at a given quality
(in practice, these 2 goals are usually the same).

Solving this directly is not practical; for example, trying all byte
sequences 1 megabyte in length and selecting the "best looking" sequence
would mean 256^1000000 cases to try.
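
To get a feeling for the size of 256^1000000, the small sketch below counts
its decimal digits. It assumes 1 megabyte means 1000000 bytes, as the
exponent above suggests; the function name is made up for this illustration.

```python
import math

# Assumption: 1 megabyte = 1000000 bytes, each byte has 256 possible values.
FILE_BYTES = 1_000_000
VALUES_PER_BYTE = 256


def decimal_digits_of_search_space(file_bytes: int = FILE_BYTES) -> int:
    """Return the number of decimal digits of 256^file_bytes."""
    # digits(n) = floor(log10(n)) + 1, and log10(256^k) = k * log10(256)
    return math.floor(file_bytes * math.log10(VALUES_PER_BYTE)) + 1


if __name__ == "__main__":
    print(decimal_digits_of_search_space())
```

The count alone has more than two million digits, so enumerating the cases
is out of the question.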

But first, a word about quality, which is measured here as distortion: the
lower the distortion, the higher the quality. Distortion can be quantified
by almost any quality measurement one chooses. Commonly, the sum of squared
differences is used, but more complex methods that consider psychovisual
effects can be used as well. It makes no difference in this discussion.
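
As a minimal sketch of the sum of squared differences, the code below
compares an original list of sample values with a decoded one. The sample
values and the function name are invented for illustration and are not
taken from any codec.

```python
def sum_of_squared_differences(original, decoded):
    """Return the sum of squared differences between two equal-length sequences."""
    if len(original) != len(decoded):
        raise ValueError("both sequences must have the same length")
    return sum((a - b) ** 2 for a, b in zip(original, decoded))


if __name__ == "__main__":
    # Hypothetical pixel values before encoding and after decoding.
    original = [10, 20, 30, 40]
    decoded = [12, 19, 30, 37]
    print(sum_of_squared_differences(original, decoded))  # 4 + 1 + 0 + 9 = 14
```

A result of 0 means the decoded data is identical to the original; larger
values mean more distortion.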


First step: that rate distortion factor called lambda...
Let's consider the problem of minimizing:

  distortion + lambda*rate

rate is the filesize
distortion is the loss of quality
lambda is a fixed value chosen as a tradeoff between quality and filesize

Is this equivalent to finding the best quality for a given max
filesize? The answer is yes. For each filesize limit there is some lambda
factor for which minimizing the above will get you the best quality (using
your chosen quality measurement) at the desired (or lower) filesize.
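
The relation between lambda and a filesize limit can be sketched with a
handful of made-up encodings of the same input, each given as a
(rate, distortion) pair. The numbers, the trial lambdas and the function
names are assumptions for illustration only; the points were picked so that
each extra bit buys less and less quality, which is the well-behaved case.

```python
# Hypothetical (rate, distortion) results of encoding the same input 5 ways.
CANDIDATES = [(100, 90.0), (200, 50.0), (300, 30.0), (400, 20.0), (500, 15.0)]


def best_for_lambda(candidates, lam):
    """Return the candidate minimizing distortion + lam * rate."""
    return min(candidates, key=lambda c: c[1] + lam * c[0])


def best_for_budget(candidates, max_rate):
    """Return the lowest-distortion candidate whose rate fits within max_rate."""
    feasible = [c for c in candidates if c[0] <= max_rate]
    return min(feasible, key=lambda c: c[1])


def pick_lambda(candidates, max_rate, trial_lambdas):
    """Return the smallest trial lambda whose minimizer fits within max_rate."""
    fitting = [lam for lam in trial_lambdas
               if best_for_lambda(candidates, lam)[0] <= max_rate]
    return min(fitting) if fitting else None


if __name__ == "__main__":
    lam = pick_lambda(CANDIDATES, 350, [0.3, 0.15, 0.07, 0.03])
    # Both lines print the same candidate for this data.
    print(lam, best_for_lambda(CANDIDATES, lam))
    print(best_for_budget(CANDIDATES, 350))
```

A larger lambda punishes rate more and so selects a smaller file; lowering
lambda until the limit is just met gives the best quality that still fits.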


Second step: splitting the problem.
Directly splitting the problem of finding the best quality at a given
filesize is hard because we do not know how many bits from the total
filesize should be allocated to each of the subproblems. But the formula
from above:

  distortion + lambda*rate

can be trivially split. Consider:

  (distortion0 + distortion1) + lambda*(rate0 + rate1)

This is a problem made of 2 independent subproblems. The subproblems might
be, for example, two 16x16 macroblocks in a frame of 32x16 size. To minimize:

  (distortion0 + distortion1) + lambda*(rate0 + rate1)

we just have to minimize:

  distortion0 + lambda*rate0

and

  distortion1 + lambda*rate1

I.e., the 2 problems can be solved independently.
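
The split can be checked on a toy example. The sketch below assumes two
blocks, each with three made-up (rate, distortion) coding choices, and
compares minimizing each block on its own against trying every combination
of choices. All values and function names are hypothetical.

```python
from itertools import product

LAMBDA = 0.1

# Hypothetical (rate, distortion) choices for two independent macroblocks.
BLOCK0 = [(50, 40.0), (120, 22.0), (200, 15.0)]
BLOCK1 = [(30, 70.0), (90, 35.0), (260, 12.0)]


def cost(choice, lam):
    """distortion + lam * rate for a single choice."""
    rate, distortion = choice
    return distortion + lam * rate


def minimize_independently(blocks, lam):
    """Minimize distortionN + lam * rateN for each block on its own."""
    return [min(options, key=lambda c: cost(c, lam)) for options in blocks]


def minimize_jointly(blocks, lam):
    """Try every combination and minimize the summed formula directly."""
    def total(combo):
        return sum(c[1] for c in combo) + lam * sum(c[0] for c in combo)
    return list(min(product(*blocks), key=total))


if __name__ == "__main__":
    print(minimize_independently([BLOCK0, BLOCK1], LAMBDA))
    print(minimize_jointly([BLOCK0, BLOCK1], LAMBDA))
```

Both calls select the same choices, but the independent version looks at
3 + 3 options while the joint version has to look at 3 * 3 combinations.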

Author: Michael Niedermayer
Copyright: LGPL