[Ffmpeg-cvslog] r5507 - trunk/libavcodec/flac.c

Wed Jun 21 07:41:35 CEST 2006

On Wed, 21 Jun 2006, Justin Ruggles wrote:
> Loren Merritt wrote:
>>
>> Overflow is not an error. For 16bit audio, if qlevel<=16, then bit #32 of
>> the sum will never be used, no matter whether it's stored in a 64bit
>> variable or just discarded.
>> OK, so maybe the check should be if(s->bps + qlevel > 32)
>
> The thing that concerns me is that while the audio itself is guaranteed
> to be signed 16 (or 17)-bit, the level-shifted predicted value (final
> sum) is not.  A bad predictor can generate >32-bit values.  An encoder
> would be unlikely to use parameters which cause this due to them not
> giving good compression, but it is not constrained by the spec.  For
> example, in the FLAC reference encoder there is a compile-time option to
> check for the range of this value.  If turned on, it uses a 64-bit sum
> and generates a warning if the value is >32-bit.  In my encoder, I
> always use a 64-bit sum and return an error value (instead of issuing a
> warning) to indicate not to encode with those parameters.  Neither of
> these solutions is in the spec though.  An encoder would still be within
> spec if it chose not to do any check, keep the predicted value & even
> the final residual in 64-bit storage, and encode the result in the FLAC
> file.

That's all true. Nevertheless, it's still ok for the decoder to ignore 
overflow.

FLAC does not define any clipping. If the predictor (after shifting) 
is >16bit, then the encoder must generate a residual that brings the final 
reconstructed sample back into the valid range of 16bit audio. Otherwise 
it wouldn't be lossless.
So since you know that the high order bits of the predictor and of the 
residual must cancel, you can discard both. This is equivalent to storing 
the reconstructed samples in an array of int16_t.

--Loren Merritt