[FFmpeg-devel] [PATCH 5/9] x86: simple_idct10_template: fix overflow in pass

Michael Niedermayer michael at niedermayer.cc
Tue Oct 13 02:26:58 CEST 2015


On Mon, Oct 12, 2015 at 07:37:46PM +0200, Christophe Gisquet wrote:
> When the input of a pass has 15 or 16 bits of precision (in particular
> the column pass), the addition of a bias to W4 may lead to overflows
> in the input to pmaddwd.
> 
> This requires postponing the addition of the bias until after the first
> butterfly. To do so, the patch exploits the fact that m15 is zeroed but
> otherwise unused. If the pass is safe, an address can be used directly
> and the number of xmm registers can be reduced. Otherwise, the 32-bit
> bias is loaded into m15.
> ---
>  libavcodec/x86/proresdsp.asm              |  8 ++++----
>  libavcodec/x86/simple_idct10_template.asm | 13 ++++++++++++-
>  2 files changed, 16 insertions(+), 5 deletions(-)

How can I reproduce these overflows?
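
For reference, the kind of wraparound the commit message describes can be
shown in isolation with a scalar model. This is only a sketch under stated
assumptions, not the code from the patch: it assumes the bias is added with
a wrapping 16-bit add (as paddw does) to a coefficient before the multiply,
and the helper names (paddw_wrap, pmaddwd_lane, bias_early, bias_late) and
the W4 value passed in are hypothetical, chosen for illustration.

```c
#include <stdint.h>

/* Wrapping signed 16-bit add, modelling one lane of paddw. */
static int16_t paddw_wrap(int16_t a, int16_t b)
{
    uint16_t s = (uint16_t)((uint16_t)a + (uint16_t)b);
    /* Map the unsigned result back to two's complement portably. */
    return s >= 0x8000 ? (int16_t)((int32_t)s - 0x10000) : (int16_t)s;
}

/* One 32-bit lane of pmaddwd: two signed 16x16 products, summed. */
static int32_t pmaddwd_lane(int16_t a0, int16_t a1, int16_t b0, int16_t b1)
{
    return (int32_t)a0 * b0 + (int32_t)a1 * b1;
}

/* Assumed "before" behaviour: the bias is folded into the 16-bit input,
 * so an input with 16 bits of precision can wrap before the multiply. */
static int32_t bias_early(int16_t coeff, int16_t bias, int16_t w4)
{
    return pmaddwd_lane(paddw_wrap(coeff, bias), 0, w4, 0);
}

/* Assumed "after" behaviour: multiply first, then add the already
 * scaled bias in 32 bits, where there is enough headroom. */
static int32_t bias_late(int16_t coeff, int16_t bias, int16_t w4)
{
    return pmaddwd_lane(coeff, 0, w4, 0) + (int32_t)bias * w4;
}
```

With coeff = 32767, bias = 1 and w4 = 16384, bias_early() returns
-536870912 while bias_late() returns 536870912; for small inputs such as
coeff = 100 the two agree. Whether real streams can drive the column pass
to such values is exactly what a reproducing sample would have to show.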

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Opposition brings concord. Out of discord comes the fairest harmony.
-- Heraclitus