[FFmpeg-devel] [PATCH 6/9] x86: simple_idct_put: 10bits versions
Michael Niedermayer
michael at niedermayer.cc
Tue Oct 13 15:32:23 CEST 2015
On Mon, Oct 12, 2015 at 07:37:47PM +0200, Christophe Gisquet wrote:
> Modeled from the prores version. Clips to [0;1023] and is bitexact.
> Bitexactness requires to add offsets in different places compared to
> prores or C, and makes the function approximately 2% slower.
>
> For 16 frames of a DNxHD 4:2:2 10bits test sequence:
>
> C: 60861 decicycles in idct, 1048205 runs, 371 skips
> sse2: 27567 decicycles in idct, 1048216 runs, 360 skips
> avx: 26272 decicycles in idct, 1048171 runs, 405 skips
>
> The pure and add versions are not implemented, so the corresponding dsp
> functions are set to NULL to make it clear in a code executing them.
> ---
> libavcodec/x86/Makefile | 1 +
> libavcodec/x86/idctdsp_init.c | 20 +++++++++++++++
> libavcodec/x86/simple_idct.h | 3 +++
> libavcodec/x86/simple_idct10.asm | 54 ++++++++++++++++++++++++++++++++++++++++
> 4 files changed, 78 insertions(+)
> create mode 100644 libavcodec/x86/simple_idct10.asm
applied
thanks
[...]
--
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
Freedom in capitalist society always remains about the same as it was in
ancient Greek republics: Freedom for slave owners. -- Vladimir Lenin
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 181 bytes
Desc: Digital signature
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20151013/0eb99776/attachment.sig>
More information about the ffmpeg-devel
mailing list