[FFmpeg-devel] [PATCH 2/2] h264: integrate clear_blocks calls with IDCT.

Michael Niedermayer michaelni at gmx.at
Tue Feb 19 16:41:13 CET 2013


On Mon, Feb 18, 2013 at 09:03:02PM -0800, Ronald S. Bultje wrote:
> From: "Ronald S. Bultje" <rsbultje at gmail.com>
> 
> The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700
> to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb
> (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip
> tested (cathedral), i.e. almost 30 cycles per mb faster.
> ---
>  libavcodec/arm/h264idct_neon.S     |  29 +++++++---
>  libavcodec/h264.c                  |  14 ++---
>  libavcodec/h264_mb_template.c      |  11 +---
>  libavcodec/h264addpx_template.c    |   4 ++
>  libavcodec/h264dsp.c               |   4 +-
>  libavcodec/h264dsp.h               |   4 +-
>  libavcodec/h264idct_template.c     |  16 ++++--
>  libavcodec/h264pred.h              |   8 +--
>  libavcodec/h264pred_template.c     |  28 ++++++----
>  libavcodec/ppc/h264_altivec.c      |   3 ++
>  libavcodec/svq3.c                  |   4 +-
>  libavcodec/x86/h264_idct.asm       | 108 ++++++++++++++++++++++++++++---------
>  libavcodec/x86/h264_idct_10bit.asm |  53 ++++++++++++++++--
>  13 files changed, 209 insertions(+), 77 deletions(-)

applied

thanks

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

It is dangerous to be right in matters on which the established authorities
are wrong. -- Voltaire
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: Digital signature
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20130219/8238fb53/attachment.asc>


More information about the ffmpeg-devel mailing list