[FFmpeg-devel] lavc/aarch64: add simple idct neon functions

Matthieu Bouron matthieu.bouron at gmail.com
Thu Feb 23 17:59:16 EET 2017


Hello,

The following patchset adds the ff_simple_idct NEON functions for the
aarch64 platform. It is ported from the armv7 simple_idct_neon, with some improvements:
 * the source idct blocks are now loaded once and kept in v24-v31
 * the source idct blocks are no longer overwritten in idct_col4_top
 * the destination is now written in one pass at the end of
   ff_simple_idct{,_put,_add}_neon
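
For readers unfamiliar with the put/add variants, here is a minimal scalar C
sketch of what the final write pass does. It is not the NEON code from the
patch: the helper names (clip_u8, store_put_8x8, store_add_8x8) are
hypothetical, and it assumes 8-bit pixels and an 8x8 block of already
transformed 16-bit results.

```c
#include <stddef.h>
#include <stdint.h>

/* Clamp an intermediate value to the 8-bit pixel range [0, 255]. */
static uint8_t clip_u8(int v)
{
    return v < 0 ? 0 : (v > 255 ? 255 : (uint8_t)v);
}

/* "put" variant (hypothetical helper): overwrite dest with the clamped
 * IDCT output, one 8x8 block, rows separated by `stride` bytes. */
static void store_put_8x8(uint8_t *dest, ptrdiff_t stride,
                          const int16_t res[64])
{
    for (int y = 0; y < 8; y++)
        for (int x = 0; x < 8; x++)
            dest[y * stride + x] = clip_u8(res[y * 8 + x]);
}

/* "add" variant (hypothetical helper): add the IDCT output to the pixels
 * already in dest, then clamp the sum. */
static void store_add_8x8(uint8_t *dest, ptrdiff_t stride,
                          const int16_t res[64])
{
    for (int y = 0; y < 8; y++)
        for (int x = 0; x < 8; x++)
            dest[y * stride + x] =
                clip_u8(dest[y * stride + x] + res[y * 8 + x]);
}
```

Doing this store once, after both the row and column passes have finished,
is what "written in one pass" refers to above.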

It is bitexact with the armv7 neon implementation.

Here are some results (reported by {START,STOP}_TIMER) on an Odroid-C2
(Cortex-A53):

Functions             IDCT: simple       IDCT: simpleneon
ff_simple_idct_put     9795 units        3170 units
ff_simple_idct_add    10227 units        3302 units

Matthieu

