[FFmpeg-devel] [PATCH v4 1/1] lavc/aarch64: motion estimation functions in neon

Martin Storsjö martin at martin.st
Tue Jun 28 00:55:04 EEST 2022


On Sun, 26 Jun 2022, Swinney, Jonathan wrote:

> - ff_pix_abs16_neon
> - ff_pix_abs16_xy2_neon
>
> In direct micro benchmarks of these ff functions verses their C implementations,
> these functions performed as follows on AWS Graviton 3.
>
> ff_pix_abs16_neon:
> pix_abs_0_0_c: 141.1
> pix_abs_0_0_neon: 19.6
>
> ff_pix_abs16_xy2_neon:
> pix_abs_0_3_c: 269.1
> pix_abs_0_3_neon: 39.3
>
> Tested with:
> ./tests/checkasm/checkasm --test=motion --bench --disable-linux-perf
>
> Signed-off-by: Jonathan Swinney <jswinney at amazon.com>

Thanks! This looked great to me, and you seem to have applied all my 
feedback!

(There was one last case of "ldur" missing that I fixed up locally.)

With that fixed, I pushed this one now. Thanks!

// Martin



More information about the ffmpeg-devel mailing list