[FFmpeg-devel] [PATCH v2] libswscale/aarch64: add another hscale specialization

Martin Storsjö martin at martin.st
Tue Aug 16 12:08:52 EEST 2022


On Sat, 13 Aug 2022, Swinney, Jonathan wrote:

> This specialization handles the case where filtersize is 4 mod 8, e.g.
> 12, 20, etc. AArch64 previously used the C function for this case.
> This implementation speeds up that case by roughly 4.1x (see below).
>
> hscale_8_to_15__fs_12_dstW_512_c: 6234.1
> hscale_8_to_15__fs_12_dstW_512_neon: 1505.6
>
> Signed-off-by: Jonathan Swinney <jswinney at amazon.com>
> ---
> libswscale/aarch64/hscale.S  | 107 +++++++++++++++++++++++++++++++++++
> libswscale/aarch64/swscale.c |  18 +++---
> 2 files changed, 117 insertions(+), 8 deletions(-)
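
For readers of the archive, the "4 mod 8" condition from the quoted
description can be sketched in C as below. This is only an illustration
and not the code from the patch: the enum labels and the function
classify_filter_size() are hypothetical names, and the assumption that
separate paths exist for a filter size of exactly 4 and for multiples
of 8 is mine; the patch description only states that the 4 mod 8 case
previously fell back to the C function.

```c
/* Hypothetical labels for illustration; not identifiers from libswscale. */
enum hscale_kind {
    HSCALE_FS_4,        /* assumed: filter size exactly 4 */
    HSCALE_FS_MULT_8,   /* assumed: filter size is a multiple of 8 */
    HSCALE_FS_4_MOD_8,  /* the case from the patch: 12, 20, 28, ... */
    HSCALE_FALLBACK_C   /* anything else stays on the generic C function */
};

/* Classify a filter size by its remainder modulo 8 (hypothetical helper). */
static enum hscale_kind classify_filter_size(int filter_size)
{
    if (filter_size == 4)
        return HSCALE_FS_4;
    if (filter_size > 0 && filter_size % 8 == 0)
        return HSCALE_FS_MULT_8;
    if (filter_size > 4 && filter_size % 8 == 4)
        return HSCALE_FS_4_MOD_8;
    return HSCALE_FALLBACK_C;
}
```

With this sketch, the fs_12 size used in the benchmark names above lands
in HSCALE_FS_4_MOD_8, as does 20, while a size such as 7 stays on the
fallback path.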

Thanks, this update looks fine to me, so I pushed it!

// Martin


