[FFmpeg-devel] [PATCH v4 0/7] avfilter/vf_bwdif: Add aarch64 neon functions
John Cox
jc at kynesim.co.uk
Tue Jul 4 17:04:38 EEST 2023
Also adds a filter_line3 method which on aarch64 neon yields approx 30%
speedup over 2xfilter_line and a memcpy
Differences from v3:
Remove a few lines of neon in filter_line that should have been removed
when copying from line3
Sorry about the two patch sets in quick succession, but I think I've
applied all the requested changes and I didn't want this mistake in the
final patchset. (The mistake was benign - it just wasted a few cycles.)
John Cox (7):
tests/checkasm: Add test for vf_bwdif filter_intra
avfilter/vf_bwdif: Add neon for filter_intra
tests/checkasm: Add test for vf_bwdif filter_edge
avfilter/vf_bwdif: Add neon for filter_edge
avfilter/vf_bwdif: Add neon for filter_line
avfilter/vf_bwdif: Add a filter_line3 method for optimisation
avfilter/vf_bwdif: Add neon for filter_line3
libavfilter/aarch64/Makefile | 2 +
libavfilter/aarch64/vf_bwdif_init_aarch64.c | 125 ++++
libavfilter/aarch64/vf_bwdif_neon.S | 788 ++++++++++++++++++++
libavfilter/bwdif.h | 20 +
libavfilter/vf_bwdif.c | 70 +-
tests/checkasm/vf_bwdif.c | 172 +++++
6 files changed, 1162 insertions(+), 15 deletions(-)
create mode 100644 libavfilter/aarch64/vf_bwdif_init_aarch64.c
create mode 100644 libavfilter/aarch64/vf_bwdif_neon.S
--
2.39.2
More information about the ffmpeg-devel
mailing list