[FFmpeg-devel] [PATCH v2 0/7] arm64 neon implementation for 8bits functions
Grzegorz Bernacki
gjb at semihalf.com
Mon Oct 3 17:10:13 EEST 2022
Changes since v1:
- changed tabs to spaces
- modified the branch instruction in vsse8
- applied Martin's patches with improved instruction scheduling
Grzegorz Bernacki (4):
  lavc/aarch64: Add neon implementation for pix_abs8 functions.
  lavc/aarch64: Provide neon implementation of nsse8
  lavc/aarch64: Provide optimized implementation of vsse8 for arm64.
  lavc/aarch64: Add neon implementation for vsse_intra8

Martin Storsjö (3):
  aarch64: me_cmp: Improve scheduling in ff_pix_abs8_y2_neon
  aarch64: me_cmp: Fix up the prologue of ff_pix_abs8_xy2_neon
  aarch64: me_cmp: Improve scheduling in vsse_intra8
libavcodec/aarch64/me_cmp_init_aarch64.c | 33 ++
libavcodec/aarch64/me_cmp_neon.S | 414 +++++++++++++++++++++++
2 files changed, 447 insertions(+)
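
For readers less familiar with these comparison functions, here is a
scalar C sketch of what pix_abs8 and vsse8 are understood to compute.
It is an illustration only and not code from this series: the names
sad8_ref and vsse8_ref are hypothetical, and the formulas assume the
semantics of the C versions in libavcodec/me_cmp.c (sum of absolute
differences, and squared vertical gradient of the block difference).

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical scalar reference: sum of absolute differences over an
 * 8-pixel-wide block of h rows. */
static int sad8_ref(const uint8_t *pix1, const uint8_t *pix2,
                    ptrdiff_t stride, int h)
{
    int sum = 0;

    assert(h >= 0);
    for (int y = 0; y < h; y++) {
        for (int x = 0; x < 8; x++)
            sum += abs(pix1[x] - pix2[x]);
        pix1 += stride;
        pix2 += stride;
    }
    return sum;
}

/* Hypothetical scalar reference: sum of squared differences between the
 * vertical gradients of two 8-pixel-wide blocks. Each row is compared
 * with the row below it, so h rows give h - 1 row pairs. */
static int vsse8_ref(const uint8_t *s1, const uint8_t *s2,
                     ptrdiff_t stride, int h)
{
    int score = 0;

    assert(h >= 1);
    for (int y = 1; y < h; y++) {
        for (int x = 0; x < 8; x++) {
            int d = s1[x] - s2[x] - s1[x + stride] + s2[x + stride];
            score += d * d;
        }
        s1 += stride;
        s2 += stride;
    }
    return score;
}
```

Both loops work on eight bytes per row, which is presumably why they
map well onto 64-bit NEON vector loads.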
--
2.37.1