[FFmpeg-devel] [PATCH 2/5] libavfilter/x86/vf_gblur: add ff_verti_slice_avx2/512()

Michael Niedermayer michael at niedermayer.cc
Mon Aug 2 21:16:04 EEST 2021


On Mon, Aug 02, 2021 at 01:34:36PM +0800, Wu Jianhua wrote:
> The new vertical slice with AVX2/512 acceleration can significantly
> improve the performance of Gaussian Filter 2D.
> 
> Performance data (fps):
> ff_verti_slice_c: 32.57
> ff_verti_slice_avx2: 476.19
> ff_verti_slice_avx512: 833.33
> 
> Co-authored-by: Cheng Yanfei <yanfei.cheng at intel.com>
> Co-authored-by: Jin Jun <jun.i.jin at intel.com>
> ---
>  libavfilter/gblur.h             |   2 +
>  libavfilter/vf_gblur.c          |  24 ++--
>  libavfilter/x86/vf_gblur.asm    | 187 ++++++++++++++++++++++++++++++++
>  libavfilter/x86/vf_gblur_init.c |   7 ++
>  4 files changed, 212 insertions(+), 8 deletions(-)

breaks on x86_32
...
src/libavfilter/x86/vf_gblur.asm:363: ... from macro `VERTI_SLICE' defined here
src/libavfilter/x86/vf_gblur.asm:46: ... from macro `PUSH_MASK' defined here
src/libavfilter/x86/vf_gblur.asm:415: error: invalid operands in non-64-bit mode
src/libavfilter/x86/vf_gblur.asm:363: ... from macro `VERTI_SLICE' defined here
src/libavfilter/x86/vf_gblur.asm:46: ... from macro `PUSH_MASK' defined here
src/libavfilter/x86/vf_gblur.asm:415: error: invalid operands in non-64-bit mode
src/libavfilter/x86/vf_gblur.asm:363: ... from macro `VERTI_SLICE' defined here
src/libavfilter/x86/vf_gblur.asm:46: ... from macro `PUSH_MASK' defined here
src/libavfilter/x86/vf_gblur.asm:420: error: invalid operands in non-64-bit mode
src/libavfilter/x86/vf_gblur.asm:363: ... from macro `VERTI_SLICE' defined here
src/libavfilter/x86/vf_gblur.asm:49: ... from macro `PUSH_MASK' defined here
src/ffbuild/common.mak:92: recipe for target 'libavfilter/x86/vf_gblur.o' failed
make: *** [libavfilter/x86/vf_gblur.o] Error 1
make: *** Waiting for unfinished jobs....



[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Why not whip the teacher when the pupil misbehaves? -- Diogenes of Sinope
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 195 bytes
Desc: not available
URL: <https://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20210802/b2f25577/attachment.sig>


More information about the ffmpeg-devel mailing list