[FFmpeg-devel] [PATCH] swscale/x86/rgb2rgb: add AVX512ICL versions of shuffle_bytes
Shreesh Adiga
16567adigashreesh at gmail.com
Sat Jan 25 17:50:50 EET 2025
> Try running it several times using the same seed, so
> "tests/checkasm/checkasm --test=sw_rgb --bench 17575157", and make sure
> no power saving feature is enabled (so the CPU frequency doesn't change
> based on load). That may help getting consistent results.
After setting the CPU frequency governor with "echo performance | tee
/sys/devices/system/cpu/cpu*/cpufreq/scaling_governor" and
recompiling ffmpeg with "--enable-linux-perf", I see the following
numbers:
shuffle_bytes_0321_c:          56.5 ( 1.00x)
shuffle_bytes_0321_ssse3:      18.0 ( 3.14x)
shuffle_bytes_0321_avx2:       10.0 ( 5.65x)
shuffle_bytes_0321_avx512icl:   9.0 ( 6.28x)

shuffle_bytes_1230_c:          84.5 ( 1.00x)
shuffle_bytes_1230_ssse3:      18.2 ( 4.63x)
shuffle_bytes_1230_avx2:       22.2 ( 3.80x)
shuffle_bytes_1230_avx512icl:  10.0 ( 8.45x)

shuffle_bytes_2103_c:          49.8 ( 1.00x)
shuffle_bytes_2103_ssse3:      21.2 ( 2.34x)
shuffle_bytes_2103_avx2:       17.5 ( 2.84x)
shuffle_bytes_2103_avx512icl:   7.5 ( 6.63x)

shuffle_bytes_3012_c:          84.5 ( 1.00x)
shuffle_bytes_3012_ssse3:      17.0 ( 4.97x)
shuffle_bytes_3012_avx2:       16.0 ( 5.28x)
shuffle_bytes_3012_avx512icl:  16.2 ( 5.20x)

shuffle_bytes_3210_c:          92.8 ( 1.00x)
shuffle_bytes_3210_ssse3:      25.8 ( 3.60x)
shuffle_bytes_3210_avx2:       14.0 ( 6.62x)
shuffle_bytes_3210_avx512icl:   9.0 (10.31x)
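
For context on what is being measured: the four digits in each function
name are assumed here to be the source byte index for each output byte
within a 4-byte pixel (so 0321 keeps byte 0 and swaps bytes 1 and 3).
Below is a minimal scalar sketch under that assumption. The helper name
shuffle_bytes_ref and its index-array parameter are hypothetical and are
not part of the patch or of FFmpeg's API.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical reference helper, not FFmpeg's actual implementation.
 * For every complete 4-byte group, output byte k is copied from input
 * byte idx[k] of the same group. Trailing bytes that do not form a
 * complete group are left untouched. */
static void shuffle_bytes_ref(const uint8_t *src, uint8_t *dst,
                              size_t size, const uint8_t idx[4])
{
    for (size_t i = 0; i + 4 <= size; i += 4) {
        dst[i + 0] = src[i + idx[0]];
        dst[i + 1] = src[i + idx[1]];
        dst[i + 2] = src[i + idx[2]];
        dst[i + 3] = src[i + idx[3]];
    }
}
```

Calling it with idx = {0, 3, 2, 1} would correspond to the 0321 case
under the naming assumption above. The SIMD versions would apply the same
permutation to many pixels per instruction.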
Thanks,
Shreesh