[FFmpeg-devel] [PATCH v3 1/3] swscale/aarch64: Add bgr24 to yuv
Martin Storsjö
martin at martin.st
Mon Jun 24 14:52:27 EEST 2024
On Mon, 24 Jun 2024, Zhao Zhili wrote:
> From: Zhao Zhili <zhilizhao at tencent.com>
>
> Test on Apple M1 with kperf
> : -O3 : -O3 -fno-vectorize
> bgr24_to_uv_8_c : 28.5 : 52.5
> bgr24_to_uv_8_neon : 54.5 : 59.7
> bgr24_to_uv_128_c : 294.0 : 830.7
> bgr24_to_uv_128_neon : 99.7 : 112.0
> bgr24_to_uv_1080_c : 965.0 : 6624.0
> bgr24_to_uv_1080_neon : 751.5 : 754.7
> bgr24_to_uv_1920_c : 1693.2 : 11554.5
> bgr24_to_uv_1920_neon : 1292.5 : 1307.5
> bgr24_to_uv_half_8_c : 54.2 : 37.0
> bgr24_to_uv_half_8_neon : 27.2 : 22.5
> bgr24_to_uv_half_128_c : 127.2 : 392.5
> bgr24_to_uv_half_128_neon : 63.0 : 52.0
> bgr24_to_uv_half_1080_c : 880.2 : 3329.0
> bgr24_to_uv_half_1080_neon : 401.5 : 390.7
> bgr24_to_uv_half_1920_c : 1585.7 : 6390.7
> bgr24_to_uv_half_1920_neon : 694.7 : 698.7
> bgr24_to_y_8_c : 21.7 : 22.5
> bgr24_to_y_8_neon : 797.2 : 25.5
> bgr24_to_y_128_c : 88.0 : 280.5
> bgr24_to_y_128_neon : 63.7 : 55.0
> bgr24_to_y_1080_c : 616.7 : 2208.7
> bgr24_to_y_1080_neon : 900.0 : 452.0
> bgr24_to_y_1920_c : 1093.2 : 3894.7
> bgr24_to_y_1920_neon : 777.2 : 767.5
> ---
This patch looks ok now
// Martin
More information about the ffmpeg-devel
mailing list