[FFmpeg-devel] swscale/swscale_unscaled : add X86_64 (SSE2, AVX) for uyvyto422

Martin Vignali martin.vignali at gmail.com
Tue Apr 3 00:26:41 EEST 2018


Hello,

In attach patch to add SSE and AVX SIMD (x86_64) for uyvyto422
Write using external ASM (keep a similar logic, than the MMX Inline version)

also add checkasm test

uyvy is use by some devices (like decklink, to provide raw data)

Around 20% faster  (on a "benchmark cmd", who test pix_fmt conversion)
(4.2s with the patch, 5.2s without)

Pass fate test for me.

Checkasm result :
uyvytoyuv422_c: 14146.6
uyvytoyuv422_mmx: 13696.4
uyvytoyuv422_mmxext: 19395.9
uyvytoyuv422_sse2: 6284.1
uyvytoyuv422_avx: 5379.6

Martin
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-swscale-swscale_unscaled-add-X86_64-SSE2-and-AVX-for.patch
Type: application/octet-stream
Size: 9896 bytes
Desc: not available
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20180402/43e86b6a/attachment.obj>


More information about the ffmpeg-devel mailing list