[MPlayer-dev-eng] [PATCH] yadif SSE2/SSSE3 optimization

Zhou, Zongyi zz65 at cornell.edu
Thu Nov 13 13:22:50 CET 2008


Hi all, 

I made this patch from ffdshow-mt branch. The original SSE2/SSSE3 codes are written by h.yamagata.
I ported those codes back to mplayer and replaced all movdqa/movdqu with movaps/movups.

My tests show that on Intel CPUs (except Yonah), SSE2 is ~20% faster than MMX2 and SSSE3 is 30% faster than MMX2.
However on AMD CPUs (except Socket 754 Semprons), SSE2 is ~5% slower than MMX2.
So now SSE2 function is used only on Intel CPUs.

Regards,

ZZ
-------------- next part --------------
A non-text attachment was scrubbed...
Name: yadif_sse2_ssse3_optimization.patch
Type: application/octet-stream
Size: 13457 bytes
Desc: not available
URL: <http://lists.mplayerhq.hu/pipermail/mplayer-dev-eng/attachments/20081113/d0f36127/attachment.obj>


More information about the MPlayer-dev-eng mailing list