[Ffmpeg-devel] a little optim for a SSE version of H263_LOOP_FILTER

Zuxy Meng zuxy.meng
Mon Nov 6 14:58:18 CET 2006


Hi,

2006/11/5, skal <skal65535 at orange.fr>:
>
>  Hi everybody,
>
>  in case, it seems to me a SSE version of
>  H263_LOOP_FILTER is possible by replacing
>       "psubusb %%mm4, %%mm2           \n\t"\
>       "movq %%mm2, %%mm3              \n\t"\
>       "psubusb %%mm4, %%mm3           \n\t"\
>       "psubb %%mm3, %%mm2             \n\t"\
>  at dsputil_mmx.c:587 (fresh cvs), by:
>       "psubusb %%mm4, %%mm2           \n\t"\
>       "pminub %%mm4, %%mm2           \n\t"\
>
>  +maybe a little re-org of the loop (mm3 is gone).
>
>  Well, this is just for the fun of it, since the speed-up
>  (if any) might not be worth a special version...

What about rewrite it with purely SSE2? I'll try when I have time...

-- 
Zuxy
Beauty is truth,
While truth is beauty.
PGP KeyID: E8555ED6




More information about the ffmpeg-devel mailing list