[Ffmpeg-devel] [PATCH] Faster rgb32tobgr32
Trent Piepho
xyzzy
Tue Apr 17 16:51:09 CEST 2007
On Tue, 17 Apr 2007, Ivo wrote:
> Hi,
>
> The following patch makes rgb32tobgr32 slightly faster by avoiding an extra
> add and a cmp.
>
> Old C Code, Avg of ten runs: 68345307
> New C Code, Avg of ten runs: 67644935
>
> Old MMX2 Code, Avg of ten runs: 66875441
> New MMX2 Code, Avg of ten runs: 65855368
Don't suppose you know the variance of the ten runs?
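For what it's worth, here is a minimal sketch of how the spread could be checked, assuming the ten per-run counts are collected into an array. The helper names `mean` and `sample_variance` are hypothetical and not part of the patch. A difference of roughly 1 to 1.5% between the averages only means something if it is clearly larger than the run-to-run spread.

```c
#include <assert.h>
#include <stddef.h>

/* Arithmetic mean of n samples; requires n >= 1. */
double mean(const double *x, size_t n)
{
    assert(n >= 1);
    double sum = 0.0;
    for (size_t i = 0; i < n; i++)
        sum += x[i];
    return sum / (double)n;
}

/* Unbiased sample variance (divides by n - 1); requires n >= 2.
 * Two passes: first the mean, then the squared deviations from it. */
double sample_variance(const double *x, size_t n)
{
    assert(n >= 2);
    double m = mean(x, n);
    double ss = 0.0;
    for (size_t i = 0; i < n; i++) {
        double d = x[i] - m;
        ss += d * d;
    }
    return ss / (double)(n - 1);
}
```

With the variance of each set of ten runs in hand, the difference between the old and new averages can be compared against the standard error of a mean, which is the square root of (variance / 10).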
+ : "+r"(idx)
+ : "r" (s), "r" (d), "m" (mask32b), "m" (mask32r), "m" (mmx_one)
: "memory");
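It should be "+&r"(idx), i.e. an earlyclobber operand. Realistically, idx won't be able to overlap any
of those parameters, but without the "&" gcc is allowed to put an input in the same register as idx,
and the loop modifies idx while the inputs are still being read. Someone could always add a new input
that could overlap.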
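To illustrate the constraint, here is a rough sketch, not the code from the patch. It assumes gcc-style extended asm on x86-64, and the function name `swap_rb32` is made up. It uses bswap plus a rotate instead of the patch's MMX masks, and counts a negative byte offset up to zero so the loop needs no separate cmp. On other compilers or targets it falls back to plain C.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: swap the R and B bytes of n 32-bit pixels
 * (0xAARRGGBB -> 0xAABBGGRR). Not the implementation from the patch. */
void swap_rb32(const uint32_t *src, uint32_t *dst, size_t n)
{
    if (n == 0)
        return;
#if defined(__GNUC__) && defined(__x86_64__)
    const uint32_t *s = src + n;           /* one past the end of the input  */
    uint32_t *d = dst + n;                 /* one past the end of the output */
    ptrdiff_t idx = -(ptrdiff_t)(n * 4);   /* negative byte offset, counts up to 0 */

    __asm__ volatile(
        "1:                     \n\t"
        "movl (%1, %0), %%eax   \n\t"      /* load pixel at s + idx           */
        "bswap %%eax            \n\t"      /* AARRGGBB -> BBGGRRAA            */
        "rorl $8, %%eax         \n\t"      /* BBGGRRAA -> AABBGGRR            */
        "movl %%eax, (%2, %0)   \n\t"      /* store pixel at d + idx          */
        "addq $4, %0            \n\t"      /* advance; sets ZF when idx == 0  */
        "jnz 1b                 \n\t"
        /* "+&r": idx is read-write and earlyclobber, so the compiler
         * may not place s or d (or any later input) in its register. */
        : "+&r"(idx)
        : "r"(s), "r"(d)
        : "eax", "cc", "memory");
#else
    /* Portable fallback: keep A and G, exchange the R and B bytes. */
    for (size_t i = 0; i < n; i++) {
        uint32_t x = src[i];
        dst[i] = (x & 0xFF00FF00u) | ((x & 0xFFu) << 16) | ((x >> 16) & 0xFFu);
    }
#endif
}
```

With plain "+r", nothing in the constraints tells the compiler that idx is written while s and d are still needed, so it relies on the register allocator never finding a reason to share. The "&" makes that requirement explicit.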