[FFmpeg-devel] [PATCH] Add x86-optimized versions of exponent_min().
Loren Merritt
lorenm
Mon Jan 31 20:53:24 CET 2011
On Mon, 31 Jan 2011, Justin Ruggles wrote:
> I get some very weird mmx2 results when I remove the first sub and
> change jae to ja.
>
> Athlon64 X2 6000+
> sse2: 3006 -> 2753
> mmx2: 5228 -> 5453
> mmx: 5490 -> 5430
>
> Atom 330
> sse2: 6834 -> 3779
> mmx2: 9951 -> 10525
> mmx: 11390 -> 11325
>
> Both CPUs are consistent in the change, except that on Athlon64 the mmx2
> version is slower than the mmx version. What do you suggest?
I usually blame such weird results on code alignment, but I have no
systematic way to fix them.
--Loren Merritt
More information about the ffmpeg-devel
mailing list