[Ffmpeg-devel] [PATCH] wma optimizations
Michael Niedermayer
michaelni
Sat Aug 19 03:09:26 CEST 2006
Hi
On Sat, Aug 19, 2006 at 02:51:03AM +0200, Benjamin Larsson wrote:
> With the new dsp vector functions in the wma decoder it shaved of about
> 120000 dezicycles from the windowing code loop.
>
> Non simd before optimization:
> 138006 dezicycles in winmul, 4090 runs, 6 skipss dup=0 drop=0
> 81426 dezicycles in winspecial, 4091 runs, 5 skips
> 137845 dezicycles in winmul, 8172 runs, 20 skips dup=0 drop=0
> 81290 dezicycles in winspecial, 8177 runs, 15 skips
> 137565 dezicycles in winmul, 16350 runs, 34 skipsdup=0 drop=0
> 81051 dezicycles in winspecial, 16358 runs, 26 skips
>
> Simd:
> 50668 dezicycles in winmul, 4092 runs, 4 skips/s dup=0 drop=0
> 42723 dezicycles in winspecial, 4091 runs, 5 skips
> 50486 dezicycles in winmul, 8186 runs, 6 skips
> 42602 dezicycles in winspecial, 8186 runs, 6 skips
> 50317 dezicycles in winmul, 16369 runs, 15 skips dup=0 drop=0
> 42464 dezicycles in winspecial, 16370 runs, 14 skips
>
> A md5sum test on an amd64 box gave the same checksum for both versions.
> But I'm not sure that would be true for all kind of simd code, but the
> diffrent rounding should only lead to 1 bit differences.
>
> Overall speedup is around 3%. (From 14.2 -> 13.8).
great, and patch looks ok
[...]
--
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
In the past you could go to a library and read, borrow or copy any book
Today you'd get arrested for mere telling someone where the library is
More information about the ffmpeg-devel
mailing list