[FFmpeg-devel] [PATCH] VC-1 MMX DSP functions
Aurelien Jacobs
aurel
Wed Nov 14 00:32:11 CET 2007
Christophe GISQUET wrote:
> Michael Niedermayer wrote:
> > the code which is overall (whole decoder) fastest
> > and for cases where 2 are indistinguishable the simpler one
>
> Sorry for the delay in replying but it was somewhat worth it: testing on
> a P4 showed that at least one optimization was in fact degrading
> performance (special case in vc1_put_shift2_mmx when stride == offset).
>
> Therefore, final (as far as I see) patch attached.
>
> Summary:
> MMX version for VC-1 subpel motion compensation functions. 30% faster
> decoding.
I've just tested this patch on an HD-DVD sample.
First, it works fine on x86-64... great.
Now the bench on my Athlon64 (best of 3 runs):
$ time mplayer UNILOGO.EVO -nosound -benchmark -vo null -quiet
unpatched:
BENCHMARKs: VC: 26.387s VO: 0.002s A: 0.000s Sys: 0.289s = 26.678s
BENCHMARK%: VC: 98.9100% VO: 0.0067% A: 0.0000% Sys: 1.0833% = 100.0000%
real 0m26.722s
user 0m26.526s
sys 0m0.188s
patched:
BENCHMARKs: VC: 17.850s VO: 0.002s A: 0.000s Sys: 0.291s = 18.143s
BENCHMARK%: VC: 98.3881% VO: 0.0094% A: 0.0000% Sys: 1.6025% = 100.0000%
real 0m18.190s
user 0m18.053s
sys 0m0.132s
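
For reference, here is a rough sketch of the arithmetic behind these two runs. It only reuses the VC and real timings printed above; the helper names (time_saved_pct, speedup) are made up for illustration and are not part of the patch or of mplayer.

```python
# Timings in seconds, copied from the two benchmark runs above.
unpatched = {"VC": 26.387, "real": 26.722}
patched = {"VC": 17.850, "real": 18.190}


def time_saved_pct(before, after):
    """Percentage of the original run time that the patch removes."""
    return 100.0 * (before - after) / before


def speedup(before, after):
    """How many times faster the patched run is than the unpatched one."""
    return before / after


if __name__ == "__main__":
    for key in ("VC", "real"):
        saved = time_saved_pct(unpatched[key], patched[key])
        ratio = speedup(unpatched[key], patched[key])
        print(f"{key}: {saved:.1f}% less time, {ratio:.2f}x speedup")
```

On this sample that works out to roughly a third less video decoding time, in line with the "30% faster decoding" from the patch summary.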
Very nice optimization!
Aurel