[FFmpeg-devel] [PATCH] VC-1 MMX DSP functions

Reimar Döffinger Reimar.Doeffinger
Wed Oct 3 20:16:39 CEST 2007


Hello,
On Tue, Oct 02, 2007 at 11:19:42PM +0200, Michael Niedermayer wrote:
[...]
> > +        ASMALIGN(3)
> > +        "1:                                \n\t"
> 
> how much speed is gained by the align?

Inconclusive in my tests on AMD64 on a 64 bit OS:
without:
3012 dezicycles in vc1_put_ver_16b_shift2_mmx, 1048397 runs, 179 skips
1249 dezicycles in vc1_put_hor_16b_shift2_mmx, 1048505 runs, 71 skips

3011 dezicycles in vc1_put_ver_16b_shift2_mmx, 1048397 runs, 179 skips
1232 dezicycles in vc1_put_hor_16b_shift2_mmx, 1048517 runs, 59 skips

3011 dezicycles in vc1_put_ver_16b_shift2_mmx, 1048514 runs, 62 skips
1232 dezicycles in vc1_put_hor_16b_shift2_mmx, 1048548 runs, 28 skips

with:
3038 dezicycles in vc1_put_ver_16b_shift2_mmx, 1048340 runs, 236 skips
1259 dezicycles in vc1_put_hor_16b_shift2_mmx, 1048487 runs, 89 skips

3027 dezicycles in vc1_put_ver_16b_shift2_mmx, 1048415 runs, 161 skips
1259 dezicycles in vc1_put_hor_16b_shift2_mmx, 1048515 runs, 61 skips

3030 dezicycles in vc1_put_ver_16b_shift2_mmx, 1048384 runs, 192 skips
1258 dezicycles in vc1_put_hor_16b_shift2_mmx, 1048516 runs, 60 skips

Probably other CPUs are different...

Greetings,
Reimar D?ffinger




More information about the ffmpeg-devel mailing list