[MPlayer-dev-eng] TODO

Michael Niedermayer michaelni at gmx.at
Wed Feb 13 00:11:09 CET 2002


Hi

On Tuesday 12 February 2002 18:16, Michael Niedermayer wrote:
> On Tuesday 12 February 2002 17:25, Arpi wrote:
[...]
> > >
> > > the current fastmemcpy is perhaps not optimal for mem -> agp/pci copies
> > > because it assumes that read/writes go over the same bus, so it simply
> > > reads a few kb (they are in the L1 cache after that) and then reads
> > > them again and stores them (the trick is from amd's manual not from me
> > > ...) if both source &  destination are in main mem then the bus will be
> > > 100% busy cuz the data is read from L1 cache on the read/write pass and
> > > the accesses will be sequential instead of alternating between 2
> > > locations so its faster but if the destination is on an agp card than
> > > the agp bus will be idle 50% of the time and the mem bus will be idle
> > > the other 50% ... pretty bad allthough i didnt benchmark the mem->agp
> > > variant so its just theory ...
> >
> > hmm. try it, i wanna see a 2 times faster agp_memcpy() :)
>
> simply comparing the asm memcpy version in vo_dga against fastmemcpy
> ./mplayer -nosound
> ~michael/videos/Lorenna_McKennit-Mummers_Dance-Mononoke_Hime-gabucino.avi
> -vo dga -benchmark
>
> fastmemcpy:
> BENCHMARKs: V:   8.666s VO:   2.516s A:   0.000s Sys:   4.344s =   15.526s
> BENCHMARK%: V: 55.8162% VO: 16.2078% A:  0.0000% Sys: 27.9760% = 100.0000%
>
> vo_dga asm:
> BENCHMARKs: V:   8.726s VO:   2.019s A:   0.000s Sys:   4.372s =   15.117s
> BENCHMARK%: V: 57.7245% VO: 13.3552% A:  0.0000% Sys: 28.9203% = 100.0000%
> [...]
>
> ill write a agp_memcpy ...
done, heres the benchmark for my p3 at 500

./mplayer -nosound 
~michael/videos/Lorenna_McKennit-Mummers_Dance-Mononoke_Hime-gabucino.avi -vo 
dga -benchmark
rep_movsl (the asm code from vo_dga.c)
BENCHMARKs: V:   9.189s VO:   2.134s A:   0.000s Sys:   4.992s =   16.316s
BENCHMARKs: V:   9.232s VO:   2.162s A:   0.000s Sys:   4.625s =   16.019s

fastmemcpy
BENCHMARKs: V:   9.415s VO:   2.836s A:   0.000s Sys:   4.631s =   16.883s
BENCHMARKs: V:   9.213s VO:   2.726s A:   0.000s Sys:   4.700s =   16.639s

mem2agpcpy
BENCHMARKs: V:   9.189s VO:   2.033s A:   0.000s Sys:   4.654s =   15.877s
BENCHMARKs: V:   9.272s VO:   2.055s A:   0.000s Sys:   4.563s =   15.890s

[...]
Michael



More information about the MPlayer-dev-eng mailing list