[FFmpeg-devel] [PATCH] VP8 MMX optimizations (MC and IDCT dc_add)

Michael Niedermayer michaelni
Wed Jun 23 00:37:54 CEST 2010


On Wed, Jun 23, 2010 at 12:29:45AM +0200, Michael Niedermayer wrote:
> On Tue, Jun 22, 2010 at 03:35:40PM -0400, Ronald S. Bultje wrote:
> > Hi,
> > 
> > as per $subj.
> > 
> > Speed gain:
> > - dc_add goes from 1800 to 1350 cycles (of which 1150 is overhead,
> > measured with an empty asm function), so about 3-3.5x faster once
> > the overhead is subtracted.
> > - The MC functions are each about 4-5x faster (I only measured the 4x4
> > ones, the rest I assume are similarly faster but not measured).
> > - Total time spent on a shell script that decodes the whole test suite
> > (vp8-test-vectors-r1, files 001-017), including shell overhead and
> > everything, goes from 2.3 to 2.1 seconds with these applied.
> > 
> > Results are bit-identical, and this is my first MMX/etc. ever! Thanks
> > to Jason for teaching me. ;-).
> > 
> > Ronald

I just wanted to clarify that I have no objections to this being committed
and improved afterwards.
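
As a cross-check on the dc_add figure quoted above, here is a minimal sketch of the overhead-corrected speedup. The helper name net_speedup is hypothetical and only for illustration; the cycle counts are the ones reported in the quoted message.

```python
def net_speedup(before_cycles, after_cycles, overhead_cycles):
    """Speedup after subtracting a fixed measurement overhead from both timings."""
    net_before = before_cycles - overhead_cycles
    net_after = after_cycles - overhead_cycles
    if net_after <= 0:
        raise ValueError("overhead must be smaller than the optimized timing")
    return net_before / net_after


# Figures from the quoted message: 1800 -> 1350 cycles, 1150 cycles of overhead.
raw_ratio = 1800 / 1350                        # ratio with overhead still included
net_ratio = net_speedup(1800, 1350, 1150)      # (1800 - 1150) / (1350 - 1150)

print(f"raw ratio: {raw_ratio:.2f}x")
print(f"net of overhead: {net_ratio:.2f}x")
```

The net figure falls inside the quoted "3-3.5x" range, while the raw ratio understates the gain because most of each measurement is call overhead.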

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

The greatest way to live with honor in this world is to be what we pretend
to be. -- Socrates