[FFmpeg-devel] [PATCH] VP8 MMX optimizations (MC and IDCT dc_add)
Wed Jun 23 00:37:54 CEST 2010
On Wed, Jun 23, 2010 at 12:29:45AM +0200, Michael Niedermayer wrote:
> On Tue, Jun 22, 2010 at 03:35:40PM -0400, Ronald S. Bultje wrote:
> > Hi,
> > as per $subj.
> > Speed gain:
> > - dc_add goes from 1800 to 1350 cycles (where 1150 is overhead,
> > measured as empty asm func), so about 3-3.5x faster.
> > - The MC functions are each about 4-5x faster (I only measured the 4x4
> > ones, the rest I assume are similarly faster but not measured).
> > - Total time spent on a shell-script that decodes the whole testsuite
> > (vp8-test-vectors-r1, file 001-017) including shell overhead and
> > everything goers from 2.3 to 2.1 seconds with these applied.
> > Results are bit-identical, and this is my first MMX/etc. ever! Thanks
> > to Jason for teaching me. ;-).
> > Ronald
i just wanted to clarify that i have no objections to this being commited
and improved afterwards
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
The greatest way to live with honor in this world is to be what we pretend
to be. -- Socrates
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Size: 189 bytes
Desc: Digital signature
More information about the ffmpeg-devel