[FFmpeg-devel] [PATCH] SSE2 Xvid idct

Pascal Massimino pascal.massimino
Sun Apr 6 22:03:30 CEST 2008


  Re,

On Sun, Apr 6, 2008 at 9:39 PM, Pascal Massimino <pascal.massimino at gmail.com>
wrote:

>
>
> >
> > [...]
> > >     "movdqa   %%xmm2, ("dct")         \n\t" \
> > >     "movdqa   %%xmm3, %%xmm2          \n\t" \
> > >     "psubsw   %%xmm6, %%xmm3          \n\t" \
> > >     "paddsw   %%xmm2, %%xmm6          \n\t" \
> > >     "movdqa   %%xmm6, %%xmm2          \n\t" \
> > >     "psubsw   %%xmm7, %%xmm6          \n\t" \
> > >     "paddsw   %%xmm2, %%xmm7          \n\t" \
> > >     "movdqa   %%xmm3, %%xmm2          \n\t" \
> > >     "psubsw   %%xmm5, %%xmm3          \n\t" \
> > >     "paddsw   %%xmm2, %%xmm5          \n\t" \
> > >     "movdqa   %%xmm5, %%xmm2          \n\t" \
> > >     "psubsw   %%xmm0, %%xmm5          \n\t" \
> > >     "paddsw   %%xmm2, %%xmm0          \n\t" \
> > >     "movdqa   %%xmm3, %%xmm2          \n\t" \
> > >     "psubsw   %%xmm4, %%xmm3          \n\t" \
> > >     "paddsw   %%xmm2, %%xmm4          \n\t" \
> > >     "movdqa  ("dct"), %%xmm2          \n\t" \\
>
>
 oh! now i recall an optim: you don't need to
 save and recall xmm2 in "dct", provided you replace
 the first butterfly :

>     "movdqa   %%xmm3, %%xmm2          \n\t" \
>     "psubsw   %%xmm6, %%xmm3          \n\t" \
>     "paddsw   %%xmm2, %%xmm6          \n\t" \

 by its (non-saturating) sub,add,add equivalent:

psubw %%xmm6,%%xmm3
paddw %%xmm6,%%xmm6
paddw %%xmm3,%%xmm6

skal




More information about the ffmpeg-devel mailing list