[FFmpeg-devel] [PATCH] MMX VP3 Loop Filter
Jason Garrett-Glaser
darkshikari
Sat Oct 11 11:14:09 CEST 2008
On Sat, Oct 11, 2008 at 1:53 AM, David Conrad <lessen42 at gmail.com> wrote:
> filter_limit *= 0x02020202;
>"movd "#flim", %%mm5 \n\t" \
>"punpcklbw %%mm5, %%mm5 \n\t" \
Which is faster, this, or SPLATB in the form of punpcklbw + pshufw +
psllw (psllw because the filter_limit values are guaranteed to be <
128, so a word left shift is equivalent to a byte left shift)?
The SPLATB would avoid the integer multiply, and perhaps also as
importantly avoid the register->mm move, since you'll be able to load
it directly off the stack.
Dark Shikari
More information about the ffmpeg-devel
mailing list