[MPlayer-dev-eng] [PATCH]: af_resample SIMD optimization (MMX/SSE2/3DNOW!/SSE/SSE3)
Loren Merritt
lorenm at u.washington.edu
Fri Mar 13 04:23:42 CET 2009
On Thu, 12 Mar 2009, Zhou Zongyi wrote:
> This patch implements SIMD optimized FIR filter in resample.
Filter weights should be aligned, and multiplied directly from memory.
pshufd is slow, pshuflw is preferred for the horizontal sum.
There's a division in the inner loop: wi=(wi+dn)%up
I would consider transposing the filter weights, eliminate most of the
sample loads (use palignr to effectively keep them in a long
shift-register instead), and eliminate the horizontal sums.
Failing that, do multiple horizontal sums at once to make full use of simd
width.
--Loren Merritt
More information about the MPlayer-dev-eng
mailing list