[FFmpeg-devel] [PATCH] VP8 luma(16) inner-MB H/V loopfilter MMX/SSE2
Sun Jul 18 22:11:42 CEST 2010
On Sun, 18 Jul 2010, Ronald S. Bultje wrote:
> On Sun, Jul 11, 2010 at 2:47 PM, Loren Merritt <lorenm at u.washington.edu> wrote:
>> On Sun, 11 Jul 2010, Michael Niedermayer wrote:
>>> On Sun, Jul 11, 2010 at 04:52:04PM +0000, Loren Merritt wrote:
>>>> Rather than special-casing most of the functions, we at x264 declared
>>>> Core1 doesn't have sse2, and changed the cpuid parser accordingly.
>>>> If you want to support the few cases where sse2 is slightly faster than
>>>> mmx, I recommend picking a different flag for that and applying it only
>>>> when you've tested on Core1, so that FF_MM_SSE2 can be trusted to dwim in
>>>> the usual case.
>>>> ?cpuid.c | ? 14 +++++++++++++-
>>>> ?1 file changed, 13 insertions(+), 1 deletion(-)
>>>> 7ba0916766645e2de9330e9ba8f30d815da14c91 ?cpuid.diff
>>> do we have any float SSE2 code that this could affect negatively?
>>> if not iam ok with this patch
> Attached patch implements FF_MM_SSE2/3SLOW for this purpose.
ok if you've tested it (I haven't).
More information about the ffmpeg-devel