[FFmpeg-devel] [PATCH] x86/vf_blend: Add SSE2 optimization for divide

Timothy Gu timothygu99 at gmail.com
Sun Feb 28 17:19:20 CET 2016


On Sun, Feb 14, 2016 at 03:45:11PM +0100, Henrik Gramner wrote:
> You could try doing 8 or 16 bytes per iteration instead of 4, it might
> be faster depending on how good your cpu is at OOE.

As discussed on IRC, such changes made no observable difference, mainly
because the bottleneck is the division itself.

Therefore, patch applied without changes.
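
For context, the trade-off can be sketched in plain C. This is a scalar
illustration only, not the SSE2 code from the patch: the function names,
the handling of a zero divisor, and the clamping are assumptions made for
the example. The loop handles 4 bytes per iteration. Widening it to 8 or
16 would reduce loop overhead, but each output byte would still cost one
division, which fits the observation that the division dominates.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical per-pixel divide blend: top / bottom scaled to 0..255 and
 * clamped. Mapping a zero divisor to 255 is an assumption of this sketch. */
static uint8_t blend_divide_px(uint8_t top, uint8_t bottom)
{
    if (bottom == 0)
        return 255;
    float v = (float)top / (float)bottom * 255.0f;
    return v > 255.0f ? 255 : (uint8_t)v;
}

/* Blend one row, 4 bytes per main-loop iteration, then a scalar tail. */
static void blend_divide_row(const uint8_t *top, const uint8_t *bottom,
                             uint8_t *dst, size_t width)
{
    size_t x = 0;

    /* Unrolled by 4: one division per byte regardless of the unroll width. */
    for (; x + 4 <= width; x += 4) {
        dst[x + 0] = blend_divide_px(top[x + 0], bottom[x + 0]);
        dst[x + 1] = blend_divide_px(top[x + 1], bottom[x + 1]);
        dst[x + 2] = blend_divide_px(top[x + 2], bottom[x + 2]);
        dst[x + 3] = blend_divide_px(top[x + 3], bottom[x + 3]);
    }

    /* Remaining bytes when width is not a multiple of 4. */
    for (; x < width; x++)
        dst[x] = blend_divide_px(top[x], bottom[x]);
}
```

Changing the unroll factor here only changes how often the loop branch is
taken; the number of divisions per row stays the same.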

Timothy

