[FFmpeg-devel] [PATCH 3/3] avcodec/h264: sse2 and avx 4:2:2 idct add8 10-bit functions

Carl Eugen Hoyos ceffmpeg at gmail.com
Tue Nov 29 22:09:10 EET 2016


2016-11-29 17:14 GMT+01:00 James Darnley <jdarnley at obe.tv>:
> On 2016-11-29 15:30, Carl Eugen Hoyos wrote:
>> 2016-11-29 12:52 GMT+01:00 James Darnley <jdarnley at obe.tv>:
>>> sse2:
>>> complex: 4.13x faster (1514 vs. 367 cycles)
>>> simple:  4.38x faster (1836 vs. 419 cycles)
>>>
>>> avx:
>>> complex: 1.07x faster (260 vs. 244 cycles)
>>> simple:  1.03x faster (284 vs. 274 cycles)
>>
>> What are you comparing?

> The AVX comparison is against SSE2.

This wasn't obvious to me.

Thank you, Carl Eugen
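
For readers of the archive: the "Nx faster" figures quoted above appear to be the baseline cycle count divided by the optimized one. Below is a minimal sketch of that arithmetic, assuming the first number in each "A vs. B cycles" pair is the baseline. The helper name `speedup` is hypothetical and is not part of FFmpeg or its test tools. The thread states only that the AVX rows use SSE2 as the baseline; the baseline for the SSE2 rows is not stated here.

```python
def speedup(baseline_cycles: float, optimized_cycles: float) -> float:
    """Return how many times faster the optimized version is than the baseline."""
    if optimized_cycles <= 0:
        raise ValueError("optimized_cycles must be positive")
    return baseline_cycles / optimized_cycles


# Cycle counts copied from the quoted benchmark: (baseline, optimized).
# Assumption: the first number in each pair is the baseline measurement.
results = {
    "sse2 complex": (1514, 367),
    "sse2 simple": (1836, 419),
    "avx complex": (260, 244),  # baseline is SSE2, per the reply in this thread
    "avx simple": (284, 274),   # baseline is SSE2, per the reply in this thread
}

for name, (base, opt) in results.items():
    print(f"{name}: {speedup(base, opt):.2f}x faster ({base} vs. {opt} cycles)")
```

Ratios recomputed from the rounded cycle counts can differ in the last digit from the posted ones, which may have been derived from unrounded measurements.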

