[FFmpeg-devel] [PATCH] avfilter/avf_showcqt: cqt_calc optimization on x86
Muhammad Faiz
mfcc64 at gmail.com
Tue Jun 7 09:51:46 CEST 2016
On Tue, Jun 7, 2016 at 9:49 AM, Michael Niedermayer
<michael at niedermayer.cc> wrote:
> On Tue, Jun 07, 2016 at 08:07:45AM +0700, Muhammad Faiz wrote:
>> On Sat, Jun 4, 2016 at 2:36 PM, Muhammad Faiz <mfcc64 at gmail.com> wrote:
>> > benchmark on x86_64
>> > cqt_time:
>> > plain = 3.292 s
>> > SSE = 1.640 s
>> > SSE3 = 1.631 s
>> > AVX = 1.395 s
>> > FMA3 = 1.271 s
>> > FMA4 = not available
>> >
>> > untested on x86_32
>> >
>> > Signed-off-by: Muhammad Faiz <mfcc64 at gmail.com>
>> > ---
>> > libavfilter/avf_showcqt.c | 7 ++
>> > libavfilter/avf_showcqt.h | 3 +
>> > libavfilter/x86/Makefile | 2 +
>> > libavfilter/x86/avf_showcqt.asm | 206 +++++++++++++++++++++++++++++++++++++
>> > libavfilter/x86/avf_showcqt_init.c | 63 ++++++++++++
>> > 5 files changed, 281 insertions(+)
>> > create mode 100644 libavfilter/x86/avf_showcqt.asm
>> > create mode 100644 libavfilter/x86/avf_showcqt_init.c
>> >
>>
>> I want to apply this patch but I'm not sure if it works on
>> x86_32 or not (I didn't test it on x86_32)
>
> I ran FATE with it applied on x86_32 earlier today, but I think
> this filter isn't covered by FATE ...
>
> Testing x86_32 is easy; pass these options to configure:
> --arch=x86_32 --target-os=linux --extra-cflags=-m32 --extra-ldflags=-m32 --enable-cross-compile
>
Unfortunately, my Linux install is pure 64-bit (Fedora 23, with no 32-bit
libraries), so I cannot build or test for x86_32 here.
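
For reference, the speedups implied by the cqt_time figures quoted above can be
worked out with a short sketch like the one below. It is only arithmetic on the
posted numbers; the names CQT_TIME and speedups are made up for this example
and are not part of the patch.

```python
# cqt_time figures (seconds) copied from the benchmark quoted above.
# FMA4 is omitted because it was reported as "not available".
CQT_TIME = {
    "plain": 3.292,
    "SSE": 1.640,
    "SSE3": 1.631,
    "AVX": 1.395,
    "FMA3": 1.271,
}


def speedups(times, baseline="plain"):
    """Return baseline_time / time for every entry except the baseline."""
    base = times[baseline]
    return {name: base / t for name, t in times.items() if name != baseline}


if __name__ == "__main__":
    # Hypothetical helper output: one line per instruction set.
    for name, ratio in speedups(CQT_TIME).items():
        print(f"{name:5s} {ratio:.2f}x faster than plain")
```

By these numbers the SSE version is roughly 2x faster than the plain C code,
and the FMA3 version is roughly 2.6x faster.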
Thanks