[FFmpeg-devel] [PATCH 9/9] [ARMv6] Add VFP-accelerated version of qmf_32_subbands

Michael Niedermayer michaelni at gmx.at
Mon Jul 15 22:01:19 CEST 2013


On Mon, Jul 15, 2013 at 06:28:17PM +0100, Ben Avison wrote:
>                Before           After
>                Mean    StdDev   Mean    StdDev  Change
> This function   1323.0  98.0      746.2  60.6   +77.3%
> Overall        15400.0 336.4    14147.5 288.4    +8.9%
> 
> Signed-off-by: Ben Avison <bavison at riscosopen.org>
[...]

> +6:
> +        vmsr    FPSCR, OLDFPSCR
> +        ldr     WINDOW, [fp, #3*4]
> +        ldr     OUT, [fp, #4*4]
> +        sub     BUF, BUF, #32*4
> +NOVFP   ldr     SCALEINT, [fp, #6*4]
> +        mov     COUNT, #8
> +VFP     vpush   {SCALE}
> +VFP     sub     sp, sp, #3*4
> +NOVFP   sub     sp, sp, #4*4
> +7:
> +VFP     ldr     a1, [fp, #-7*4]     @ imdct
> +NOVFP   ldr     a1, [fp, #-8*4]
> +        ldmia   fp, {a2-a4}
> +VFP     stmia   sp, {WINDOW, OUT, BUF}
> +NOVFP   stmia   sp, {WINDOW, OUT, BUF, SCALEINT}
> +VFP     vldr    SCALE, [sp, #3*4]
> +        bl      ff_synth_filter_float_vfp
> +        add     OUT, OUT, #32*4
> +        add     BUF, BUF, #32*4
> +        subs    COUNT, COUNT, #1
> +        bne     7b
> +
> +        sub     sp, fp, #(8+8)*4

libavcodec/arm/dcadsp_vfp.S:473: Error: r13 not allowed here -- `sub sp,fp,#(8+8)*4'
make: *** [libavcodec/arm/dcadsp_vfp.o] Error 1

--extra-cflags='-mfpu=neon -mfloat-abi=softfp' --cpu=cortex-a8 --arch=armv7 --target-os=linux --enable-cross-compile

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

It is dangerous to be right in matters on which the established authorities
are wrong. -- Voltaire
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: Digital signature
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20130715/eed928d1/attachment.asc>


More information about the ffmpeg-devel mailing list