[FFmpeg-devel] [PATCH] dcadsp: fix SSE code to not use SSE2 instructions.

Hendrik Leppkes h.leppkes at gmail.com
Sun Apr 6 18:26:24 CEST 2014


movq from SSE register to memory is an SSE2 instruction.
Instead, use SSE movlps, which does the same thing.
---
 libavcodec/x86/dcadsp.asm | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/libavcodec/x86/dcadsp.asm b/libavcodec/x86/dcadsp.asm
index 2758daa..1ac2378 100644
--- a/libavcodec/x86/dcadsp.asm
+++ b/libavcodec/x86/dcadsp.asm
@@ -153,7 +153,7 @@ DECODE_HF
     addps       m4, va ; va1+3 vb1+3 va2+4 vb2+4
     movhlps     vb, m4 ; va1+3  vb1+3
     addps       vb, m4 ; va0..4 vb0..4
-    movh    [outq + count], vb
+    movlps  [outq + count], vb
 %if %1
     sub       cf0q, 8*NUM_COEF
 %endif
-- 
1.9.0.msysgit.0



More information about the ffmpeg-devel mailing list