[FFmpeg-cvslog] dcadsp: fix SSE code to not use SSE2 instructions.
Hendrik Leppkes
git at videolan.org
Sun Apr 6 18:44:30 CEST 2014
ffmpeg | branch: master | Hendrik Leppkes <h.leppkes at gmail.com> | Sun Apr 6 18:26:24 2014 +0200| [fc7e02f0ff345d5331b7c78f2400668d2c79a8b0] | committer: Michael Niedermayer
dcadsp: fix SSE code to not use SSE2 instructions.
movq from SSE register to memory is an SSE2 instruction.
Instead, use SSE movlps, which does the same thing.
Signed-off-by: Michael Niedermayer <michaelni at gmx.at>
> http://git.videolan.org/gitweb.cgi/ffmpeg.git/?a=commit;h=fc7e02f0ff345d5331b7c78f2400668d2c79a8b0
---
libavcodec/x86/dcadsp.asm | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/libavcodec/x86/dcadsp.asm b/libavcodec/x86/dcadsp.asm
index 2758daa..1ac2378 100644
--- a/libavcodec/x86/dcadsp.asm
+++ b/libavcodec/x86/dcadsp.asm
@@ -153,7 +153,7 @@ DECODE_HF
addps m4, va ; va1+3 vb1+3 va2+4 vb2+4
movhlps vb, m4 ; va1+3 vb1+3
addps vb, m4 ; va0..4 vb0..4
- movh [outq + count], vb
+ movlps [outq + count], vb
%if %1
sub cf0q, 8*NUM_COEF
%endif
More information about the ffmpeg-cvslog
mailing list