[FFmpeg-cvslog] x86/imdct36: use extractps inside the STORE macro

James Almer git at videolan.org
Thu Jan 28 17:36:53 CET 2016


ffmpeg | branch: master | James Almer <jamrial at gmail.com> | Thu Jan 28 13:32:53 2016 -0300| [c79252897096b89376bcf17a5bca6cdf8d21b6a0] | committer: James Almer

x86/imdct36: use extractps inside the STORE macro

Reviewed-by: Michael Niedermayer <michael at niedermayer.cc>
Reviewed-by: Henrik Gramner <henrik at gramner.com>
Signed-off-by: James Almer <jamrial at gmail.com>

> http://git.videolan.org/gitweb.cgi/ffmpeg.git/?a=commit;h=c79252897096b89376bcf17a5bca6cdf8d21b6a0
---

 libavcodec/x86/imdct36.asm |    7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/libavcodec/x86/imdct36.asm b/libavcodec/x86/imdct36.asm
index 7218f06..409b2c5 100644
--- a/libavcodec/x86/imdct36.asm
+++ b/libavcodec/x86/imdct36.asm
@@ -143,6 +143,12 @@ SECTION .text
 %endmacro
 
 %macro STORE 4
+%if cpuflag(sse4)
+    movss     [%3       ], %1
+    extractps [%3 +   %4], %1, 1
+    extractps [%3 + 2*%4], %1, 2
+    extractps [%3 + 3*%4], %1, 3
+%else
     movhlps %2, %1
     movss   [%3       ], %1
     movss   [%3 + 2*%4], %2
@@ -150,6 +156,7 @@ SECTION .text
     movss   [%3 +   %4], %1
     movhlps %2, %1
     movss   [%3 + 3*%4], %2
+%endif
 %endmacro
 
 %macro LOAD 4



More information about the ffmpeg-cvslog mailing list