[FFmpeg-cvslog] x86inc: Extend FMA_INSTR functionality

James Almer git at videolan.org
Thu Feb 13 22:19:44 CET 2014


ffmpeg | branch: master | James Almer <jamrial at gmail.com> | Sat Feb  8 02:54:50 2014 -0300| [23a8c63452009df21b3f184936b343593d4ccb04] | committer: Michael Niedermayer

x86inc: Extend FMA_INSTR functionality

Support the cases where the first and last operand of
the XOP instruction are the same.

Also add vpmacsdql emulation.

Signed-off-by: James Almer <jamrial at gmail.com>
Signed-off-by: Michael Niedermayer <michaelni at gmx.at>

> http://git.videolan.org/gitweb.cgi/ffmpeg.git/?a=commit;h=23a8c63452009df21b3f184936b343593d4ccb04
---

 libavutil/x86/x86inc.asm |    4 ++++
 1 file changed, 4 insertions(+)

diff --git a/libavutil/x86/x86inc.asm b/libavutil/x86/x86inc.asm
index c455367..88cae0c 100644
--- a/libavutil/x86/x86inc.asm
+++ b/libavutil/x86/x86inc.asm
@@ -1411,6 +1411,9 @@ AVX_INSTR pfmul, 1, 0, 1
     %macro %1 4-7 %1, %2, %3
         %if cpuflag(xop)
             v%5 %1, %2, %3, %4
+        %elifidn %1, %4
+            %6 %2, %3
+            %7 %1, %2
         %else
             %6 %1, %2, %3
             %7 %1, %4
@@ -1420,6 +1423,7 @@ AVX_INSTR pfmul, 1, 0, 1
 
 FMA_INSTR  pmacsdd,  pmulld, paddd
 FMA_INSTR  pmacsww,  pmullw, paddw
+FMA_INSTR pmacsdql,  pmuldq, paddq
 FMA_INSTR pmadcswd, pmaddwd, paddd
 
 ; tzcnt is equivalent to "rep bsf" and is backwards-compatible with bsf.



More information about the ffmpeg-cvslog mailing list