[FFmpeg-devel] [PATCH v2] add a configure flag to enabled tree-vecorization with gcc

Swinney, Jonathan jswinney at amazon.com
Mon Aug 8 18:25:36 EEST 2022


Recent version of gcc improve the automatic vectorization. This flag
allows adventurous users to enable vectorization. Known problems with
this are primarily related to inline assembly for x86 and so to address
those, add a pragma to explicitly disable automatic vectorization for
those files.

Signed-off-by: Jonathan Swinney <jswinney at amazon.com>

--

Thank you considering this patch. I believe this addresses the primary
concerns that were raised by my previous submission. There may be more
files which require the pragma add `-fno-tree-vectorize`, and I welcome
suggestions. This should strike a compromise, allowing some users to
enable vectorization while not breaking mainstream builds. This should
give time to work out additional problems if they arise before enabling
vectorization more broadly.

---
 configure              | 7 ++++++-
 libavcodec/x86/cabac.h | 4 ++++
 2 files changed, 10 insertions(+), 1 deletion(-)

diff --git a/configure b/configure
index cbbb4dd9c8..8e842da1b8 100755
--- a/configure
+++ b/configure
@@ -110,6 +110,7 @@ Configuration options:
   --disable-swscale-alpha  disable alpha channel support in swscale
   --disable-all            disable building components, libraries and programs
   --disable-autodetect     disable automatically detected external libraries [no]
+  --enable-auto-vectorization enable compiler auto vectorization
 
 Program options:
   --disable-programs       do not build command line programs
@@ -1945,6 +1946,7 @@ FEATURE_LIST="
     small
     static
     swscale_alpha
+    auto_vectorization
 "
 
 # this list should be kept in linking order
@@ -7176,7 +7178,9 @@ if enabled icc; then
             disable aligned_stack
     fi
 elif enabled gcc; then
-    check_optflags -fno-tree-vectorize
+    if disabled auto_vectorization; then
+        check_optflags -fno-tree-vectorize
+    fi
     check_cflags -Werror=format-security
     check_cflags -Werror=implicit-function-declaration
     check_cflags -Werror=missing-prototypes
@@ -7569,6 +7573,7 @@ echo "pod2man enabled           ${pod2man-no}"
 echo "makeinfo enabled          ${makeinfo-no}"
 echo "makeinfo supports HTML    ${makeinfo_html-no}"
 echo "xmllint enabled           ${xmllint-no}"
+echo "auto-vectorization        ${auto_vectorization-no}"
 test -n "$random_seed" &&
     echo "random seed               ${random_seed}"
 echo
diff --git a/libavcodec/x86/cabac.h b/libavcodec/x86/cabac.h
index b046a56a6b..782e4cbda4 100644
--- a/libavcodec/x86/cabac.h
+++ b/libavcodec/x86/cabac.h
@@ -39,6 +39,10 @@
 
 #if HAVE_INLINE_ASM
 
+#ifdef __GNUC__
+    __attribute__((optimize("-fno-tree-vectorize")))
+#endif
+
 #ifndef UNCHECKED_BITSTREAM_READER
 #define UNCHECKED_BITSTREAM_READER !CONFIG_SAFE_BITSTREAM_READER
 #endif
-- 
2.37.1



More information about the ffmpeg-devel mailing list