[FFmpeg-cvslog] lavc/vorbisdsp: unroll RISC-V V inverse_coupling

Rémi Denis-Courmont git at videolan.org
Sun Jul 16 23:24:18 EEST 2023


ffmpeg | branch: master | Rémi Denis-Courmont <remi at remlab.net> | Thu Jul 13 23:16:03 2023 +0300| [a28aa0475db99cc66c37c98fd5330e313817709c] | committer: Rémi Denis-Courmont

lavc/vorbisdsp: unroll RISC-V V inverse_coupling

This increases the group multiplier as per T-Head C910 benchmarks:

inverse_coupling_c: 4597.0
inverse_coupling_rvv_i32: 1312.7 (m1)
inverse_coupling_rvv_i32: 1116.7 (m2)
inverse_coupling_rvv_i32: 732.2  (m4)
inverse_coupling_rvv_i32: 898.0  (m8)

> http://git.videolan.org/gitweb.cgi/ffmpeg.git/?a=commit;h=a28aa0475db99cc66c37c98fd5330e313817709c
---

 libavcodec/riscv/vorbisdsp_rvv.S | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/libavcodec/riscv/vorbisdsp_rvv.S b/libavcodec/riscv/vorbisdsp_rvv.S
index f45e7dc2f1..81a6c62a65 100644
--- a/libavcodec/riscv/vorbisdsp_rvv.S
+++ b/libavcodec/riscv/vorbisdsp_rvv.S
@@ -23,7 +23,7 @@
 func ff_vorbis_inverse_coupling_rvv, zve32f
         fmv.w.x   ft0, zero
 1:
-        vsetvli   t0, a2, e32, m1, ta, ma
+        vsetvli   t0, a2, e32, m4, ta, ma
         vle32.v   v16, (a1)
         sub       a2, a2, t0
         vle32.v   v24, (a0)



More information about the ffmpeg-cvslog mailing list