[FFmpeg-devel] [PATCH 5/6] x86: lossless audio: SSE4 madd 32bits
Christophe Gisquet
christophe.gisquet at gmail.com
Wed Apr 20 08:43:14 CEST 2016
Hi,
2016-04-20 2:01 GMT+02:00 Ronald S. Bultje <rsbultje at gmail.com>:
> This is typically only an issue if the data came from stack. On win64 as
> well as unix64, the 4th argument never comes from stack but is a direct
> register argument instead.
So no benefit except consistency. I don't mind either way, though.
On the other hand, this hand-coded function is only a slight
improvement over gcc's vectorized code, and only because gcc does a
poor job of it, probably because the order is small (8) and gcc does
not have enough information about the data. So, it's written, but it's
not very beneficial.
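
For illustration, here is a minimal scalar sketch of the kind of loop
being discussed. It is not the code from the patch: the name
madd32_sketch, the exact semantics (dot product of v1 and v2, then
v1 += mul * v3) and the "order is a multiple of 8" precondition are
assumptions made for the example. The restrict qualifiers show the
sort of aliasing information a compiler does not get from plain
pointer arguments.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical sketch, not the FFmpeg implementation.
 * Returns sum(v1[i] * v2[i]) with 32-bit wraparound, and updates
 * v1[i] += mul * v3[i] in the same pass.
 * 'restrict' promises the three buffers do not overlap, which is one
 * piece of information an auto-vectorizer otherwise has to guess at
 * or check at run time. */
int32_t madd32_sketch(int16_t *restrict v1, const int32_t *restrict v2,
                      const int16_t *restrict v3, int order, int mul)
{
    uint32_t res = 0; /* unsigned so that overflow wraps instead of being UB */

    /* Assumed precondition: small positive order, multiple of 8. */
    assert(order > 0 && order % 8 == 0);

    for (int i = 0; i < order; i++) {
        /* Multiply-accumulate using the old value of v1[i]. */
        res += (uint32_t)v1[i] * (uint32_t)v2[i];
        /* Then adapt v1[i]; the result is truncated back to 16 bits. */
        v1[i] = (int16_t)(v1[i] + mul * v3[i]);
    }
    return (int32_t)res;
}
```

With an order as small as 8, the loop body runs only a handful of
vector iterations, so any setup or runtime checks the compiler adds
weigh heavily on the total cost.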
--
Christophe