[FFmpeg-devel] [PATCH 5/6] x86: lossless audio: SSE4 madd 32bits

Christophe Gisquet christophe.gisquet at gmail.com
Wed Apr 20 08:43:14 CEST 2016


Hi,

2016-04-20 2:01 GMT+02:00 Ronald S. Bultje <rsbultje at gmail.com>:
> This is typically only an issue if the data came from stack. On win64 as
> well as unix64, the 4th argument never comes from stack but is a direct
> register argument instead.

So no benefit except consistency. I don't mind either way, though.

On the other hand, this hand-coded function improves is only a slight
improvement over gcc's vectorized code, and only because it does a
poor job of it. Probably because the order is small (8) and gcc does
not have enough info on data. So, it's written, but it's not very
beneficial.

-- 
Christophe


More information about the ffmpeg-devel mailing list