[FFmpeg-devel] yuv420_bgr24_mmxext conversion taking significant time
adriantong1024 at gmail.com
Sat Jun 8 16:51:51 EEST 2019
On Fri, 7 Jun 2019 at 23:20, Lauri Kasanen <cand at gmx.com> wrote:
> On Fri, 7 Jun 2019 08:38:35 -0700
> Adrian Tong <adriantong1024 at gmail.com> wrote:
> > Hi
> > I have a workload which spends a significant amount of time (~10%) in
> > the yuv420_bgr24_mmxext function in FFmpeg.
> > I looked at the assembly and the profile and see that 64-bit MMX
> > registers are used. I wonder whether we can have an SSE2 version which
> > uses 128-bit registers.
> > I am very interested in implementing such support if it is possible.
> I'm not well versed in x86 vectors, so I can't say if SSE2 is enough or
> some other SSE version would be needed, but certainly YUV to RGB
> conversion can be done faster than with MMX. Please do send a patch.
> - Lauri
Thanks for the reply. Is there any reason why this has not been implemented before?
It seems to me that this would be a pretty important/hot function.
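
For checking a wider SSE2 version, a plain scalar reference of the conversion may be useful to compare output against. The following is only a minimal sketch: it assumes BT.601 limited-range input and the commonly quoted 8-bit fixed-point coefficients, and the helper names (clip_fixed8, yuv420_to_bgr24_ref) are hypothetical and not part of libswscale. The real yuv420_bgr24_mmxext code uses its own tables and rounding, so results may differ by a small amount.

```c
#include <assert.h>
#include <stdint.h>

/* Clamp a fixed-point value (8 fractional bits) to an 8-bit sample. */
static uint8_t clip_fixed8(int v)
{
    if (v < 0)
        return 0;
    v >>= 8;
    return (uint8_t)(v > 255 ? 255 : v);
}

/*
 * Hypothetical scalar reference, not libswscale code.
 * Assumes tightly packed planes (stride == width), BT.601 limited range,
 * and 4:2:0 chroma (one U and one V sample per 2x2 block of luma).
 * Output is packed BGR24: 3 bytes per pixel in B, G, R order.
 */
static void yuv420_to_bgr24_ref(const uint8_t *y, const uint8_t *u,
                                const uint8_t *v, uint8_t *dst,
                                int width, int height)
{
    assert(width > 0 && height > 0);
    int chroma_width = (width + 1) / 2;

    for (int row = 0; row < height; row++) {
        for (int col = 0; col < width; col++) {
            int chroma_idx = (row / 2) * chroma_width + col / 2;
            int c = y[row * width + col] - 16;
            int d = u[chroma_idx] - 128;
            int e = v[chroma_idx] - 128;
            uint8_t *px = dst + 3 * (row * width + col);

            px[0] = clip_fixed8(298 * c + 516 * d + 128);           /* B */
            px[1] = clip_fixed8(298 * c - 100 * d - 208 * e + 128); /* G */
            px[2] = clip_fixed8(298 * c + 409 * e + 128);           /* R */
        }
    }
}
```

Each output pixel depends on only one luma sample and one shared chroma pair, which is why the loop maps well onto vector registers: an SSE2 version could in principle handle twice as many samples per instruction as the 64-bit MMX code.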