[FFmpeg-cvslog] commit: H.264: split luma dc idct out and implement MMX/SSE2 versions ( Jason Garrett-Glaser )
git at videolan.org
git
Fri Jan 14 23:11:16 CET 2011
ffmpeg | branch: master | Jason Garrett-Glaser <darkshikari at gmail.com> | Fri Jan 14 21:34:25 2011 +0000| [19fb234e4af1ff9f58ff2fdd604ac6f6bb87ad6b] | committer: Jason Garrett-Glaser
H.264: split luma dc idct out and implement MMX/SSE2 versions
About 2.5x the speed.
NOTE: the way that the asm code handles large qmuls is a bit suboptimal.
If x264-style dequant was used (separate shift and qmul values), it might
be possible to get some extra speed.
Originally committed as revision 26336 to svn://svn.ffmpeg.org/ffmpeg/trunk
> http://git.videolan.org/gitweb.cgi/ffmpeg.git/?a=commit;h=19fb234e4af1ff9f58ff2fdd604ac6f6bb87ad6b
---
libavcodec/dsputil.h | 4 +
libavcodec/h264.c | 50 ++-----------
libavcodec/h264.h | 5 +-
libavcodec/h264_cabac.c | 8 +-
libavcodec/h264_cavlc.c | 8 +-
libavcodec/h264dsp.c | 1 +
libavcodec/h264dsp.h | 2 +
libavcodec/h264idct.c | 35 ++++++++++
libavcodec/svq3.c | 20 +++---
libavcodec/x86/dsputil_mmx.c | 1 +
libavcodec/x86/h264_idct.asm | 154 ++++++++++++++++++++++++++++++++++++++++++
libavcodec/x86/h264dsp_mmx.c | 4 +
12 files changed, 227 insertions(+), 65 deletions(-)
Diff: http://git.videolan.org/gitweb.cgi/ffmpeg.git/?a=commitdiff;h=19fb234e4af1ff9f58ff2fdd604ac6f6bb87ad6b
More information about the ffmpeg-cvslog
mailing list