[MPlayer-users] ssa/ass rendering uses lots of cpu with -vo gl
Reimar Döffinger
Reimar.Doeffinger at gmx.de
Sun May 1 20:11:10 CEST 2011
On Sun, May 01, 2011 at 09:46:32PM +0400, Vladimir Mosgalin wrote:
> In window:
> # Events: 1K cycles
> #
> # Overhead Command Shared Object Symbol
> # ........ ....... ............................ ............................................
> #
> 26.75% mplayer mplayer [.] deNoise
> 10.80% mplayer r300_dri.so [.] 0x525df
> 3.75% mplayer libc-2.13.so [.] __memcpy_ssse3
> 3.42% mplayer mplayer [.] ff_h264_decode_mb_cabac
> 2.33% mplayer mplayer [.] lineNoiseAvg_MMX
> 2.31% mplayer mplayer [.] decode_cabac_residual_nondc_internal
> 2.28% mplayer mplayer [.] decode_cabac_mb_mvd
> 2.17% mplayer mplayer [.] put_h264_qpel8or16_hv_lowpass_ssse3.clone.35
> 1.47% mplayer libc-2.13.so [.] _int_malloc
> 1.45% mplayer [kernel.kallsyms] [k] __list_del_entry
> 1.44% mplayer mplayer [.] get_cabac_noinline
>
> In fullscreen:
> Events: 4K cycles
> #
> # Overhead Command Shared Object Symbol
> # ........ ....... ............................ ..............................................
> #
> 14.47% mplayer libfreetype.so.6.6.0 [.] 0xfae6
> 14.40% mplayer r300_dri.so [.] 0x416be
> 11.17% mplayer [radeon] [k] radeon_dvi_detect
> 8.92% mplayer mplayer [.] deNoise
> 3.31% mplayer [drm] [k] drm_mm_init_scan
> 1.60% mplayer [kernel.kallsyms] [k] kmem_cache_alloc_trace
> 1.32% mplayer libc-2.13.so [.] __memset_sse2
> 1.22% mplayer libdrm_radeon.so.1.0.0 [.] 0x2c9a
> 1.19% mplayer [kernel.kallsyms] [k] copy_user_generic_string
> 1.12% mplayer libc-2.13.so [.] __memcpy_ssse3
> 0.88% mplayer mplayer [.] lineNoiseAvg_MMX
This doesn't look like you ran the windowed test at the fullscreen resolution.
At least that's the only explanation I have for the libfreetype CPU usage change.
You can try -vo gl:scaled-osd to avoid that change to get a clearer look on the rest.
Since we can assume that deNoise used about the same CPU for both, we can conclude
that r300_dri.so used about 4 times as much CPU as before, and the radeon kernel
module came in with about the same amount of CPU usage.
A possibilty is that it is relate to transfers, older GPUs had much, much worse
DMA support (not to mention that to my knowledge no ATI drivers, proprietary or OpenSource,
have managed to actually handle transfers well. Actually it seems to be
kind of an Achillees heel for all GPUs).
More information about the MPlayer-users
mailing list