(I'm eventually going to successfully post this!) Mplayer/mencoder use mmx/sse/etc. ... how about doing something similar with gpu's? (I imagine this would require cooperation with a special cuda kernel on the gpu.)