[FFmpeg-devel] Fix VP3 IDCT on Win64
Måns Rullgård
mans
Thu Aug 26 20:56:15 CEST 2010
Reimar D?ffinger <Reimar.Doeffinger at gmx.de> writes:
> On Thu, Aug 26, 2010 at 11:05:30AM +0000, Loren Merritt wrote:
>> On Thu, 26 Aug 2010, Reimar D?ffinger wrote:
>> >On Wed, Aug 25, 2010 at 08:43:25PM -0400, Ronald S. Bultje wrote:
>> >>
>> >>Those will stay inline of course. If an issue arises where we really
>> >>need multiple (>6) XMM registers in inline functions (which I can
>> >>honestly not imagine), then we'll think about a solution then and
>> >>there.
>> >
>> >The solution is easy: only add the clobbers for compilers where they
>> >are supported (I assume this was the issue on Win32/BSD? You never
>> >said _what_ the problem was). This can be tested in configure.
>> >And you'll have to specify the clobbers for inline functions even
>> >for a single XMM register and even for Linux, it's just unreasonable
>> >to hope that the compiler will never place some float stuff in a
>> >bad location, particularly with global optimization enabled.
>>
>> Do you plan to add an emms at the end of every mmx function?
>
> I think you should have no problem to come up with reasons why
> this is not comparable.
> But just in case
> - "fixing" emms usage necessarily has a performance impact,
> correct clobbers should not
> - on most recent CPUs and on x86 in general, --disable-mmx
> should "fix" the emms issue without too much of a performance
> issue by just using SSE
--disable-mmx also disables SSE. I don't dare have an opinion on the
ridiculousness of that.
--
M?ns Rullg?rd
mans at mansr.com
More information about the ffmpeg-devel
mailing list