[FFmpeg-devel] [PATCH] HWAccel infrastructure (take 3)

Michael Niedermayer michaelni
Wed Feb 18 19:12:31 CET 2009


On Wed, Feb 18, 2009 at 05:12:23PM +0100, Gwenole Beauchesne wrote:
> On Wed, 18 Feb 2009, Michael Niedermayer wrote:
[...]#
>>>> i cant speak about if i would accept some uglification without really
>>>> seeing
>>>> it
>>>> also considering the special casing in mpeg1/2 it does not seem the 
>>>> split
>>>> in
>>>> 3 really avoids uglification
>>>
>>> OK, I probably could do something like the following in
>>> h263dec.c::decode_slice():
>>>
> [... ugly code snippet ...]
>>> That should be enough. Would that be OK with you (stylistically 
>>> speaking)?
>>
>> no
>
> So, do you finally like my start_slice()/end_slice() approach with 
> buf+buf_end? ;-) The way h263dec.c is currently written is my only reason I 

i like it unless i find something better ...


> went for this approach. I don't see other elegant and minimalistic solution 
> for that.
>
>>>>> Concerning buf / buf_offset, this was meant for optimization purposes 
>>>>> in
>>>>> the HWAccel implementation. That way we can reduce control blocks 
>>>>> and/or
>>>>> (it depends on the HWAccel) copy only one buffer in the end.
>>>>
>>>> if you have buf + buf_size and the buffers overlap you can do that too,
>>>> besides if you have a buf/buf_size from start_frame you could check if
>>>> the slices buffer is within that.
>>>
>>> It depends, if the hardware accelerator wants the unescaped data (e.g. 
>>> for
>>> H.264 and VC-1), then the check wouldn't be valid since the passed args
>>> would be from another (temporary) buffer. On the other hand, VA API and
>>> VDPAU are working fine with the escaped (raw bitstream) data.
>>
>> with a temporary buffer buf_offset/buf can be anything
>> equal, larger or smaller ...
>> the check against the start_frame buf/buf_size though should still work
>
> OK, we can leave the "HW accelerator wants the unescaped data" case to 
> someone else.
>
>>> buf/buf_size from start_frame would be the buf/buf_size from
>>> AVCodec::decode_frame()?
>>
>> that was my idea
>
> Is there an HW accelerator able to grok that directly? But anyway, I am not 
> against it, it's just that I see issue considering the things below.
>
>>> I really want start_frame() to be at a location
>>> where we are assured that frame info is parsed.
>>
>> should not be a problem ...
>
> It is for e.g. h264.c, reasons kept below.

you can always pass NULL/0 as buf/size if there is no single buffer ...


>
>>> For h264.c, that'd be a little problematic. I could use the same mpeg12.c
>>> trick to use s->first_slice and:
>>>
>>> NAL_SLICE: [...]
>>>             if((err = decode_slice_header(hx, h)))
>>>                break;
>>>             if(s->avctx->hwaccel && s->first_slice){
>>>                 s->first_slice=0;
>>>                 s->avctx->hwaccel->start_frame(s->avctx, buf, buf_size);
>>>             }
>>>
>>> but I might not get the whole frame buffer. Or I could place the
>>> start_frame() call where ff_vdpau_h264_set_reference_frames(s); currently
>>> is. But in that case, I might have the following AVHWAccel hooks calling
>>> sequence: [?start_slice() ]+, end_slice(), start_frame(), end_frame(),
>>> which is not really desirable.
>>>
>>
>
>>> IMO, this complicates a little the code for no real gain. I don't 
>>> actually
>>> need buf/buf_size in neither VA API nor VDPAU.
>>
>> what happens with a buf of random content and buf_size of 1
>> will it segfault? if not how do you know how much you can copy? did i miss
>> some buf_size parameter somewhere?
>
> FFmpeg always sends correct data to users, it won't segfault and it's not 
> lavc's fault if users can't do a simple subtraction. ;-)
>
> so we have start_slice() with a buf+offset and end_slice() with a buf_end, 
> so we can reconstruct the buf_size.

that should work, excpt why not send
buf+offset and buf_end in end_slice, is start still needed then?

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Old school: Use the lowest level language in which you can solve the problem
            conveniently.
New school: Use the highest level language in which the latest supercomputer
            can solve the problem without the user falling asleep waiting.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: Digital signature
URL: <http://lists.mplayerhq.hu/pipermail/ffmpeg-devel/attachments/20090218/6f546645/attachment.pgp>



More information about the ffmpeg-devel mailing list