[FFmpeg-devel] [PATCH 1/2] avcodec/s302m: enable non-PCM decoding
Andreas Rheinhardt
andreas.rheinhardt at outlook.com
Fri Jan 26 08:42:33 EET 2024
Gyan Doshi:
>
>
> On 2024-01-25 06:47 pm, Andreas Rheinhardt wrote:
>> Gyan Doshi:
>>>
>>> On 2024-01-25 10:29 am, Andreas Rheinhardt wrote:
>>>> Gyan Doshi:
>>>>> Set up framework for non-PCM decoding in-place and
>>>>> add support for Dolby-E decoding.
>>>>>
>>>>> Useful for direct transcoding of non-PCM audio in live inputs.
>>>>> ---
>>>>> configure | 1 +
>>>>> doc/decoders.texi | 40 +++
>>>>> libavcodec/s302m.c | 609
>>>>> +++++++++++++++++++++++++++++++++++++--------
>>>>> 3 files changed, 543 insertions(+), 107 deletions(-)
>>>>>
>>>>> diff --git a/configure b/configure
>>>>> index c8ae0a061d..8db3fa3f4b 100755
>>>>> --- a/configure
>>>>> +++ b/configure
>>>>> @@ -2979,6 +2979,7 @@ rv20_decoder_select="h263_decoder"
>>>>> rv20_encoder_select="h263_encoder"
>>>>> rv30_decoder_select="golomb h264pred h264qpel mpegvideodec rv34dsp"
>>>>> rv40_decoder_select="golomb h264pred h264qpel mpegvideodec rv34dsp"
>>>>> +s302m_decoder_select="dolby_e_decoder"
>>>>> screenpresso_decoder_deps="zlib"
>>>>> shorten_decoder_select="bswapdsp"
>>>>> sipr_decoder_select="lsp"
>>>>> diff --git a/doc/decoders.texi b/doc/decoders.texi
>>>>> index 293c82c2ba..9f85c876bf 100644
>>>>> --- a/doc/decoders.texi
>>>>> +++ b/doc/decoders.texi
>>>>> @@ -347,6 +347,46 @@ configuration. You need to explicitly configure
>>>>> the build with
>>>>> An FFmpeg native decoder for Opus exists, so users can decode Opus
>>>>> without this library.
>>>>> + at section s302m
>>>>> +
>>>>> +SMPTE ST 302 decoder.
>>>>> +
>>>>> +SMPTE ST 302 is a method for storing AES3 data format within an MPEG
>>>>> Transport
>>>>> +Stream. AES3 streams can contain LPCM streams of 2, 4, 6 or 8
>>>>> channels with a
>>>>> +bit depth of 16, 20 or 24-bits at a sample rate of 48 kHz.
>>>>> +They can also contain non-PCM codec streams such as AC-3 or Dolby-E.
>>>>> +
>>>> This sounds like we should add bitstream filters to extract the proper
>>>> underlying streams instead.
>>>> (I see only two problems with this approach: The BSF API needs to set
>>>> the CodecID of the output during init, but at this point no packet has
>>>> reached the BSF to determine it. And changing codec IDs mid-stream is
>>>> also not supported.)
>>> In theory, this decoder shouldn't exist, as it is just a carrier,
>>> whether of LPCM or non-PCM.
>>> FFmpeg architecture also imposes a fundamental limitation in that one
>>> s302m stream may
>>> carry multiple payload streams and we support only one decoding context
>>> per input stream
>> Then why does the demuxer not separate the data into multiple streams?
>
> I didn't add demuxing support for this codec in MPEGTS, but I can venture
>
> a) it would mean essentially inlining this decoder in the demuxer.
> b) it would lead to a mapping of 1 actual to N virtual streams. How
> would stream_ids be assigned?
> c) this codec specifies for multiple payloads but I haven't seen an
> actual sample
>
>
>>>>> +
>>>>> + ret = init_get_bits8(&gb, buf, buf_size);
>>>>> + if (ret < 0)
>>>>> + return ret;
>>>>> +
>>>>> + aes_frm_size = (s->bits + 4) * 2 / 8;
>>>>> + if (buf_size < aes_frm_size * 2) // not enough to contain
>>>>> data_type & length_code
>>>>> + return AVERROR_INVALIDDATA;
>>>>> +
>>>>> + state = get_bits64(&gb, aes_frm_size * 8);
>>>>> +
>>>>> + while (!IS_NONPCMSYNC(s->bits,state) && (get_bits_left(&gb) >=
>>>>> 8))
>>>>> + state = (state << 8) | get_bits(&gb, 8);
>>>> Reading byte-aligned data with a GetBit context is very suboptimal.
>>> What is the performance difference vs. uint8 pointers?
>> Typically worse by a constant factor.
>
> But in this scenario (of a few dozen bytes per packet if dolby_e), how
> significant is it?
>
You are only looking at the ordinary case and not fuzzing samples where
it is more than a few dozen bytes.
- Andreas
More information about the ffmpeg-devel
mailing list