[FFmpeg-devel] [PATCH 2/6] aacenc: Improve Intensity Stereo phase detection
Rostislav Pehlivanov
atomnuker at gmail.com
Wed Aug 5 03:00:30 CEST 2015
>L34/R34
Calculating the 3/4 power of the coefficients isn't really that expensive
to do twice, so I'll just scrap the whole idea.
>identation
How'd I miss that? Fixed.
>phase
Copied that from an older revision, missed that '*' sign and didn't notice,
fixed.
Will wait a while to see whether the other commits look good before sending
the fixed one.
On 4 August 2015 at 08:31, Claudio Freire <klaussfreire at gmail.com> wrote:
> On Wed, Jul 29, 2015 at 1:44 AM, Rostislav Pehlivanov
> <atomnuker at gmail.com> wrote:
> > + if (cpe->ms_mode)
> > + phase = 1 - 2 * cpe->ms_mask[w*16+g];
>
>
> Shouldn't it be ?
>
> phase *= 1 - ... ;
>
> phase is an argument, the original code would step on it, with a value
> that doesn't depend on phase, so it would fail to evaluate both
> phases. Using phase *= would make sure to test both phases.
>
> Well, that's the general idea, except it breaks the phase assigned to
> the struct. Something like the following does work though:
>
> ephase = phase;
> if (cpe->ms_mode)
> ephase *= 1 - 2 * cpe->ms_mask[w*16+g];
>
> and then change all uses of phase into ephase, except the last that
> remains:
>
> is_error.phase = phase; // original phase
> is_error.pass = dist2 <= dist1;
>
>
> On Wed, Jul 29, 2015 at 1:44 AM, Rostislav Pehlivanov
> <atomnuker at gmail.com> wrote:
> > for (w = 0; w < 128; w++)
> > if (sce1->band_type[w] >= INTENSITY_BT2)
> > sce1->band_type[w] = 0;
> >
> > - if (!cpe->common_window)
> > - return;
> > - for (w = 0; w < sce0->ics.num_windows; w += sce0->ics.group_len[w])
> {
> > - start = 0;
> > - for (g = 0; g < sce0->ics.num_swb; g++) {
> > - if (start*freq_mult > INT_STEREO_LOW_LIMIT*(lambda/170.0f)
> &&
> > - cpe->ch[0].band_type[w*16+g] != NOISE_BT &&
> !cpe->ch[0].zeroes[w*16+g] &&
> > - cpe->ch[1].band_type[w*16+g] != NOISE_BT &&
> !cpe->ch[1].zeroes[w*16+g]) {
> > - int phase = 0;
> > - float ener0 = 0.0f, ener1 = 0.0f, ener01 = 0.0f;
> > - float dist1 = 0.0f, dist2 = 0.0f;
> > + if (!cpe->common_window)
> > + return;
> > + for (w = 0; w < sce0->ics.num_windows; w +=
> sce0->ics.group_len[w]) {
> > + start = 0;
> > + for (g = 0; g < sce0->ics.num_swb; g++) {
> > + if (start*freq_mult >
> INT_STEREO_LOW_LIMIT*(s->lambda/170.0f) &&
>
> This looks strange. As it is that patch, it ends up with code like:
>
> > for (w = 0; w < 128; w++)
> > if (sce1->band_type[w] >= INTENSITY_BT2)
> > sce1->band_type[w] = 0;
> >
> > if (!cpe->common_window)
> > return;
> > for (w = 0; w < sce0->ics.num_windows; w +=
> sce0->ics.group_len[w]) {
> > start = 0;
> > for (g = 0; g < sce0->ics.num_swb; g++) {
>
> Which looks wrong. Bad indentation right?
>
> I think you meant:
>
> for (w = 0; w < 128; w++)
> if (sce1->band_type[w] >= INTENSITY_BT2)
> sce1->band_type[w] = 0;
>
> if (!cpe->common_window)
> return;
> for (w = 0; w < sce0->ics.num_windows; w += sce0->ics.group_len[w]) {
> start = 0;
> for (g = 0; g < sce0->ics.num_swb; g++) {
>
> A big part of the diff in that hunk is reindent, so I believe if you
> fix that indentation snafu the patch will shrink.
>
> On Wed, Jul 29, 2015 at 1:44 AM, Rostislav Pehlivanov
> <atomnuker at gmail.com> wrote:
> > for (w2 = 0; w2 < sce0->ics.group_len[w]; w2++) {
> > + abs_pow34_v(L34, sce0->coeffs+start+(w+w2)*128,
> sce0->ics.swb_sizes[g]);
> > + abs_pow34_v(R34, sce1->coeffs+start+(w+w2)*128,
> sce0->ics.swb_sizes[g]);
> > for (i = 0; i < sce0->ics.swb_sizes[g]; i++) {
> > float coef0 = sce0->pcoeffs[start+(w+w2)*128+i];
> > float coef1 = sce1->pcoeffs[start+(w+w2)*128+i];
> > - phase += coef0*coef1 >= 0.0f ? 1 : -1;
> > ener0 += coef0*coef0;
> > ener1 += coef1*coef1;
> > ener01 += (coef0 + coef1)*(coef0 + coef1);
> > }
> > }
>
> Careful, you're stepping on L34 and R34 on eight_short_window, and
> passing the last results only to calc_encoding_err_is.
>
> In fact, I'm thinking I may have induced you to make that mistake when
> I suggested not to compute R34 / L34 twice (once for each phase),
> since L34 and R34 only have room for one window, and
> calc_encoding_err_is needs to process a whole window group.
>
> I think you'll have to move it back to inside calc_encoing_err_is and
> just compute it twice. Redundant, but at least it's correct.
>
> Also, you should use pcoeffs (coeffs will have M/S applied to it when
> ms_mask).
>
More information about the ffmpeg-devel
mailing list