[FFmpeg-devel] [PATCH 4/6] avcodec/hnm4video: Optimize postprocess_current_frame()
Tomas Härdin
tjoppen at acc.umu.se
Mon Aug 5 12:58:01 EEST 2019
On Sat, 2019-08-03 at 18:57 +0200, Michael Niedermayer wrote:
> On Sat, Aug 03, 2019 at 04:07:22PM +0200, Tomas Härdin wrote:
> > On Sat, 2019-08-03 at 01:49 +0200, Michael Niedermayer wrote:
> > > - uint32_t x, y, src_x, src_y;
> > > + uint32_t x, y, src_y;
> > > + int width = hnm->width;
> > >
> > > for (y = 0; y < hnm->height; y++) {
> > > + uint8_t *dst = hnm->processed + y * width;
> > > + const uint8_t *src = hnm->current;
> > > src_y = y - (y % 2);
> > > - src_x = src_y * hnm->width + (y % 2);
> > > - for (x = 0; x < hnm->width; x++) {
> > > - hnm->processed[(y * hnm->width) + x] = hnm->current[src_x];
> > > - src_x += 2;
> > > + src += src_y * width + (y % 2);
> > > + for (x = 0; x < width; x++) {
> > > + dst[x] = *src;
> > > + src += 2;
> >
> > Looks OK. Maybe telling the compiler that src and dst don't alias
> > would be worthwhile?
>
> I can add restrict keywords if you want:
>
> diff --git a/libavcodec/hnm4video.c b/libavcodec/hnm4video.c
> index 68d0baef6d..1c2501afab 100644
> --- a/libavcodec/hnm4video.c
> +++ b/libavcodec/hnm4video.c
> @@ -121,8 +121,8 @@ static void postprocess_current_frame(AVCodecContext *avctx)
> int width = hnm->width;
>
> for (y = 0; y < hnm->height; y++) {
> - uint8_t *dst = hnm->processed + y * width;
> - const uint8_t *src = hnm->current;
> + uint8_t * restrict dst = hnm->processed + y * width;
> + const uint8_t * restrict src = hnm->current;
Does it improve performance? If not, there's little point.
/Tomas
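
For anyone who wants to measure this outside the decoder, here is a minimal stand-alone sketch of the same row copy, once without and once with restrict. It is not the hnm4video.c code: the function names and the flat buffer arguments are hypothetical, and it assumes an even height and two non-overlapping buffers of width * height bytes each.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-alone copy of the loop from the patch (no restrict).
 * Assumes: height is even, both buffers hold width * height bytes. */
void postprocess_plain(uint8_t *processed, const uint8_t *current,
                       int width, int height)
{
    assert(width > 0 && height > 0 && height % 2 == 0);
    for (int y = 0; y < height; y++) {
        uint8_t *dst = processed + y * width;
        /* Start of the even source row of the pair, offset by 1 for odd y. */
        const uint8_t *src = current + (y - (y % 2)) * width + (y % 2);
        for (int x = 0; x < width; x++) {
            dst[x] = *src;
            src += 2; /* every second byte of the two-row source span */
        }
    }
}

/* Same loop, with restrict promising the compiler that the destination
 * and source do not alias. Calling this with overlapping buffers would
 * be undefined behaviour. */
void postprocess_restrict(uint8_t * restrict processed,
                          const uint8_t * restrict current,
                          int width, int height)
{
    assert(width > 0 && height > 0 && height % 2 == 0);
    for (int y = 0; y < height; y++) {
        uint8_t * restrict dst = processed + y * width;
        const uint8_t * restrict src =
            current + (y - (y % 2)) * width + (y % 2);
        for (int x = 0; x < width; x++) {
            dst[x] = *src;
            src += 2;
        }
    }
}
```

Both variants should produce identical output. Whether the restrict one is any faster depends on the compiler and flags, so it would need timing (for example, calling each in a loop around clock()) or a look at the generated assembly before drawing a conclusion.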