[FFmpeg-devel] [PATCH] avcodec/x86/hevc: fix luma 12b overflow
James Almer
jamrial at gmail.com
Sun Feb 25 18:28:09 EET 2024
On 2/25/2024 1:22 PM, Ronald S. Bultje wrote:
> On Sun, Feb 25, 2024 at 10:56 AM Ronald S. Bultje <rsbultje at gmail.com>
> wrote:
>
>> Hi,
>>
>> On Sun, Feb 25, 2024 at 3:28 AM J. Dekker <jdek at itanimul.li> wrote:
>>
>>> Weak filter can overflow in delta0 calculation before >> 4 in int16.
>>>
>>> Signed-off-by: J. Dekker <jdek at itanimul.li>
>>> ---
>>>
>>> I do not know x86 simd at all, so this is just an attempt to fix
>>> the implementation rather than write extremely performant code.
>>>
>>> Suggestions welcome.
>>>
>>
>> https://pastebin.com/KvcbQ2nK
>>
>
> Attached a slightly adjusted version which does sse2 in 16bit also.
>
> Ronald
> diff --git a/libavcodec/x86/hevc_deblock.asm b/libavcodec/x86/hevc_deblock.asm
> index 85ee4800bb..869301caff 100644
> --- a/libavcodec/x86/hevc_deblock.asm
> +++ b/libavcodec/x86/hevc_deblock.asm
> @@ -31,6 +31,7 @@ cextern pw_1023
> pw_pixel_max_12: times 8 dw ((1 << 12)-1)
> pw_m2: times 8 dw -2
> pd_1 : times 4 dd 1
> +pd_8 : times 8 dd 8
This is unused.
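
For reference, the overflow described in the commit message can be reproduced in scalar C. This is only a rough sketch, assuming the weak-filter formula from the HEVC spec, delta0 = (9 * (q0 - p0) - 3 * (q1 - p1) + 8) >> 4. The helper names (asr4, wrap16, delta0_32, delta0_16) are hypothetical and not taken from libavcodec, and the 16-bit variant only mimics what word-lane SIMD arithmetic would do; it is not a transcription of the asm.

```c
#include <stdint.h>

/* Arithmetic shift right by 4 with floor semantics, written out so the
 * result does not depend on how the compiler shifts negative values. */
static int asr4(int x)
{
    return x >= 0 ? x >> 4 : -((-x + 15) >> 4);
}

/* Wrap a value to signed 16 bits, as a 16-bit SIMD lane would.
 * Assumes int is at least 32 bits wide. */
static int wrap16(int x)
{
    unsigned u = (unsigned)x & 0xFFFFu;
    return u >= 0x8000u ? (int)u - 0x10000 : (int)u;
}

/* delta0 with the intermediate sum kept in 32 bits. */
static int delta0_32(int p1, int p0, int q0, int q1)
{
    return asr4(9 * (q0 - p0) - 3 * (q1 - p1) + 8);
}

/* delta0 with the intermediate sum wrapped to 16 bits before the shift. */
static int delta0_16(int p1, int p0, int q0, int q1)
{
    return asr4(wrap16(9 * (q0 - p0) - 3 * (q1 - p1) + 8));
}
```

With 12-bit samples, q0 - p0 can reach 4095, so 9 * (q0 - p0) alone can reach 36855, which is already above INT16_MAX (32767). For p1 = 0, p0 = 0, q0 = 4095, q1 = 0 the two helpers disagree, which is the case the patch is meant to cover. At 10 bits the same sum stays within int16 range, so only the 12b path is affected.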