[FFmpeg-user] Continue audio even when RTSP video stutters?

Steven Kan steven at kan.org
Mon Mar 17 23:08:06 EET 2025


Hi! For the last 7 years I've been streaming RTSP from an IP camera to one of my 4 YouTube channels:

https://www.youtube.com/@IAmTheWaterbug/live

I am adding a series of royalty-free MP3 to the stream because the camera lacks audio, but YouTube Stream Now requires an audio channel, and this was better (IMNSHO) than adding an empty audio track. I also like the music :-). This has worked very well for the last 7 years. 

About 6 months ago the camera started malfunctioning, and the RTSP stream now stutters every few seconds. It's definitely a problem with the stream from this particular camera because my other 3 streams from different IP cameras, on the same network, and from the same streaming computer, are working fine at minimal CPU load. 

If I paste the URL for the problem camera into VLC, the video stutters just like the YT stream. ffmpeg reports:

vt decoder cb: output image buffer is null

repeatedly (see command string, below). 

As you can see on the stream, the video stuttering is annoying, but not crippling, but the audio interruptions are intolerable. 

I am still looking to replace the camera, but while I'm looking, is there a way to modify my processing chain so that, when the RTSP video stutters, the audio does _not_ pause? The audio is being sourced directly from MP3s on the local HD of the computer running ffmpeg, so there's no issue with that data being accessible in a timely manner. When the video stream stutters I'd like it to continue doing what it does now, e.g. just display the last available frame until a new one comes along. 

If I output to DeleteMe.flv, then the FLV video is a mess, while the audio continues to play normally. If I open that FLV in VLC the video will just stall completely. If I open it in IINA I can skip around in the timeline, and it will play a few frames, but it won't play through:

https://www.kan.org/download/DeleteMe.flv

I don't care about latency, within reason, because it's already at least 10 seconds behind real-time, based on the timestamp. 

Thanks!

./ffmpeg1 -thread_queue_size 2048 -hwaccel videotoolbox -i "rtsp://anonymous:password@192.168.1.11:554" -f concat -safe 0 -i playlist.txt -vf drawtext="fontfile=/Users/steven/Library/Fonts/Nouveau_IBM.ttf: text='%{localtime\:%Y-%m-%d\ %H\\\\\:%M\\\\\:%S}': fontcolor=white: fontsize=66: x=(w-text_w)*0.01: y=(h-text_h)*0.08" -vcodec h264_videotoolbox -b:v 5000k -acodec copy -t 00:1:00 -f flv DeleteMe.flv 
ffmpeg version N-109676-g40512dbd96-tessus  https://evermeet.cx/ffmpeg/  Copyright (c) 2000-2023 the FFmpeg developers
  built with Apple clang version 11.0.0 (clang-1100.0.33.17)
  configuration: --cc=/usr/bin/clang --prefix=/opt/ffmpeg --extra-version=tessus --enable-avisynth --enable-fontconfig --enable-gpl --enable-libaom --enable-libass --enable-libbluray --enable-libdav1d --enable-libfreetype --enable-libgsm --enable-libmodplug --enable-libmp3lame --enable-libmysofa --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenh264 --enable-libopenjpeg --enable-libopus --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvmaf --enable-libvo-amrwbenc --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxavs --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-version3 --pkg-config-flags=--static --disable-ffplay
  libavutil      57. 44.100 / 57. 44.100
  libavcodec     59. 57.100 / 59. 57.100
  libavformat    59. 36.100 / 59. 36.100
  libavdevice    59.  8.101 / 59.  8.101
  libavfilter     8. 54.100 /  8. 54.100
  libswscale      6.  8.112 /  6.  8.112
  libswresample   4.  9.100 /  4.  9.100
  libpostproc    56.  7.100 / 56.  7.100
Input #0, rtsp, from 'rtsp://anonymous:password@192.168.1.11:554':
  Metadata:
    title           : Session streamed by "preview"
    comment         : 
  Duration: N/A, start: 0.000000, bitrate: N/A
  Stream #0:0: Video: h264 (High), yuv420p(progressive), 3072x1728, 20 fps, 20 tbr, 90k tbn
  Stream #0:1: Audio: aac (LC), 16000 Hz, mono, fltp
[mp3 @ 0x7fc0f9804c80] Estimating duration from bitrate, this may be inaccurate
Input #1, concat, from 'playlist.txt':
  Duration: N/A, start: 0.000000, bitrate: 320 kb/s
  Stream #1:0: Audio: mp3, 44100 Hz, stereo, fltp, 320 kb/s
Stream mapping:
  Stream #0:0 -> #0:0 (h264 (native) -> h264 (h264_videotoolbox))
  Stream #1:0 -> #0:1 (copy)
Press [q] to stop, [?] for help
Output #0, flv, to 'DeleteMe.flv':
  Metadata:
    title           : Session streamed by "preview"
    comment         : 
    encoder         : Lavf59.36.100
  Stream #0:0: Video: h264 ([7][0][0][0] / 0x0007), nv12(progressive), 3072x1728, q=2-31, 5000 kb/s, 20 fps, 1k tbn
    Metadata:
      encoder         : Lavc59.57.100 h264_videotoolbox
  Stream #0:1: Audio: mp3 ([2][0][0][0] / 0x0002), 44100 Hz, stereo, fltp, 320 kb/s
[h264_videotoolbox @ 0x7fc0f8f0e5c0] Color range not set for nv12. Using MPEG range.
[h264 @ 0x7fc0f8f10ac0] vt decoder cb: output image buffer is null: -12909kbits/s speed=0.44x     
[h264 @ 0x7fc0f8f1b680] hardware accelerator failed to decode picture
[h264 @ 0x7fc0f8f10ac0] vt decoder cb: output image buffer is null: -12909kbits/s speed=0.414x    
[h264 @ 0x7fc0f8f26040] hardware accelerator failed to decode picture
Error while decoding stream #0:0: Unknown error occurred.29 bitrate=5329.5kbits/s speed=0.41x    
    Last message repeated 1 times
[h264 @ 0x7fc0f8f10ac0] vt decoder cb: output image buffer is null: -12909kbits/s speed=0.742x    
[h264 @ 0x7fc0e90043c0] hardware accelerator failed to decode picture
Error while decoding stream #0:0: Unknown error occurred.84 bitrate=4727.5kbits/s speed=0.729x    
frame=  315 fps=7.9 q=-0.0 size=   19712kB time=00:00:39.96 bitrate=4040.4kbits/s speed=0.996x    


More information about the ffmpeg-user mailing list