Prevent repeating frame indices #1265

JoeSchiff · 2024-01-21T15:24:20Z

Fix for Issue #1158.

Problem

stream.thread_type.FRAME threading causes an incorrect series of repeating frame indices at the end of most videos. This is because a list of frames is returned by CodecContext._send_packet_and_recv . frame.index corresponds to the most recent frame received from the decoder. Therefore all frames in the list are assigned the same index as the final frame in the list.

Solution

I solved this issue by detecting whenever multiple frames are returned and then correcting the frame.index based on the frame's position in the list.

Other changes

AVCodecContext::frame_number is deprecated. I switched to AVCodecContext::frame_num.
EDIT: The deprecation was issued in ffmpeg 6.0, so I reverted back to using AVCodecContext::frame_number.

Testing

The test iterations may be overkill, but it only increased the test run time from 6 seconds to 6.5 seconds on my machine.

Future considerations

It is important to note that frame.index is affected by CodecContext.skip_frame. Skipped frames will not increment the frame index. I don't know if this is intended or desired behavior. Perhaps we should consider two different frame properties.
1: A property for the number of frames returned by the decoder (this is the current behavior).
2: A property for the frame index relative to the total number of frames in the video. I assume this can be determined by a timestamp?

WyattBlue · 2024-01-26T08:20:48Z

I would like there to be performance benchmarks ran on decode. Eyeballing the code, it looks this costs more cpu time but the PR is still good if the performance difference isn't measurable.

I'm curious why you choose to extract the logic of _correct_frame_indices to it's own function instead of inlining it in decode. This again feels like a performance thing but I would like measurable results. Actually, since _setup_decoded_frame is only one line now, that could be inlined as well.

jlaine · 2024-01-26T15:55:32Z

I'm very doubtful about this PR, do we even know why we have this .index attribute?

As far as I can tell:

This is not part of FFmpeg's AVFrame
It's not in the pyav documentation at all.

Unless it's actually used somewhere, another approach would be to remove the attribute completely.

JoeSchiff · 2024-01-26T23:20:16Z

Performance Benchmarks

@WyattBlue I ran some rudimentary performance benchmarks using the following code:

import av
from av.datasets import fate as fate_suite
import time

PATHS = (fate_suite("h264/bbc2.sample.h264"), fate_suite("h264/interlaced_crop.mp4"), av.datasets.curated("pexels/time-lapse-video-of-night-sky-857195.mp4"))

def test_decode_frame_indices():
    for path in PATHS:
        for thread_type in ("NONE", "AUTO"):
            for thread_count in range(4):
                for skip_type in ("NONE", "NONKEY"):
                    frame_list = []
                    compare_list = []
                    frame_count = 0
                    #print('\n', path, thread_type, thread_count, skip_type)
                    with av.open(path) as container:
                        stream = container.streams.video[0]
                        stream.thread_type = thread_type
                        stream.thread_count = thread_count
                        stream.codec_context.skip_frame = skip_type
                        for packet in container.demux(stream):
                            for frame in packet.decode():
                                frame_list.append(frame.index)
                                compare_list.append(frame_count)
                                frame_count += 1
                    print(frame_list == compare_list)

start_time = time.perf_counter()
test_decode_frame_indices()
print(time.perf_counter() - start_time)

I did 3 runs on each branch and got the following average durations:

main: 5.705269041995052 seconds
fix_frame_indices: 5.665727951010922 seconds

If you were referring to more sophisticated benchmarking, I'll need some help to get started.

Function vs Inline

The only reason I made a function was to make it slightly more organized. If you prefer that I change to inline, that's perfectly fine with me.

JoeSchiff · 2024-01-27T01:28:03Z

AVFrame

@jlaine I looked through AVFrame Struct Reference and found display_picture_number, which looked promising. However, when I exposed it in pyav it always returned 0.
It is also deprecated in ffmpeg 6.0.

This issue seems to also conclude that display_picture_number is not trustworthy.

Docs

frame.index is found in the pyav.org docs here and here.

jlaine · 2024-01-27T08:16:22Z

Thanks for the documentation pointers. It seems a bit weird that this member is mentioned in examples, but there isn't event a docstring clarifying the fact it's an index generated by pyav, and that it has no influence on encoding.

If the point is purely to have a running index, what is gained over doing:

for (index, frame) in enumerate(container.decode(video=0)):
   ...

JoeSchiff · 2024-01-28T16:24:36Z

@jlaine Yeah, that's a good point. However, since frame.index is on the pyav.org homepage, I'm sure many people expect it to be available. I would really like to provide a replacement before removing it.

It seems these problems arise from the fact that frame.index is not an actual index. enumerate() and frame.index work correctly only when iterating every frame. They will break if using CodecContext.skip_frame or container.seek.

How about this solution:

The current functionality of "number of frames decoded so far" is moved from frame.index to a new property called CodecContext.frame_num. That way it matches ffmpeg and the original functionality is still available to users.
Create frame.guessed_index which is calculated from frame.pts * stream.guessed_rate * frame.time_base or similar (I still struggle with the intricacies of time base).

The problem is frame.pts, stream.guessed_rate, and frame.time_base are not always available in all media. Example: fate_suite("h264/bbc2.sample.h264")

If frame.guessed_index isn't available then the original intent of this PR can be implemented by the user like this:

for packet in container.demux(stream):
    frame_list = packet.decode()
    correction = len(frame_list)
    for frame in frame_list:
        corrected_index = stream.codec_context.frame_num - correction
        correction -= 1
        print(f"{corrected_index=}")

Ideally we would be able to get a frame's index from ffmpeg, but I can't find a way to do it at the moment. I think this new solution solves many problems:

The original functionality of frame.index remains (elsewhere).
The repeating frame indices from #1158 can be solved.
Pyav matches ffmpeg's structure and property names.
Provide a way to get the frame index even when CodecContext.skip_frame = "NONKEY".
The name frame.guessed_index makes it clear that the value is not guaranteed to be accurate.

WyattBlue · 2024-02-16T03:35:11Z

I agree with jlaine. We should deprecate and remove frame.index. There is no such concept in ffmpeg. Putting in a guessed_index would only further complicate an already complex project.

JoeSchiff force-pushed the fix_frame_indices branch from df6be33 to 4c45957 Compare January 21, 2024 15:52

Prevent repeating frame indices (fixes: PyAV-Org#1158)

83e20c3

JoeSchiff force-pushed the fix_frame_indices branch from 4c45957 to 83e20c3 Compare January 22, 2024 01:08

JoeSchiff changed the title ~~prevent repeating frame indices (fixes: #1158)~~ Prevent repeating frame indices Jan 22, 2024

WyattBlue closed this Feb 16, 2024

JoeSchiff deleted the fix_frame_indices branch January 6, 2025 00:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prevent repeating frame indices #1265

Prevent repeating frame indices #1265

JoeSchiff commented Jan 21, 2024 •

edited

Loading

WyattBlue commented Jan 26, 2024

jlaine commented Jan 26, 2024 •

edited

Loading

JoeSchiff commented Jan 26, 2024

JoeSchiff commented Jan 27, 2024

jlaine commented Jan 27, 2024 •

edited

Loading

JoeSchiff commented Jan 28, 2024

WyattBlue commented Feb 16, 2024

Prevent repeating frame indices #1265

Prevent repeating frame indices #1265

Conversation

JoeSchiff commented Jan 21, 2024 • edited Loading

Problem

Solution

Other changes

Testing

Future considerations

WyattBlue commented Jan 26, 2024

jlaine commented Jan 26, 2024 • edited Loading

JoeSchiff commented Jan 26, 2024

Performance Benchmarks

Function vs Inline

JoeSchiff commented Jan 27, 2024

AVFrame

Docs

jlaine commented Jan 27, 2024 • edited Loading

JoeSchiff commented Jan 28, 2024

WyattBlue commented Feb 16, 2024

JoeSchiff commented Jan 21, 2024 •

edited

Loading

jlaine commented Jan 26, 2024 •

edited

Loading

jlaine commented Jan 27, 2024 •

edited

Loading