You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Feeding 2 images of resolution 240x320 is generating a .npy array of shape (320, 240, 2) containing positive and negative values. I want to feed these as features for an action modeling problem. I want to get a better idea of what these values actually mean.
Is it the distance moved by that particular pixel in the next frame? or coordinate?
Would really appreciate if someone could give me clarity.
Thanks, in advance.
The text was updated successfully, but these errors were encountered:
I believe that the HXW dimensions are preserved (and not transposed).
For each pixel, optical flow is parameterized as follows [1]: one channel to specify the orientation (direction of motion in pixel space) and the other for the magnitude of motion.
Feeding 2 images of resolution 240x320 is generating a .npy array of shape (320, 240, 2) containing positive and negative values. I want to feed these as features for an action modeling problem. I want to get a better idea of what these values actually mean.
Is it the distance moved by that particular pixel in the next frame? or coordinate?
Would really appreciate if someone could give me clarity.
Thanks, in advance.
The text was updated successfully, but these errors were encountered: