原课程链接:https://www.cc.gatech.edu/~hays/compvision/
维基百科定义:https://en.wikipedia.org/wiki/Optical_flow#Methods_for_determination
百度百科定义:https://baike.baidu.com/item/%E5%85%89%E6%B5%81/7013666
Video and Motion
A video is a sequence of frames captured over time
Now our image data is a function of space (x, y) and time (t)
video is a sequence of frames.png
Sometimes, motion is the only cue
Even “impoverished” motion data can evoke a strong percept
cue from motion.png
Motion Estimation: Optical Flow
motion of cube.png motion's optical flow.pngProblem Define
How to estimate the motion of pixels from image I(x, y, t) to I(x, y, t+1)
motion of pixels.png
Key Assumption:
• color constancy:
– a point in I(x,y,t) looks the same in I(x,y,t+1)
– For grayscale images, this is brightness constancy
• small motion:
– Points do not move very far
Then we can obtain the optical flow constrains:
motion constrains.png
Brightness Constancy Constraint (equation):
brightness constancy constrain function.pngSmall Motion: (u and v are less than 1 pixel, or smooth):
Taylor series expansion of I:
Combining the two equations, we have:
equation.png equation.pngIn the limit as u and v go to zero, this becomes exact:
image.pngBrightness constancy constraint equation
image.png
How many equations and unknowns per pixel?
there is only one equation, but two unknows(u, v)
How to get more equations for a pixel?
Use spatial coherence constraint.
Spatial Coherence Constraint(equation):
Assum the pixel's neighbors have the same(u, v)
if we use a 5x5 window, then we can get 25 equations per pixel:
Least squares solution for d given by:
least squares solution.png least squares solution.pngWhen is this solvable?
condition.pngif ATA is not invertible, will render Aperture Problem
Aperture Problem Does this remind you of anything?Criteria for Harris Corner Detector!!!
baber pole illusion.png baber pole illusion.pngErrors in Lucas-Kanade
A point does not move like its neighbors
Motion segmentation
Brightness constancy does not hold
Do exhaustive neighborhood search with normalized correlation -tracking features – maybe SIFT – more later….
The motion is large (larger than a pixel)
Not-linear: Iterative refinement
Local minima: coarse-to-fine estimation
Revisiting the small motion assumption
Is this motion small enough?
Probably not—it’s much larger than one pixel
How might we solve this problem?
Coarse-to-fine optical estimation
网友评论