Part 4: Computer Vision Algorithms for Motion
Motion Detection (Bulk Motion)
Motion detection works on the basis of frame differencing - comparing how pixels (usually blobs) change location from one frame to the next. There are two ways you can do motion detection.
The first method just looks for a bulk change in the image:
- calculate the average of a selected color in frame 1
wait X seconds
calculate the average of a selected color in frame 2
if (abs(avg_frame_1 - avg_frame_2) > threshold)
then motion detected
- calculate the middle mass in frame 1
wait X seconds
calculate the middle mass in frame 2
if (abs(mm_frame_1 - mm_frame_2) > threshold)
then motion detected
The algorithm also can't handle a rotating object - an object that moves, but whose middle mass does not change location.
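To make the two checks concrete, here is a minimal Python sketch of both using OpenCV and NumPy. The threshold values, color channel, one-second wait, and camera index are assumptions for illustration, not values from the pseudocode above.

import time
import cv2
import numpy as np

def average_color(frame, channel=1):
    # average intensity of one color channel (0=B, 1=G, 2=R in OpenCV)
    return frame[:, :, channel].mean()

def middle_mass(frame, channel=1, min_value=128):
    # center of mass of all sufficiently bright pixels in the chosen channel
    ys, xs = np.where(frame[:, :, channel] > min_value)
    if len(xs) == 0:
        return None
    return np.array([xs.mean(), ys.mean()])

cam = cv2.VideoCapture(0)        # assumed camera index
_, frame_1 = cam.read()
time.sleep(1.0)                  # wait X seconds (here, 1 second)
_, frame_2 = cam.read()

# method 1: bulk change in average color
if abs(average_color(frame_1) - average_color(frame_2)) > 5:
    print("motion detected (bulk color change)")

# method 2: the middle mass moved
mm_1, mm_2 = middle_mass(frame_1), middle_mass(frame_2)
if mm_1 is not None and mm_2 is not None and np.linalg.norm(mm_1 - mm_2) > 10:
    print("motion detected (middle mass moved)")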
Tracking
By doing motion detection with the middle mass, you can run more advanced algorithms such as tracking. Using vector math, and knowing the pixel-to-distance ratio, one may calculate the displacement, velocity, and acceleration of a moving blob.
Here is an example of how to calculate the speed of a car:
- calculate the middle mass in frame 1
wait X seconds
calculate the middle mass in frame 2
speed = abs(mm_frame_1 - mm_frame_2) * distance_per_pixel / X
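As a rough sketch of that calculation in Python, assuming the two middle-mass positions are already known in pixels (the calibration value and time interval below are made-up example numbers):

import numpy as np

mm_frame_1 = np.array([120.0, 240.0])   # centroid in frame 1 (pixels)
mm_frame_2 = np.array([180.0, 238.0])   # centroid in frame 2 (pixels)
distance_per_pixel = 0.05               # meters per pixel (assumed calibration)
X = 0.5                                 # seconds between the two frames

pixel_displacement = np.linalg.norm(mm_frame_1 - mm_frame_2)
speed = pixel_displacement * distance_per_pixel / X   # meters per second
print("estimated speed:", round(speed, 2), "m/s")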
The major issue with this algorithm is determining the distance to pixel ratio. If your camera is at an angle to the horizon (not looking overhead and pointing straight down), or your camera experiences the lens effect (all cameras do, to some extent), then you need to write a separate algorithm that maps this ratio for a given pixel located at X and Y position.
The below image shows an exaggerated lens effect, with pixels further down the trail covering a greater distance than the pixels closer to the camera.
This Mars Rover camera image is a good example of the lens effect:
Lens radial distortion can be modelled by the following equations:
- x_actual = xd * (1 + distortion_constant * (xd^2 + yd^2))
y_actual = yd * (1 + distortion_constant * (xd^2 + yd^2))
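Here is a small Python sketch of that first-order radial distortion model; the coordinates and distortion constant below are arbitrary example values, and (xd, yd) are assumed to already be measured from the image center.

def correct_radial_distortion(xd, yd, distortion_constant):
    # first-order radial distortion: a point is shifted outward (or inward)
    # in proportion to its squared distance from the image center
    r_squared = xd * xd + yd * yd
    x_actual = xd * (1.0 + distortion_constant * r_squared)
    y_actual = yd * (1.0 + distortion_constant * r_squared)
    return x_actual, y_actual

# example: a point near the image corner, with a small positive constant
print(correct_radial_distortion(200.0, 150.0, 1e-6))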
Crossover is the other major problem. This is when multiple objects cross over each other (i.e. one blob passes behind another blob) and the algorithm gets confused about which blob is which. For an example, here is a video showing the problem. Notice how the algorithm gets confused as the man goes behind the tree, or crosses over another tracked object? The algorithm must remember a decent number of features of each tracked object for crossovers to work.
(video is not ours)
Optical Flow
This computer vision method makes no attempt to identify the observed objects. It works by analyzing the bulk or individual motion of pixels. It is useful for tracking, 3D analysis, altitude measurement, and velocity measurement. This method has the advantage that it can work with low-resolution cameras, and the simpler variants require minimal processing power.
Optical flow is a vector field that shows the direction and magnitude of these intensity changes from one image to the other, as shown here:
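For readers who want to compute such a flow field in practice, here is a minimal sketch using OpenCV's dense Farneback method; the file names and parameter values are assumptions for illustration.

import cv2

# two consecutive frames (file names are placeholders)
frame_1 = cv2.imread("frame_1.png", cv2.IMREAD_GRAYSCALE)
frame_2 = cv2.imread("frame_2.png", cv2.IMREAD_GRAYSCALE)

# dense optical flow: one (dx, dy) vector per pixel
flow = cv2.calcOpticalFlowFarneback(frame_1, frame_2, None,
                                    0.5, 3, 15, 3, 5, 1.2, 0)

magnitude, angle = cv2.cartToPolar(flow[..., 0], flow[..., 1])
print("average pixel motion:", magnitude.mean(), "pixels per frame")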
Applications for Optical Flow
Altitude Measurement (for constant speed)
Ever notice when traveling by plane, the higher you are the slower the ground below you seems to move? For aerial robots that have a known constant speed, the altitude can be calculated by analyzing pixel velocity from a downward-facing camera. The slower the pixels travel, the higher the robot. A potential problem however is when your robot rotates in the air, but this can be accounted for by adding additional sensors like gyros and accelerometers.
Velocity Measurement (for constant altitude)
For a robot that is traveling at some known altitude, by analyzing pixel velocity, the robot velocity can be calculated. This is the converse of the altitude measurement method. It is impossible to gather both altitude and velocity data simultaneously using only optical flow, so a second sensor (such as GPS or an altimeter) needs to be used. If however your robot were an RC car, the altitude is already known (probably an inch above the ground). Velocity can then be calculated using optical flow with no other sensors. Optical flow can also be used to directly compute time to impact for missiles, and it is a technique insects use to gauge flight speed and direction.
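One simple way to see the altitude/velocity trade-off is the pinhole camera relation sketched below; it ignores rotation and lens distortion, and the focal length and measured values are assumptions for illustration.

# pinhole-camera approximation for a downward-facing camera:
# pixel_speed ~= focal_length_px * ground_speed / altitude
focal_length_px = 800.0   # assumed focal length, in pixels
pixel_speed = 40.0        # measured from optical flow, pixels per second

# known speed -> solve for altitude
known_speed = 10.0        # meters per second
altitude = focal_length_px * known_speed / pixel_speed

# known altitude -> solve for velocity
known_altitude = 20.0     # meters
velocity = pixel_speed * known_altitude / focal_length_px

print("altitude estimate:", altitude, "m, velocity estimate:", velocity, "m/s")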
Tracking
Please see tracking above, and background subtraction below. The optical flow method of tracking combines both of those methods together. By removing the background, all that needs to be done is analyze the motion of the moving pixels.
3D Scene Analysis
By analyzing the motion of all pixels, it is possible to generate rough 3D measurements of the observed scene. For example, in the below image of the subway train, the pixels on the far left are moving fast, and they are both converging and slowing down towards the center of the image. With this information, 3D information about the train can be calculated (including the velocity of the train and the angle of the track).
Problems with optical flow . . .
Generally, optical flow corresponds to the motion field, but not always. For example, the motion field and optical flow of a rotating barber's pole are different:
Although it is only rotating about the z-axis, optical flow will say the red bars are moving upward along the z-axis. Obviously, assumptions need to be made about the expected observed objects for this to work properly.
Accounting for multiple objects gets really complicated . . . especially if they cross each other . . .
And lastly, the equations get yet more complicated when you track not just linear motion of pixels, but rotational motion as well. With optical flow, how do you tell if the center point of this Ferris wheel is connected to the outer half?
Background Subtraction
Background subtraction is the method of removing pixels that do not move, focusing only on objects that do. The method works like this: compare each new frame against a reference background frame (or the previous frame), and keep only the pixels whose values change by more than a threshold.
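Here is a minimal frame-differencing sketch of that idea in Python with OpenCV; the file names and threshold value are assumptions.

import cv2

# a reference background and the current frame (file names are placeholders)
background = cv2.imread("background.png", cv2.IMREAD_GRAYSCALE)
frame = cv2.imread("frame.png", cv2.IMREAD_GRAYSCALE)

# pixels that changed by more than the threshold are treated as moving
difference = cv2.absdiff(frame, background)
_, moving_pixels = cv2.threshold(difference, 30, 255, cv2.THRESH_BINARY)

cv2.imwrite("foreground_mask.png", moving_pixels)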
Here is an example of a guy moving with a static background. Some pixels did not appear to change when he moved, resulting in error:
The problem with this method, as above, is that if the object stops moving, it becomes invisible. If my hand moves, but my body doesn't, all you see is a moving hand. There is also the chance that although something is moving, not all the individual pixels change color because the object is of a uniform color. To correct for this, this algorithm must be combined with other algorithms such as edge detection and blob finding, to make sure all pixels within a moving boundary aren't discarded.
There is one other form of background subtraction called blue-screening (or green-screening, or chroma-key). What you do is physically replace the background with a solid color - a big green curtain (called a chroma-key) typically works best. Then the computer replaces all pixels of that color with pixels from another scene. This technique is commonly used for weather anchor people, and is why they never wear green ties =P
This blue-screening method is more a machine vision technique, as it will not work in everyday situations - only in studios with expert lighting. Here is a video of my ERP that I made using chroma key. If you look carefully, you'll see various chroma key artifacts as I didn't put much effort into getting it perfect. I used Sony Vegas Movie Studio to make the video.
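Here is a minimal chroma-key sketch in Python with OpenCV, assuming both images are the same size; the HSV green range below is an arbitrary starting point, not a standard value.

import cv2
import numpy as np

foreground = cv2.imread("studio_shot.png")     # subject in front of a green screen
new_scene = cv2.imread("new_background.png")   # replacement background, same size

# mark every pixel that falls inside an assumed green range
hsv = cv2.cvtColor(foreground, cv2.COLOR_BGR2HSV)
green_mask = cv2.inRange(hsv, np.array([40, 80, 80]), np.array([80, 255, 255]))

# copy the new scene into the green areas, keep the subject everywhere else
composite = foreground.copy()
composite[green_mask > 0] = new_scene[green_mask > 0]
cv2.imwrite("composite.png", composite)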
Feature Tracking
A feature is a specific identified point in the image that a tracking algorithm can lock onto and follow through multiple frames. Often features are selected because they are bright/dark spots, edges, or corners - depending on the particular tracking algorithm. Template matching is also quite common. What is important is that each feature represents a specific point on the surface of a real object. As a feature is tracked, it becomes a series of two-dimensional coordinates that represent the position of the feature across a series of frames. This series is referred to as a track. Once tracks have been created, they can be used immediately for 2D motion tracking, or then be used to calculate 3D information.
(for a realplayer streaming video example of feature tracking, click the image)
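For experimentation, here is a minimal sketch of feature tracking with OpenCV: corner-like features are detected in one frame and followed into the next with pyramidal Lucas-Kanade. The file names and parameter values are assumptions.

import cv2

frame_1 = cv2.imread("frame_1.png", cv2.IMREAD_GRAYSCALE)
frame_2 = cv2.imread("frame_2.png", cv2.IMREAD_GRAYSCALE)

# pick corner-like features for the tracker to lock onto
features = cv2.goodFeaturesToTrack(frame_1, maxCorners=100,
                                   qualityLevel=0.01, minDistance=10)

# follow each feature into the next frame (pyramidal Lucas-Kanade)
new_positions, status, error = cv2.calcOpticalFlowPyrLK(frame_1, frame_2,
                                                        features, None)

# each (old, new) pair is one step of that feature's track
for old, new, ok in zip(features, new_positions, status):
    if ok[0]:
        print("feature moved from", old.ravel(), "to", new.ravel())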
Visual Servoing
Visual servoing is a method of using video data to determine position data for your robot. For example, your robot sees a door and wants to go through it. Visual servoing will allow the front of your robot to align itself with the door and pass through. If your robot wanted to pick something up, it can use visual servoing to move the arm to that location. To drive down a road, visual servoing would track the road with respect to the robot's heading.
To do visual servoing, first you need to use the vision processing methods listed in this tutorial to locate the object. Then your robot needs to decide how to orient itself to reach that location using some type of PID loop - the error being the distance between where the robot wants to be, and where it sees it is.
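As a toy example of that control loop, here is a proportional-only sketch (the P part of a PID loop) that steers so a tracked target drifts toward the image center; the gain, image width, and target position are made-up values.

IMAGE_WIDTH = 640
KP = 0.005                     # proportional gain (would be tuned by hand)

def steering_command(target_x):
    # error = horizontal distance between the tracked target and image center
    error = target_x - IMAGE_WIDTH / 2
    # sign convention: positive turns right, negative turns left
    return KP * error

# example: target detected left of center -> small left turn
print(steering_command(280))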
If you would like to learn more about robot arms for use in visual servoing, see my robot arms tutorial.
Practice What You Learned
The three images below are made from sonar capable of generating a 2D mapped field of an underwater scene with fish (for fisheries counting). Since the data is stored in a similar way to data from a camera, vision algorithms can be applied.
(scene 1, scene 2, and scene 3)
So here is your challenge:
What two different algorithms can achieve the change from scene 1 to scene 2 (hint: scene 2 only shows moving fish)?
Name the algorithm that can achieve the change from scene 2 to scene 3 (hint: color is made binary).
What algorithm allows finding the location of the fish in the scene?
If in scene two we were to identify the types of fish, what three different algorithms might work?
answers are at the bottom of this page
Downloadable Software (not affiliated with SoR)
For those interested in vision code for hacking, here is a great source for computer vision source code.