Contour Tracking

Deformable Contour Tracking using PF-MT (Particle Filter with Mode Tracker)
(collaborators: Anthony Yezzi, Yogesh Rathi and Allen Tannenbaum at Georgia Tech)

Papers

N. Vaswani, Y. Rathi, A. Yezzi, A. Tannenbaum, PF-MT with an Interpolation Effective Basis for Tracking Local Contour Deformations, Accepted (with mandatory minor revisions) to IEEE Trans. Image Processing, 2008.
Y. Rathi, N. Vaswani, A. Tannenbaum, A. Yezzi, Tracking Deforming Objects using Particle Filtering for Geometric Active Contours, IEEE Trans. on Pattern Analysis and Machine Intelligence (PAMI), August 2007
Y. Rathi, N. Vaswani , A. Tannenbaum, A. Yezzi, Particle Filtering for Geometric Active Contours and Application to Tracking Deforming Objects, IEEE Intl. Conference on Computer Vision and Pattern Recognition (CVPR), 2005, oral

Short Abstract:
We propose algorithms for tracking the boundary contour of a deforming object from an image sequence when the non-affine (local) deformation over consecutive frames is large and there is overlapping clutter, occlusion, low contrast or outlier imagery. When the object is arbitrarily deforming, each contour point can move independently. Contour deformation then forms an infinite-dimensional (in practice, very large) space. Direct application of particle filters (PF) to large-dimensional problems is impractically expensive. But in most real problems, at any given time, “most of the contour deformation” occurs in a small number of dimensions (“effective basis”) while the residual deformation in the rest of the state space (“residual space”) is “small” (not zero). In other words, the “Large Dimensional State Space” (LDSS) property holds, and thus we can apply the PF with Mode Tracker (PF-MT) algorithm that was proposed in recent work. Since most contour deformation is of low spatial frequency, we propose to use the space of deformation at a “subsampled” set of locations as the effective basis space. The resulting algorithm is called Deform PF-MT. It requires significant modifications compared to the original PF-MT because the space of contours is a non-Euclidean, infinite-dimensional space. We also discuss when and how to change the effective basis.
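One common way to see why a direct PF becomes expensive in large dimensions is to look at the effective sample size, 1 / sum(w_i^2), of the normalized importance weights. The sketch below is only a toy illustration and is not taken from the papers listed above: the standard normal prior, the observation at the origin, the noise level obs_std and the function names are all assumptions made for this example.

```python
import numpy as np

def effective_sample_size(log_w):
    """Return 1 / sum(w_i^2) for the normalized weights exp(log_w)."""
    w = np.exp(log_w - np.max(log_w))  # subtract the max for numerical stability
    w /= w.sum()
    return 1.0 / np.sum(w ** 2)

def ess_for_dimension(dim, n_particles=1000, obs_std=0.5, seed=0):
    """Effective sample size when the prior is used as the proposal (toy model)."""
    rng = np.random.default_rng(seed)
    # Assumed toy prior: standard normal in `dim` dimensions.
    particles = rng.standard_normal((n_particles, dim))
    # Assumed toy observation: the origin, seen through Gaussian noise.
    y = np.zeros(dim)
    log_w = -0.5 * np.sum((particles - y) ** 2, axis=1) / obs_std ** 2
    return effective_sample_size(log_w)

if __name__ == "__main__":
    for d in (1, 5, 20, 50):
        print(d, ess_for_dimension(d))
```

With a fixed number of particles, the effective sample size in this toy model shrinks quickly as the dimension grows, which is the motivation for importance sampling only in a small effective basis.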

Details:
We consider the problem of tracking the boundary contour of a moving and deforming object from a sequence of images. If the motion of the “object” or region of interest is constrained (e.g. rigid or approximately rigid), the contour motion can be efficiently represented by a small number of parameters, e.g. the affine group. But if the “object” is arbitrarily deforming, each contour point can move independently. Contour deformation then forms an infinite-dimensional (in practice, very large) space. Direct application of particle filters to large-dimensional problems is impractical because the effective particle size (roughly, the number of particles with non-negligible weight) decreases as the dimension increases. But in most real problems, at any given time, “most of the contour deformation” occurs in a small number of dimensions (“effective basis”) while the residual deformation in the rest of the state space (“residual space”) is “small”. The effective basis may be fixed or time varying. Based on this assumption, we modify the particle filtering method to perform sequential importance sampling only on the effective basis dimensions, while replacing it with deterministic mode tracking in the residual space (PF-MT). We develop the PF-MT idea for contour tracking.
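The split between importance sampling on the effective basis and mode tracking in the residual space can be sketched on a toy problem. The code below is a minimal illustration under strong assumptions and is not the Deform PF-MT code: the state is a plain vector following a random walk, the observation is y = x + Gaussian noise, the residual mode is available in closed form, and all names and parameter values (pf_mt_step, run_toy_demo, sig_s, sig_r, sig_o) are hypothetical. For real contours the mode would have to be found by an iterative minimization instead.

```python
import numpy as np

def pf_mt_step(particles, weights, y, k, sig_s, sig_r, sig_o, rng):
    """One PF-MT-style update for a toy random-walk model with y = x + noise.

    particles: (N, D) array. The first k columns play the role of the
    effective basis; the remaining D - k columns are the residual space.
    """
    n = particles.shape[0]
    new = np.empty_like(particles)

    # 1. Importance sample only the effective-basis part from its prior.
    new[:, :k] = particles[:, :k] + sig_s * rng.standard_normal((n, k))

    # 2. Mode tracking: set the residual part to the mode of its conditional
    #    posterior (closed form here because everything is Gaussian).
    prec_r, prec_o = 1.0 / sig_r ** 2, 1.0 / sig_o ** 2
    new[:, k:] = (prec_r * particles[:, k:] + prec_o * y[k:]) / (prec_r + prec_o)

    # 3. Weight by the observation likelihood times the residual prior at the mode.
    log_lik = -0.5 * prec_o * np.sum((y - new) ** 2, axis=1)
    log_prior_r = -0.5 * prec_r * np.sum((new[:, k:] - particles[:, k:]) ** 2, axis=1)
    log_w = np.log(weights) + log_lik + log_prior_r
    w = np.exp(log_w - log_w.max())
    w /= w.sum()

    # 4. Multinomial resampling; the returned weights are uniform again.
    idx = rng.choice(n, size=n, p=w)
    return new[idx], np.full(n, 1.0 / n)

def run_toy_demo(seed=0, d=30, k=2, n=500, steps=30):
    """Track a simulated state and return the final root-mean-square error."""
    rng = np.random.default_rng(seed)
    sig_s, sig_r, sig_o = 1.0, 0.05, 0.3  # assumed noise levels
    x = np.zeros(d)
    particles = np.zeros((n, d))
    weights = np.full(n, 1.0 / n)
    for _ in range(steps):
        x[:k] += sig_s * rng.standard_normal(k)      # large change, few dimensions
        x[k:] += sig_r * rng.standard_normal(d - k)  # small residual change
        y = x + sig_o * rng.standard_normal(d)
        particles, weights = pf_mt_step(particles, weights, y, k,
                                        sig_s, sig_r, sig_o, rng)
    estimate = weights @ particles
    return float(np.sqrt(np.mean((estimate - x) ** 2)))

if __name__ == "__main__":
    print(run_toy_demo())
```

Only k of the d dimensions are sampled, so the number of particles needed is governed by k rather than by d; the remaining dimensions are handled deterministically for each particle.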
    Deforming contours occur either due to a changing region of partial occlusion or when the object of interest is actually deforming its shape over a time or space sequence of images. Examples of the second kind are a beating heart, moving animals or humans, or the cross-sections of different parts of a 3D object like the brain, in consecutive MRI slices. Most biological images contain deforming objects/regions. Contour tracking has many applications in medical image analysis, e.g. sequential segmentation of volume images, tracking heart regions, or image-guided surgery. The observation likelihood is often multimodal due to background objects (clutter) which are partially occluded by the “object of interest”, due to an object which partially occludes the “object of interest”, or due to low contrast imagery. Heavy-tailed and often multimodal observation likelihoods occur when the observation noise has occasional outliers.
    In our initial work (CVPR 2005, PAMI 2007), we treated the 6 dimensional space of affine deformations as the "effective basis" while the space of non-affine deformation was the residual space. The implicit assumption is that the posterior of non-affine deformation (conditioned on affine deformation and the current image) is unimodal. This is valid for many practical problems where the non-affine deformation per frame is small, e.g. a rigid object tracked by a perspective camera with frequent viewpoint changes, or approximately rigid objects, e.g. human body contour from a distance.
    But in other situations, where local deformations are large, there may be more than one non-affine mode for the same affine deformation value and the same image, i.e. the posterior of non-affine deformation may be multimodal. Example applications are a rigid object undergoing partial occlusions, e.g. a car going under a light pole, or tracking regions of interest in low contrast medical images (multiple nearby contour modes due to the low contrast). Such applications also require importance sampling on the space of non-affine deformations. In recent work (CDC 2006, Trans. IP, Accepted 2008), we use global translations and deformation velocity at subsampled contour locations, interpolated using a B-spline basis, as the effective basis. The effective basis dimension is allowed to change with time. We are able to get excellent results with as few as K=6 subsampled points, which is much smaller than the total number of contour points, M=150-200. In other words, the deformation “signal” is approximately bandlimited (spatially), with the approximate cut-off frequency being much smaller than the maximum measurable frequency, 0.5 cycles/pixel. We can increase K if the approximate cut-off frequency increases.
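To make the interpolation idea concrete, the sketch below spreads velocity values given at K subsampled contour points to all M contour points with a periodic uniform cubic B-spline. It is an illustration under assumptions, not the released MATLAB code: it assumes a closed contour, scalar velocities, uniformly spaced subsampled points, and the function names are hypothetical.

```python
import numpy as np

def cubic_bspline(t):
    """Cardinal cubic B-spline, supported on [-2, 2]."""
    t = np.abs(np.asarray(t, dtype=float))
    out = np.zeros_like(t)
    inner = t < 1
    outer = (t >= 1) & (t < 2)
    out[inner] = 2.0 / 3.0 - t[inner] ** 2 + 0.5 * t[inner] ** 3
    out[outer] = (2.0 - t[outer]) ** 3 / 6.0
    return out

def periodic_basis(u, k):
    """Matrix Phi with Phi[m, j] = j-th periodic cubic B-spline evaluated at u[m]."""
    u = np.asarray(u, dtype=float)
    d = u[:, None] - np.arange(k)[None, :]
    # Add the wrapped copies so that the basis has period k (closed contour).
    return sum(cubic_bspline(d + shift * k) for shift in (-1, 0, 1))

def interpolate_velocity(v_sub, m):
    """Interpolate K velocities at subsampled points to M contour points (K >= 3)."""
    v_sub = np.asarray(v_sub, dtype=float)
    k = v_sub.shape[0]
    # Solve for spline coefficients so the curve passes through v_sub at the knots.
    coeffs = np.linalg.solve(periodic_basis(np.arange(k), k), v_sub)
    u = np.arange(m) * k / m  # contour parameter of each of the M points
    return periodic_basis(u, k) @ coeffs

if __name__ == "__main__":
    v_sub = np.array([0.0, 1.0, 0.5, -0.5, -1.0, 0.2])  # K = 6 made-up values
    v_full = interpolate_velocity(v_sub, 150)            # M = 150
    print(v_full.shape)
```

In this form the particle filter only needs to sample the K values in v_sub, while the full M-point velocity field is a smooth function of them.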

MATLAB Code for Deform PF-MT

If you use this code, please cite

N. Vaswani, Y. Rathi, A. Yezzi, A. Tannenbaum, PF-MT with an Interpolation Effective Basis for Tracking Local Contour Deformations, Accepted, IEEE Trans. Image Processing, 2008. or
N. Vaswani, A. Yezzi, Y. Rathi, A. Tannenbaum, Time-varying Finite Dimensional Basis for Tracking Contour Deformations, IEEE Conf. on Decision and Control (CDC), 2006.

Code: DeformPFMT.zip
You will also need the following two sets of tools: utils.zip netlab.zip
Unzip utils.zip , netlab.zip and DeformPFMT.zip
Add the utils and netlab directories to the MATLAB path
See README.tex or README.txt (inside the DeformPFMT directory) for further instructions

Other Related Papers:

N. Vaswani, Particle Filtering for Large Dimensional State Spaces with Multimodal Observation Likelihoods, To Appear in IEEE Trans. Signal Processing
Y. Rathi, N. Vaswani, A. Tannenbaum, A Generic Framework for Tracking using Particle Filter with Dynamic Shape Prior, IEEE Trans. Image Processing, pp.1370-1382, May 2007
N. Vaswani, A. Yezzi, Y. Rathi, A. Tannenbaum, Time-varying Finite Dimensional Basis for Tracking Contour Deformations, IEEE Conf. on Decision and Control (CDC), 2006.
N. Vaswani, Particle Filters for Infinite (Or Large) Dimensional State Spaces - Part 2 , IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), 2006.

Invited Talks based on this work:

SAMSI workshop on Geometry and Statistics of Shape Spaces (July 2007): Deformable Contour Tracking
IPAM workshop on Random Shapes for Image Processing (May 2007): Deformable Contour Tracking and System Identification
IMA Workshop on Shape Spaces, Minneapolis (April 2006): Deformable Contour Tracking

Videos for Affine PF-MT (Rathi et al, PAMI'07, posted above)
(created by Yogesh Rathi at GaTech): Link to Yogesh's page

Deform PF-MT Results : coming soon.
Refer to Vaswani et al, CDC'06 or Trans IP (accepted): posted above