About AMP Lab Projects Downloads Publications People Links

Group Member

Dhruv Batra
PhD. Student

Personal Homepage: http://www.ece.cmu.edu/~dbatra
Email:

Office: HH D-Level D-5
Lab: Porter Hall B6
Phone: 412-268-4267
Fax: 412-268-3890

Mailing Address:
HH 1111, Department of ECE,
Carnegie Mellon University,
5000 Forbes Avenue, Pittsburgh,
PA 15213-3890

[Research Interests] [Project] [Publications]

Biography

I am a 2nd year PhD student in the ECE department at Carnegie Mellon University. I'm supervised by Tsuhan Chen and am a member of his Advanced Multimedia Processing Lab. I also closely work with Rahul Sukthankar from Intel Research Pittsburgh, who was my mentor during an internship in summer 2007.

I received my master's degree from CMU in 2006, during which I worked with Martial Hebert from the Robotics Institute. In summer 2006, I interned at Intel Research Pittsburgh, working with Bart Nabbe. In my first semester at Carnegie Mellon, I also worked with Marios Savvides.

Before joining CMU, I earned a B.Tech from the Institute of Technology, Benaras Hindu University. As an undergrad, I spent a summer working with Santanu Chaudhury, in the Multimedia Lab, Electrical Engineering Department, Indian Institute of Technology Delhi.

Top of this page

Research Interests

Machine Learning

Graphical Models

Bayesian Methods

Combinatorics

Approximation algorithms

Computer Vision

Simultaneous segmentation and recognition

Action Recognition

Top of this page

Projects

Learning Class-Specific Affinities for Image Labelling

Spectral clustering and eigenvector-based methods have become increasingly popular in segmentation and recognition. Although the choice of the pairwise similarity metric (or affinities) greatly influences the quality of the results, this choice is typically specified outside the learning framework. In this work, we present an algorithm to learn class-specific similarity functions. Mapping our problem in a Conditional Random Fields (CRF) framework enables us to pose the task of learning affinities as parameter learning in undirected graphical models. There are two significant advances over previous work. First, we learn the affinity between a pair of data-points as a function of a pairwise feature and (in contrast with previous approaches) the classes to which these two data-points were mapped, allowing us to work with a richer class of affinities. Second, our formulation provides a principled probabilistic interpretation for learning all of the parameters that define these affinities. Using ground truth segmentations and labellings for training, we learn the parameters with the greatest discriminative power (in an MLE sense) on the training data. We demonstrate the power of this learning algorithm in the setting of joint segmentation and recognition of object classes. Specifically, even with very simple appearance features, the proposed method achieves state-of-the-art performance on standard datasets.

[ project page ]

Space-Time Shapelets for Action Recognition

Recent works in action recognition have begun to treat actions as space-time volumes. This allows actions to be converted into 3-D shapes, thus converting the problem into that of volumetric matching. However, the special nature of the temporal dimension and the lack of intuitive volumetric features makes the problem both challenging and interesting. In a data-driven and bottom-up approach, we propose a dictionary of mid-level features called Space-Time Shapelets. This dictionary tries to characterize the space of local space-time shapes, or equivalently local motion patterns formed by the actions. Representing an action as a bag of these space-time patterns allows us to reduce the combinatorial space of these volumes, become robust to partial occlusions and errors in extracting spatial support. The proposed method is computationally efficient and achieves competitive results on a standard dataset.

[ project page ]

An Alternative Formulation for the Five Point Relative Pose Problem

The “Five Point Relative Pose Problem” is to find all possible camera configurations between two calibrated views of a scene given five point-correspondences. We take a fresh look at this well-studied problem with an emphasis on the parametrization of Essential Matrices used by various methods over the years. Using one of these parametrizations, a novel algorithm is proposed, in which the solution to the problem is encoded in a system of nine quadratic equations in six variables, and is reached by formulating this as a constrained optimization problem. We compare our algorithm with an existing 5-point method, and show our formulation to be more robust in the presence of noise.

[ project page ]

Detailed project information can be found on my personal webpage here.

Top of this page

Publications

Conference Papers:

D. Batra, R. Sukthankar, and T. Chen. Learning Class-Specific Affinities for Image Labelling. IEEE Conference on Computer Vision and Pattern Recognition 2008 (CVPR '08).
D. Batra, T. Chen, and R. Sukthankar. Space-Time Shapelets for Action Recognition. Workshop on Motion and Video Computing 2008 (WMVC '08), IEEE Winter Vision Meetings.
D. Batra, B. Nabbe, and M. Hebert. An Alternative Formulation for the Five Point Relative Pose Problem. IEEE Workshop on Motion and Video Computing 2007 (WMVC '07).
D. Batra, G. Singhal, and S. Chaudhury. Gabor Filter based Fingerprint Classification using Support Vector Machines. IEEE India Annual Conference 2004 (INDICON '04).

Tech Reports:

D. Batra, R. Sukthankar, and T. Chen. Learning Class-Specific Affinities for Image Labelling. Carnegie Mellon University Technical Report: AMP08-01.

Soft copies of all publications can be found on my personal webpage here.

Top of this page