icml2012: exemplar-svms for visual object detection , label transfer and image retrieval

57
Exemplar-SVMs: Visual Object Detection, Label Transfer and Image Retrieval Tomasz Malisiewicz (Massachusetts Institute of Technology) Joint work with: Abhinav Shrivastava, Abhinav Gupta, and Alexei (Alyosha) Efros (Carnegie Mellon University)

Upload: zukun

Post on 20-Apr-2015

81 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMs: Visual Object Detection,

Label Transfer and Image Retrieval

Tomasz Malisiewicz(Massachusetts Institute of Technology)

Joint work with:Abhinav Shrivastava, Abhinav Gupta, and Alexei (Alyosha) Efros

(Carnegie Mellon University)

Page 2: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Overview• Visual Object Detection

• Exemplar-SVM Learning

• Understanding Exemplar-SVMs

• Experimental Results

• PASCAL VOC Object Detection

• Label Transfer

• Cross-domain Image Retrieval

• Concluding remarks and take-home lessons

Page 3: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Visual Object Detection

Page 4: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Visual Object Detection

• Histogram of Oriented Gradients features computed across a multiscale pyramid

Image HOG

Dalal et al 2005

Page 5: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Visual Object Detection-232'45647.+,-

• Linear SVMs for learning

• Histogram of Oriented Gradients features computed across a multiscale pyramid

Image HOG

Dalal et al 2005

Page 6: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Visual Object Detection-232'45647.+,-

• Linear SVMs for learning

LargeAnnotated

Dataset

• A large dataset such as PASCAL VOC (Everingham et al 2010)

• Histogram of Oriented Gradients features computed across a multiscale pyramid

Image HOG

Dalal et al 2005

Page 7: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

-232'45647.+,-Visual Object Detection

Page 8: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMs

Tomasz Malisiewicz, Abhinav Gupta, Alexei A. Efros. Ensemble of Exemplar-SVMs for Object Detection and Beyond. In ICCV, 2011.

Page 9: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMs

• Best of both worlds:

• Effectiveness of discriminatively-trained object detectors

• Explicit correspondence of Nearest Neighbor approaches

Tomasz Malisiewicz, Abhinav Gupta, Alexei A. Efros. Ensemble of Exemplar-SVMs for Object Detection and Beyond. In ICCV, 2011.

Page 10: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMs

• Because each Exemplar-SVM is defined by a single positive instance, we can use different features for each exemplar

7x4 HOG 4x8 HOG

Page 11: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMs

• Because each Exemplar-SVM is defined by a single positive instance, we can use different features for each exemplar

7x4 HOG 4x8 HOG

• Apply each Exemplar-SVM to test image in a sliding-window fashion

Page 12: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMsExemplar E’s Objective Function:

h(x) = max(1-x,0) “hinge-loss”

Page 13: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMs

Exemplar represented by ~100 HOG Cells (~3,000D features)

Exemplar E’s Objective Function:

h(x) = max(1-x,0) “hinge-loss”

Page 14: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMs

Windows from images not containing any in-class instances (2,000 images x 10,000 windows per image = 20M negatives )

Exemplar represented by ~100 HOG Cells (~3,000D features)

Exemplar E’s Objective Function:

h(x) = max(1-x,0) “hinge-loss”

Page 15: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Embarrassingly Parallel

• Each exemplar performs its own hard negative mining

• Solve many convex learning problems

• Parallel training on cluster

CPU1 CPU2 CPUN

Ex1 Ex2 ExN...

Page 16: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Visualizing Exemplar-SVMs

Page 17: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Visualizing Exemplar-SVMsExemplar-SVMs Top Detections in Test Set

Page 18: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

!

!

!

!

!

!

!"#$%&'()*+&'$",&)-./0.12 34(567%89:;<

=#>

!

!

!

!

Traditional NN Local Distance Function Exemplar-SVM

Understanding Exemplar-SVMs

Local Distance Functions: Frome et al, 2006

Page 19: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Understanding Exemplar-SVMs

!"#$%&'()*+,

--

!"#$%&'( ! ./%0102#3#435/6708(/$0.#737#3

9:;<

*Learned Distance Function

*

Page 20: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Understanding Exemplar-SVMs

!"#$%&'()*+,

--

!"#$%&'( ! ./%0102#3#435/6708(/$0.#737#3

9:;<

*Learned Distance Function

*

Page 21: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Understanding Exemplar-SVMs

!"#$%&'()*+,

--

!"#$%&'( ! ./%0102#3#435/6708(/$0.#737#3

9:;<

*Learned Distance Function

*

Page 22: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Understanding Exemplar-SVMs

!"#$%&'()*+,

--

!"#$%&'( ! ./%0102#3#435/6708(/$0.#737#3

9:;<

*Learned Distance Function

*

Page 23: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Ensemble of Exemplar-SVMs

Page 24: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Ensemble of Exemplar-SVMs

!"#$%"&'()*+%(,*$- .#,"%&'()*+%(,*$-

Platt Calibration (Platt 1999)

Exemplars Compete

Page 25: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Ensemble of Exemplar-SVMs

!"#$%"&'()*+%(,*$- .#,"%&'()*+%(,*$-

Platt Calibration (Platt 1999)

Exemplars Compete

Learning Exemplar Co-occurrence Matrix

Exemplars are Combined

Page 26: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Object Category Detection

mAP averaged across 20 object categories on the PASCAL VOC 2007 object detection task

Page 27: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Object Category Detection

Traditional NN + Calibration 0.110

Local Distance Function + Calibration 0.157

Exemplar-SVMs + Calibration 0.198

Exemplar-SVMs + Co-occurrence 0.227

One SVM per category (Dalal and Triggs 2005) 0.097

Deformable Part Model (Felzenszwalb et al 2010) 0.266

mAP averaged across 20 object categories on the PASCAL VOC 2007 object detection task

Page 28: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Beyond Detection: Label Transfer

Category-basedDetector

Ensemble of Exemplar-SVMs

Page 29: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval
Page 30: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval
Page 31: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval
Page 32: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Detector w

Appearance

Exemplar

Page 33: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Detector w

Appearance

Exemplar

Meta-dataPerson

Page 34: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Detector w

Appearance

Exemplar

PersonMeta-data

Person

Page 35: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval
Page 36: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval
Page 37: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Talk Overview• Visual Object Detection

• Exemplar-SVM Learning

• Understanding Exemplar-SVMs

• Experimental Results

• PASCAL VOC Object Detection

• Label Transfer

• Cross-domain Image Retrieval

• Concluding remarks and take-home lessons

Page 38: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMs

Windows from images not containing any in-class instances (2,000 images x 10,000 windows per image = 20M negatives )

Exemplar represented by ~100 HOG Cells (~3,000D features)

Exemplar E’s Objective Function:

h(x) = max(1-x,0) “hinge-loss”

Page 39: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVMs

Windows from images not containing any in-class instances (2,000 images x 10,000 windows per image = 20M negatives )

Exemplar represented by ~100 HOG Cells (~3,000D features)

Exemplar E’s Objective Function:

h(x) = max(1-x,0) “hinge-loss”

Page 40: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Object Category Detection

Traditional NN + Calibration 0.110

Local Distance Function + Calibration 0.157

Exemplar-SVMs + Calibration 0.198

Exemplar-SVMs + Co-occurrence 0.227

Exemplar-SVMs* + Calibration 0.142

Exemplar-SVMs* + Co-occurrence 0.197

Exemplar-SVMs* = Exemplar-SVMs with random negatives

Page 41: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Cross-domain Image Matching

Abhinav Shrivastava, Tomasz Malisiewicz, Abhinav Gupta, Alexei A. Efros. Data-driven Visual Similarity for Cross-domain Image Matching. In SIGGRAPH ASIA, 2011.

Page 42: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Query-specific Exemplar-SVM

Query Image

Random Flickr Images

Page 43: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Query-specific Exemplar-SVM

Query Image

Random Flickr Images

Page 44: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Search using Painting

35

Input Painting

GIST

Tiny Images

HOG

Bag-of-Words

Our Approach

Page 45: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Search Using Sketch

38

GIST

Tiny Images

Bag-of-Words

HOG

Our Approach

Input Sketch

Page 46: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVM vs. Google

Page 47: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Exemplar-SVM vs. Google

Page 48: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Concluding Remarks

• A mixture model with N mixture components

• The positives are represented non-parametrically and the negatives are represented parametrically

!"# !!!

"#$%&'()*+,-./ "#$%&'()*+,-.0 "#$%&'()*+,-.1-232'45647.+,-

Page 49: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Concluding Remarks• Exemplar-SVMs can be used for detection, label

transfer, as well as cross-domain image matching

Page 50: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Concluding Remarks

• Goods news: Results surprisingly nice, embarrassingly parallel learning

• Exemplar-SVMs can be used for detection, label transfer, as well as cross-domain image matching

Page 51: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Concluding Remarks

• Goods news: Results surprisingly nice, embarrassingly parallel learning

• Exemplar-SVMs can be used for detection, label transfer, as well as cross-domain image matching

• Bad news: Computationally Expensive

Page 52: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Thank you

Page 53: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Thank you

Come visit poster #30 in the Informatics Forumor Google “exemplar svm” to find papers and code

Page 54: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Varying Negative Set Sizes

Page 55: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Varying Number of Exemplar-SVMs

Page 56: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Varying Number of Exemplar-SVMs

Page 57: ICML2012: Exemplar-SVMs for Visual Object Detection , Label Transfer and Image Retrieval

Transfer Task 1: Evaluation on Buses

• 43.0% Hoiem et al. 2005

• 51.0% Monolithic Detector* + NN

• 62.3% Exemplar-SVMs

• measure pixelwise accuracy on the 3-class geometric-labeling problem: “left,” “front,” “right”-facing

*Felzenszwalb et al. 2010