we are hiring cv and ml researchers gotta adapt ’em all: joint...

1
Gotta Adapt ’Em All: Joint Pixel and Feature-Level Domain Adaptation for Recognition in the Wild Problem & Contributions Pixel and Feature-level Domain Adaptation Related Works Conclusions Experimental Results Problem Car recognition in surveillance domain with labeled training images from web domain which different in camera view- angle, lighting, weather condition, etc. Contribution Certain challenges are better handled in the image space, while others are better handled in the feature space. A novel UDA framework that adapts at multiple levels from pixel to feature, with complementary insights for each type of adaptation. Feature-level DA: classification-aware domain adversarial neural network. Pixel-level DA, attribute-conditioned CycleGAN & warping-based pose translations. A new experimental protocol on car recognition in surveillance domain. We propose a joint UDA framework by leveraging complementary tools that are better-suited for each type of adaptation challenge. Importance & complementarity of each component are demonstrated through experiments on an application of car recognition in surveillance domain. Unsupervised Domain Adaptation Domain adversarial neural network: Ganin et al. Maximum mean discrepancy: Saito et al. Perspective Transformation Direct image generation: Tatarchenko et al. Warping-based: Zhou et al. Image-to-image Translation Image translation with perspective transformation: CycleGAN - Zhu et al. Luan Tran 1 Kihyuk Sohn 2 Xiang Yu 2 Xiaoming Liu 1 Manmohan Chandraker 2,3 1 Michigan State University 2 NEC Labs America 3 UC San Diego Domain Adversarial Feature Learning Analysis on Pixel-level Adaptation CNN Discriminator (D=1) Model Classifier source CNN Discriminator (D=2) target shared shared CNN Model Classifier source CNN Discriminator (D=1) target shared CNN source CNN target shared CNN source CNN target shared Classifier (C=1,…,N / N+1) Classifier (C=N+1 / N+1) shared Classifier (C=1,…,N / N) Classifier (C<N+1 / N+1) shared Baseline: Domain Adversarial Neural Network (DANN) Classification-Aware Adversarial Learning (DANN-CA) Pixel-level Cross-Domain Image Translation Perspective Synthesis by Appearance Flow Photometric Transformation by CycleGAN Analysis on Feature-level Adaptation Perspective Transf. SV Day Night Baseline (web only) Supervised (web+SV) 54.98 98.63 72.67 98.92 19.87 98.05 Appearance Flow (AF) Keypoint-based AF (KF) KF with mask (MKF) 59.73 61.55 64.30 75.78 77.98 78.62 27.87 28.92 35.87 Photometric Transf. SV Day Night CycleGAN AC-CGAN MKF + CycleGAN MKF + AC-CGAN 64.32 67.30 71.21 79.71 77.01 78.20 81.54 84.10 39.12 45.66 50.68 70.99 Pixel Feature SV Day Night Baseline (web only) Supervised (web+SV) 54.98 98.63 72.67 98.92 19.87 98.05 - - MKF AC-CGAN Both DANN DANN-CA DANN-CA DANN-CA DANN-CA 60.40 75.83 80.40 80.24 84.20 75.56 76.73 82.50 82.15 85.77 30.31 74.05 76.22 76.44 81.10 Analysis on Joint Pixel and Feature (PnF) F: Improve Domain Alignment P: Improve Training Stability We are hiring CV and ML researchers https://bit.ly/2UQ7UfU Apply at: a=day a=night a D a D a F F G G Elev: 10° Elev: 20° Elev: 30° AF KF MKF day night Web

Upload: others

Post on 22-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: We are hiring CV and ML researchers Gotta Adapt ’Em All: Joint …cvlab.cse.msu.edu/posters/2019_CVPR_Gotta Adapt_Em_All.pdf · 2019-10-05 · MKF AC-CGAN Both DANN DANN-CA DANN-CA

Gotta Adapt ’Em All: Joint Pixel and Feature-Level

Domain Adaptation for Recognition in the Wild

Problem & Contributions Problem & Contributions Pixel and Feature-level Domain Adaptation Pixel and Feature-level Domain Adaptation

Related Works Related Works

Conclusions Conclusions

Experimental Results Experimental Results

Problem

• Car recognition in surveillance domain with labeled training

images from web domain which different in camera view-

angle, lighting, weather condition, etc.

Contribution

• Certain challenges are better handled in the image space,

while others are better handled in the feature space.

• A novel UDA framework that adapts at multiple levels from

pixel to feature, with complementary insights for each type

of adaptation.

• Feature-level DA: classification-aware domain

adversarial neural network.

• Pixel-level DA, attribute-conditioned CycleGAN &

warping-based pose translations.

• A new experimental protocol on car recognition in

surveillance domain.

• We propose a joint UDA framework by leveraging complementary tools that

are better-suited for each type of adaptation challenge.

• Importance & complementarity of each component are demonstrated through

experiments on an application of car recognition in surveillance domain.

Unsupervised Domain Adaptation

• Domain adversarial neural network: Ganin et al.

• Maximum mean discrepancy: Saito et al.

Perspective Transformation

• Direct image generation: Tatarchenko et al.

• Warping-based: Zhou et al.

Image-to-image Translation

• Image translation with perspective transformation:

CycleGAN - Zhu et al.

Luan Tran1 Kihyuk Sohn2 Xiang Yu2 Xiaoming Liu1 Manmohan Chandraker2,3 1Michigan State University 2NEC Labs America 3UC San Diego

Domain Adversarial Feature Learning Analysis on Pixel-level Adaptation

CNN

Discriminator (D=1)

Model Classifier

source

CNN Discriminator (D=2) target

shared shared

CNN Model Classifier source

CNN Discriminator (D=1) target

shared

CNN source

CNN target

shared

CNN source

CNN target

shared

Classifier

(C=1,…,N / N+1)

Classifier

(C=N+1 / N+1)

shared

Classifier

(C=1,…,N / N)

Classifier

(C<N+1 / N+1)

shared

Baseline: Domain Adversarial Neural Network (DANN)

Classification-Aware Adversarial Learning (DANN-CA)

Pixel-level Cross-Domain Image Translation

• Perspective Synthesis by Appearance Flow

• Photometric Transformation by CycleGAN

Analysis on Feature-level Adaptation

Perspective Transf. SV Day Night

Baseline (web only)

Supervised (web+SV)

54.98

98.63

72.67

98.92

19.87

98.05

Appearance Flow (AF)

Keypoint-based AF (KF)

KF with mask (MKF)

59.73

61.55

64.30

75.78

77.98

78.62

27.87

28.92

35.87

Photometric Transf. SV Day Night

CycleGAN

AC-CGAN

MKF + CycleGAN

MKF + AC-CGAN

64.32

67.30

71.21

79.71

77.01

78.20

81.54

84.10

39.12

45.66

50.68

70.99

Pixel Feature SV Day Night

Baseline (web only)

Supervised (web+SV)

54.98

98.63

72.67

98.92

19.87

98.05

-

-

MKF

AC-CGAN

Both

DANN

DANN-CA

DANN-CA

DANN-CA

DANN-CA

60.40

75.83

80.40

80.24

84.20

75.56

76.73

82.50

82.15

85.77

30.31

74.05

76.22

76.44

81.10

Analysis on Joint Pixel and Feature (PnF)

F: Improve Domain Alignment P: Improve Training Stability

We are hiring CV and ML researchers

https://bit.ly/2UQ7UfU Apply at:

a=day

a=night a

Da Da

F F G G

Elev: 10°

Elev: 20°

Elev: 30°

AF KF MKF day night Web