

Cross-view Activity Recognition using Hankelets

Binlong Li, Octavia I. Camps and Mario Sznaier

Dept. of Electrical and Computer Engineering

Robust Systems Lab

Hankelets: Viewpoint invariance

$$p^{(1)}_k = \sum_i p^{(1)}_{k-i}\, a_i, \qquad p^{(2)}_k = \sum_i p^{(2)}_{k-i}\, a_i, \qquad p^{(3)}_k = \sum_i p^{(3)}_{k-i}\, a_i$$

Regressor is invariant to affine transformations
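The invariance claim can be checked numerically. The following is a minimal sketch, not the authors' code: it assumes a synthetic tracklet generated by a 2nd-order recurrence, and it models a viewpoint change by a hypothetical invertible 2x2 linear map (the translation part of an affine map is left out of this sketch). The helper `fit_regressor` and all numeric values are illustrative assumptions.

```python
import numpy as np


def fit_regressor(points, order):
    """Least-squares coefficients a_i with p_k ~ sum_i a_i * p_{k-i}."""
    T = points.shape[0]
    # For every frame k >= order, add one x equation and one y equation.
    # Column i-1 of each 2 x order block holds p_{k-i}.
    lhs = np.concatenate(
        [points[k - order:k][::-1].T for k in range(order, T)], axis=0
    )
    rhs = points[order:].reshape(-1)
    return np.linalg.lstsq(lhs, rhs, rcond=None)[0]


# Reference-view tracklet obeying p_k = 1.8 p_{k-1} - 1.0 p_{k-2} (assumed values).
a_true = np.array([1.8, -1.0])
p = np.zeros((15, 2))
p[0], p[1] = [1.0, 0.0], [0.7, 0.6]
for k in range(2, 15):
    p[k] = a_true[0] * p[k - 1] + a_true[1] * p[k - 2]

# Hypothetical invertible maps standing in for two other viewpoints.
views = {
    1: p,
    2: p @ np.array([[0.9, -0.4], [0.3, 1.1]]).T,
    3: p @ np.array([[0.2, 1.5], [-1.0, 0.4]]).T,
}
coeffs = {v: fit_regressor(q, order=2) for v, q in views.items()}
for v, a in coeffs.items():
    print(v, a)
```

All three views recover the same coefficients because a linear map applied to every point commutes with the sum on the right-hand side of the regressor.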

While Hankelets carry viewpoint invariance, they are not immune to self-occlusions and limited field of view.

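Gamma pdf of the within-cluster dissimilarity $d$:

$$p(d \mid w) = \begin{cases} \dfrac{a^{b}\, d^{\,b-1}}{(b-1)!}\, e^{-a d} & \text{for } d > 0 \\ 0 & \text{otherwise} \end{cases}$$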
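As a small illustration, the pdf can be evaluated directly from the formula. This sketch assumes `b` is a positive integer (so that the factorial applies) and uses hypothetical parameter values `a = 2.0`, `b = 3`; the poster does not state how the parameters are chosen.

```python
import math


def gamma_pdf(d, a, b):
    """a^b * d^(b-1) * exp(-a*d) / (b-1)! for d > 0, and 0 otherwise.

    Assumes a > 0 and b a positive integer.
    """
    if d <= 0:
        return 0.0
    return (a ** b) * (d ** (b - 1)) * math.exp(-a * d) / math.factorial(b - 1)


# Hypothetical parameters, chosen only to exercise the function.
a, b = 2.0, 3
for d in (-1.0, 0.5, 1.0, 2.0):
    print(d, gamma_pdf(d, a, b))
```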

Hankelet Codebook: Clustering

Dissimilarity Score (between $\hat{H}_p$ and $\hat{H}_q$)
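The transcript names the score but does not give its formula. The sketch below therefore uses an assumed form, $d(H_p, H_q) = 2 - \lVert \hat{H}_p \hat{H}_p^T + \hat{H}_q \hat{H}_q^T \rVert_F$ with $\hat{H} = H / \sqrt{\lVert H H^T \rVert_F}$, which measures how well two column spaces align. Treat both the formula and the function names as assumptions rather than the poster's definition.

```python
import numpy as np


def normalize(H):
    """Scale H so that the Frobenius norm of H @ H.T equals 1 (assumed normalization)."""
    return H / np.sqrt(np.linalg.norm(H @ H.T, "fro"))


def dissimilarity(Hp, Hq):
    """Assumed alignment score: 0 for identical column spaces, larger when they differ."""
    Hp_hat, Hq_hat = normalize(Hp), normalize(Hq)
    return 2.0 - np.linalg.norm(Hp_hat @ Hp_hat.T + Hq_hat @ Hq_hat.T, "fro")


# Toy matrices with the same number of rows (hypothetical values).
Hp = np.array([[1.0, 2.0], [0.0, 0.0]])
Hq = np.array([[0.0, 0.0], [3.0, 1.0]])
print(dissimilarity(Hp, Hp))        # same matrix
print(dissimilarity(Hp, 3.0 * Hp))  # same column space, different scale
print(dissimilarity(Hp, Hq))        # orthogonal column spaces
```

Only one Frobenius norm of a small Gram-like matrix is needed per pair, so a score of this kind is cheap to evaluate during clustering.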

Improving Cross-view Performance: Bag of Bi-lingual Hankelets

Bi-lingual Hankelets: subset visible from different viewpoints

[Figure: "Waving" seen from Camera0 and Camera4. Bi-lingual tracklets marked in green.]

Abstract

We introduce a new feature for cross-view activity recognition: the “Hankelet”. This type of feature captures dynamic properties of short tracklets that are invariant to viewpoint changes and time shifts. Experiments using Hankelets on the IXMAS database show a 20% improvement over the state of the art.

Experimental Results

Single View: KTH Dataset

Activity type    Accuracy (%)
Boxing           95.71
Hand clapping    95.48
Hand waving      99.09
Walking          99.52
Running          91.52
Jogging          94.05

Cross-View: IXMAS Dataset

Train classifier with data from one viewpoint, test with data from a different one.

Algorithm       Perf.
Ours            56.4%
Cuboids [16]    10.9%

Cross-View: IXMAS Dataset

Comparing against state-of-the-art cross-view techniques

Algorithm             Perf.
Ours                  90.57%
Liu et al. [16]       75.3%
Farhadi et al. [7]    58.1%
Junejo et al. [10]    59.5%
Farhadi et al. [6]    74.4%

*See Table 5 in paper for detailed comparisons

Single View: KTH Dataset (comparison with other methods)

Algorithm           Perf. (%)
Ours                95.89
Cao et al. [3]      95.02
Wang et al. [29]    94.2
Le et al. [13]      93.9
Li et al. [14]      93.6

GOAL

To recognize an activity from a different viewpoint than the one used for training.

[Figure: Training Data and Testing Data]

EXISTING APPROACHES

• Geometric constraints [32]
• Track body joints [21,22]
• 3D Models [8,15,30,31]
• Quasi-invariant geometric features [10,11]
• Transfer features across views [7,16] (75.3% accuracy on IXMAS)

The best cross-view performance is far below state-of-the-art performance for single-view activity recognition.

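HANKELETS

A Dynamics-based Feature: Hankelet

Detected tracklet, described by an nth order regressor:

$$\sum_{i=1}^{\operatorname{rank}(H)} a_i \begin{bmatrix} x_{k-i} \\ y_{k-i} \end{bmatrix} = \begin{bmatrix} x_k \\ y_k \end{bmatrix}$$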
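To make the link between a tracklet, its Hankel matrix and the regressor concrete, here is a minimal sketch under stated assumptions: the tracklet is synthetic and obeys an exact 2nd-order recurrence, the block-Hankel layout (x and y stacked per frame) is one common convention, and the function names are hypothetical.

```python
import numpy as np


def hankel_from_tracklet(points, num_block_rows):
    """Stack a 2-D tracklet of shape (T, 2) into a block-Hankel matrix."""
    points = np.asarray(points, dtype=float)
    num_cols = points.shape[0] - num_block_rows + 1
    if num_cols < 1:
        raise ValueError("tracklet too short for this many block rows")
    # Column j stacks frames j .. j+num_block_rows-1 as x, y, x, y, ...
    return np.stack(
        [points[j:j + num_block_rows].reshape(-1) for j in range(num_cols)],
        axis=1,
    )


def fit_regressor(points, order):
    """Least-squares a_1..a_order such that p_k ~ sum_i a_i * p_{k-i}."""
    points = np.asarray(points, dtype=float)
    lhs, rhs = [], []
    for k in range(order, points.shape[0]):
        # Two scalar equations per frame: one for x_k, one for y_k.
        lhs.append(np.stack([points[k - i] for i in range(1, order + 1)], axis=1))
        rhs.append(points[k])
    coeffs, *_ = np.linalg.lstsq(np.concatenate(lhs), np.concatenate(rhs), rcond=None)
    return coeffs


# Synthetic tracklet obeying p_k = 1.6 p_{k-1} - 0.8 p_{k-2} (assumed values).
true_a = np.array([1.6, -0.8])
frames = [np.array([1.0, 0.5]), np.array([1.2, 0.1])]
for k in range(2, 12):
    frames.append(true_a[0] * frames[k - 1] + true_a[1] * frames[k - 2])
tracklet = np.array(frames)

H = hankel_from_tracklet(tracklet, num_block_rows=3)
rank_H = np.linalg.matrix_rank(H, tol=1e-8)
a_hat = fit_regressor(tracklet, order=rank_H)
print(H.shape, rank_H, a_hat)
```

For this noise-free tracklet the numerical rank of the Hankel matrix matches the order of the recurrence, which is why the sum in the regressor runs up to rank(H). With real, noisy tracklets the rank threshold would need tuning.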

[Figure: Tracklets Detection (shown in green) for the activities "scratch", "get up" and "punch", seen from Camera0, Camera1, Camera2, Camera3 and Camera4. Code provided by [29].]

• Captures “alignment” between the column spaces.
• Robust to noisy measurements.
• Computationally efficient.
• Dissimilarity within cluster follows a Gamma pdf.

This work was supported in part by NSF grants IIS–0713003 and ECCS–0901433, AFOSR grant FA9550–09–1–0253, and the Alert DHS Center of Excellence under Award Number 2008-ST-061-ED0001.

Hankelets: Initial Frame Invariance

The columns of the Hankelets span the same space, regardless of the initial frame.

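Hankelets starting at different initial frames factor as $\Gamma X$ and $\Gamma X_0$: the left factor $\Gamma$ is shared, and only the right factor changes.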
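The initial-frame invariance can be illustrated numerically. This sketch assumes a synthetic, noise-free tracklet from a 2nd-order recurrence and compares the column spaces of two Hankel matrices built from different starting frames through their principal angles. The helper names and all values are hypothetical.

```python
import numpy as np


def block_hankel(points, num_block_rows):
    """Block-Hankel matrix of a (T, 2) tracklet, x and y stacked per frame."""
    num_cols = points.shape[0] - num_block_rows + 1
    return np.stack(
        [points[j:j + num_block_rows].reshape(-1) for j in range(num_cols)], axis=1
    )


def column_space(H, rel_tol=1e-8):
    """Orthonormal basis of the numerical column space of H."""
    U, s, _ = np.linalg.svd(H, full_matrices=False)
    return U[:, : int(np.sum(s > rel_tol * s[0]))]


# Synthetic tracklet obeying p_k = 1.8 p_{k-1} - 1.0 p_{k-2} (assumed values).
p = np.zeros((16, 2))
p[0], p[1] = [1.0, 0.0], [0.7, 0.6]
for k in range(2, 16):
    p[k] = 1.8 * p[k - 1] - 1.0 * p[k - 2]

H_a = block_hankel(p[0:12], 3)  # window starting at frame 0
H_b = block_hankel(p[4:16], 3)  # window starting at frame 4
U_a, U_b = column_space(H_a), column_space(H_b)

# Cosines of the principal angles between the two column spaces.
cosines = np.linalg.svd(U_a.T @ U_b, compute_uv=False)
print(U_a.shape[1], U_b.shape[1], cosines)
```

Cosines equal to 1 (up to rounding) mean the two column spaces coincide, even though the matrices themselves differ entry by entry.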

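Over 20% improvement: 90.57% vs. 75.3% for the best listed prior cross-view method [16].

More than 400% accuracy improvement: 56.4% vs. 10.9% for cuboids [16] when training and testing on different viewpoints.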
