a dataset and benchmark for large-scale multi-modal face ... zhang's... · acquisition details...

Shifeng Zhang 1 *, Xiaobo Wang 2 *, Ajian Liu 3 , Chenxu Zhao 2 , Jun Wan 1 , Sergio Escalera 4 , Hailin Shi 2 , Zezheng Wang 5 , Stan Z. Li 1 1 CBSR, NLPR, CASIA, UCAS 2 JD AI Research 3 MUST 4 CVC, UB 5 JD Finance A Dataset and Benchmark for Large-scale Multi-modal Face Anti-spoofing Experiments CASIA-SURF Dataset Acquisition Details • Intel RealSense SR300 camera for RGB, Depth and Infrared (IR) • Face angle: ≤ 30° • Face distance: 0.3~1.0 m Six Attacks • 2 Prints: flat and curved face • 3 Cuts: eyes/nose/mouth region Data Preprocessing • Input original image to Dlib tool for face detection • Apply PRNet to estimate face binary mask • Get face area by pointwise product Subset Dividing • Split into train/val/test randomly Statistics • 1000 subjects, 21000 videos, 492522 images, 3 modalities • Balanced male to female ratio • Reasonable age distribution Existing face anti-spoofing benchmark datasets have limited number of subjects (≤170) and modalities (≤ 2), hindering the further development. Contribution Present a large-scale (1000 subjects) multi- model (3 modalities) face anti-spoofing dataset. Introduce a new multi- modal fusion method to merge three modalities. Conduct extensive experiments to verify its significance and generalization capability. Motivation Dataset Year # subjects # videos Camera Modal Attack Replay-Attack 2012 50 1200 VIS RGB Print, 2 Replay CASIA-MFSD 2012 50 600 VIS RGB Print, Replay 3DMAD 2013 17 255 VIS/Kinect RGB/Depth 3D Mask MSU-MFSD 2015 35 440 Phone/Laptop RGB Print, 2 Replay Replay-Mobile 2016 40 1030 VIS RGB Print, Replay Msspoof 2016 21 4704 VIS/NIR RGB/IR Print Oulu-NPU 2017 55 5940 VIS RGB 2 Print, 2 Replay SiW 2018 165 4620 VIS RGB 2 Print, 4 Replay Ours 2018 1000 21000 RealSense RGB/Depth/IR Print, Cut Model Analysis Method TRP (%) APCER (%) NPCER (%) ACER (%) @FPR=10 -2 @FPR=10 -3 @FPR=10 -4 Halfway fusion 89.1 33.6 17.8 5.6 3.8 4.7 Proposed fusion 96.7 81.8 56.8 3.8 1.0 2.4 Baseline: SEFusion • Performs feature re- weighting to select more informative channels while suppressing less useful ones for each modal Effect of Number Modalities Modal TRP (%) APCER (%) NPCER (%) ACER (%) @FPR=10 -2 @FPR=10 -3 @FPR=10 -4 RGB 49.3 16.6 6.8 8.0 14.5 11.3 Depth 88.3 27.2 14.1 5.1 4.8 5.0 IR 65.3 26.5 10.9 15.0 1.2 8.1 RGB & Depth 86.1 49.5 10.6 4.3 5.6 5.0 RGB & IR 79.1 50.9 26.1 14.4 1.6 8.0 Depth & IR 89.7 71.4 24.3 1.5 8.4 4.9 RGB & Depth & IR 96.7 81.8 56.8 3.8 1.0 2.4 Effect of Number Subjects # ID TRP (%) ACER (%) @FPR=10 -2 @FPR=10 -3 @FPR=10 -4 50 88.0 28.5 0 5.1 100 89.5 56.9 0 4.3 200 91.1 69.3 46.1 3.6 300 96.7 81.8 56.8 2.4 SiW dataset Prot. Method APCER (%) BPCER (%) ACER (%) 1 FAS-BAS 3.58 3.58 3.58 FAS-TD-SF 1.27 0.83 1.05 FAS-TD-SF-CASIA-SURF 1.27 0.33 0.80 2 FAS-BAS 0.57 ± 0.69 0.57 ± 0.69 0.57 ± 0.69 FAS-TD-SF 0.33 ± 0.27 0.29 ± 0.39 0.31 ± 0.28 FAS-TD-SF-CASIA-SURF 0.08 ± 0.17 0.25 ± 0.22 0.17 ± 0.16 3 FAS-BAS 8.31 ± 3.81 8.31 ± 3.80 8.31 ± 3.81 FAS-TD-SF 7.70 ± 3.88 7.76 ± 4.09 7.73 ± 3.99 FAS-TD-SF-CASIA-SURF 6.27 ± 4.36 6.43 ± 4.42 6.35 ± 4.39 CASIA-MFSD dataset Method Training Testing HTER (%) Motion Replay-Attack CASIA-MFSD 47.9 LBP Replay-Attack CASIA-MFSD 57.6 Motion-Mag Replay-Attack CASIA-MFSD 47.0 Spectral cubes Replay-Attack CASIA-MFSD 50.0 CNN Replay-Attack CASIA-MFSD 45.5 FAS-TD-SF SiW CASIA-MFSD 39.4 FAS-TD-SF CASIA-SURF CASIA-MFSD 37.3 Generalization Capability

Upload: others

Post on 25-Jul-2020

2 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: A Dataset and Benchmark for Large-scale Multi-modal Face ... Zhang's... · Acquisition Details • Intel RealSense SR300 camera for RGB, Depth and Infrared (IR) • Face angle: ≤

Shifeng Zhang1*, Xiaobo Wang2*, Ajian Liu3, Chenxu Zhao2, Jun Wan1, Sergio Escalera4, Hailin Shi2, Zezheng Wang5, Stan Z. Li11CBSR, NLPR, CASIA, UCAS 2JD AI Research 3MUST 4CVC, UB 5JD Finance

A Dataset and Benchmark for Large-scale Multi-modal Face Anti-spoofing

ExperimentsCASIA-SURF Dataset Acquisition Details

• Intel RealSense SR300 camera for RGB, Depth and Infrared (IR)

• Face angle: ≤ 30°• Face distance: 0.3~1.0 m

Six Attacks• 2 Prints: flat and curved face• 3 Cuts: eyes/nose/mouth region

Data Preprocessing• Input original image to Dlib tool

for face detection• Apply PRNet to estimate face

binary mask• Get face area by pointwise product

Subset Dividing• Split into train/val/test randomly

Statistics• 1000 subjects, 21000 videos,

492522 images, 3 modalities• Balanced male to female ratio• Reasonable age distribution

Existing face anti-spoofing benchmark datasets have limited number of subjects (≤170) and modalities (≤ 2), hindering the further development.

Contribution Present a large-scale

(1000 subjects) multi-model (3 modalities) face anti-spoofing dataset.

Introduce a new multi-modal fusion method to merge three modalities.

Conduct extensive experiments to verify its significance and generalization capability.

MotivationDataset Year # subjects # videos Camera Modal Attack

Replay-Attack 2012 50 1200 VIS RGB Print, 2 Replay

CASIA-MFSD 2012 50 600 VIS RGB Print, Replay

3DMAD 2013 17 255 VIS/Kinect RGB/Depth 3D Mask

MSU-MFSD 2015 35 440 Phone/Laptop RGB Print, 2 Replay

Replay-Mobile 2016 40 1030 VIS RGB Print, Replay

Msspoof 2016 21 4704 VIS/NIR RGB/IR Print

Oulu-NPU 2017 55 5940 VIS RGB 2 Print, 2 Replay

SiW 2018 165 4620 VIS RGB 2 Print, 4 Replay

Ours 2018 1000 21000 RealSense RGB/Depth/IR Print, Cut

Model AnalysisMethod

TRP (%) APCER(%)

NPCER(%)

ACER(%)@FPR=10-2 @FPR=10-3 @FPR=10-4

Halfway fusion 89.1 33.6 17.8 5.6 3.8 4.7Proposed fusion 96.7 81.8 56.8 3.8 1.0 2.4

Baseline: SEFusion• Performs feature re-

weighting to select more informative channels while suppressing less useful ones for each modal

Effect of Number ModalitiesModal

TRP (%) APCER(%)

NPCER(%)

ACER(%)@FPR=10-2 @FPR=10-3 @FPR=10-4

RGB 49.3 16.6 6.8 8.0 14.5 11.3Depth 88.3 27.2 14.1 5.1 4.8 5.0

IR 65.3 26.5 10.9 15.0 1.2 8.1RGB & Depth 86.1 49.5 10.6 4.3 5.6 5.0

RGB & IR 79.1 50.9 26.1 14.4 1.6 8.0Depth & IR 89.7 71.4 24.3 1.5 8.4 4.9

RGB & Depth & IR 96.7 81.8 56.8 3.8 1.0 2.4

Effect of Number Subjects

# IDTRP (%) ACER

(%)@FPR=10-2 @FPR=10-3 @FPR=10-4

50 88.0 28.5 0 5.1100 89.5 56.9 0 4.3200 91.1 69.3 46.1 3.6300 96.7 81.8 56.8 2.4

SiW dataset

Prot. Method APCER (%) BPCER (%) ACER (%)

1FAS-BAS 3.58 3.58 3.58

FAS-TD-SF 1.27 0.83 1.05FAS-TD-SF-CASIA-SURF 1.27 0.33 0.80

2FAS-BAS 0.57±0.69 0.57±0.69 0.57±0.69

FAS-TD-SF 0.33±0.27 0.29±0.39 0.31±0.28FAS-TD-SF-CASIA-SURF 0.08±0.17 0.25±0.22 0.17±0.16

3FAS-BAS 8.31±3.81 8.31±3.80 8.31±3.81

FAS-TD-SF 7.70±3.88 7.76±4.09 7.73±3.99FAS-TD-SF-CASIA-SURF 6.27±4.36 6.43±4.42 6.35±4.39

CASIA-MFSD dataset

Method Training Testing HTER (%)

Motion Replay-Attack CASIA-MFSD 47.9

LBP Replay-Attack CASIA-MFSD 57.6

Motion-Mag Replay-Attack CASIA-MFSD 47.0

Spectral cubes Replay-Attack CASIA-MFSD 50.0

CNN Replay-Attack CASIA-MFSD 45.5

FAS-TD-SF SiW CASIA-MFSD 39.4

FAS-TD-SF CASIA-SURF CASIA-MFSD 37.3

Generalization Capability

DATACARD SR300 RETRANSFER CARD PRINTERphrs.co.za/images/SR300_data_sheet.pdf · 2014. 6. 17. · DATACARD ® SR300 RETRANSFER CARD PRINTER The Datacard ® SR300 retransfer card printer

DATACARD SR200 & SR300 RETRANSFER CARD PRINTERS · Technology Count on Datacard® SR200 & SR300 Retransfer Printers to produce high-quality, real edge-to-edge printing over a variety

Creating 3D images for facial recognition using the ...essay.utwente.nl/75772/1/Manterola_BA_SCS.pdfCreating 3D images for facial recognition using the RealSense SR300 Nahuel Manterola

Zhang's Portfolio

DATACARD® SR200 & SR300 RETRANSFER CARD PRINTERS...ribbons which obscures data on used retransfer film. • Industry Leading Reliability. Datacard® SR200 & SR300 retransfer printers,

63spidermonkey.weebly.com63spidermonkey.weebly.com/uploads/5/2/1/5/5215820/the_silk_road_-_east... · (Chen), on a crucial mission. Zhang's goal was to make an alliance with some

DATACARD SR200 & SR300 RETRANSFER CARD PRINTERS · black panel text on ribbons which obscures data on used retransfer film. • Industry Leading Reliability. Datacard® SR200 & SR300

Yiwei Zhang's Visual Resume

Detecting Face with Densely Connected Face Proposal Network › users › sfzhang › Shifeng Zhang's Homepage... · 2018-02-19 · detection benchmark datasets with CPU real-time

User Manual Fresh Roast SR500 SR300 - Roastmasters.com · User Manual Fresh Roast SR500 SR300 Coffee Bean Roasters ... Replacement will be made with a new or remanufactured product

MASTER ZHANG'S CALLIGRAPHY SCROLLS

Zhang's analysis

JOURNAL OF IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS … Zhang's Homepage_files/T… · [1]–[5], which aims to detect object instances of the predeﬁned categories. Accurate object

Intel® RealSense™ SDK Release Notesregistrationcenter-download.intel.com/akdlm/irc_nas/...SDK_2016_R2.pdf · Release Notes Intel® RealSenseTM SDK Release F200 Gold R200 Gold SR300

DATACARD SR300 RETRANSFER CARD PRINTER · Superior image quality at an affordable price The Datacard® SR300 retransfer card printer provides full-color, over-the-edge, dual-sided

Driving the big deals Our international Banking & Finance team · Our experiences international Banking & Finance team Client Deal size Summary Saudi-based bank SR300 million Project

Achieving Trafﬁc - Transportation Research Boardonlinepubs.trb.org/onlinepubs/sr/sr300.pdf · viii Achieving Trafﬁc Safety Goals in the United States Nations. The committee also

Jonathan Zhang's portfolio

Forum may 2011 yun zhang's presentation

Australian Hualong Pty Ltd - EPA Tasmania Hualong... · and base metal mining, processing and smelting, mostly through his operations in Yunnan, China. More details about Mr. Zhang's

Dr. Zhang's CV

The Formalism for Networks Today from the Internet to ... · Nagurney and Zhang's (1996) book, Projected Dynamical Systems and Variational Inequalities, is published. Ran and Boyce's

Financial Engineering and Actuarial Science. Zhang's presentation UCSB 2-21... · Financial engineering and actuarial science as a new practice for variable annuities . 4 Traditional

SR300 SR400 - starrett.com

1 Game Theory - Jun Zhang's Website - Homezhangjun.weebly.com/uploads/2/8/1/8/2818435/lecturenotesf.pdf1 Game Theory 1.1 Basic Elements of Noncooperative Games (i) The players deﬁnes

JIABEI ZHANG'S VITAE - Homepages at WMUhomepages.wmich.edu/~zhangj/wordsfiles/Vitae Zhang.pdfVitae 3 40. Zhang, J. (2010).Quantitative analysis of the adapted physical education employment

Although some fashion labels started in-Nionúeal …static2.shopify.com/s/files/1/0025/8232/files/gazette_article.pdf · Karl Hearne and Stacey Zhang's customers ask for locally-made

Intel RealSenseTM Depth Camera SR300 Series …...Document Number: 334531- 002 Intel® RealSenseTM Depth Camera SR300 Series Product Family Datasheet Intel® RealSense Depth Camera

Superior, WI face trim - Lincoln Marketing · Superior, WI face trim face trim face trim face trim face trim face trim face trim face trim face trim face trim face trim face trim

SR200 & SR300 Card Printer User Guide

Face Face

Muhan Zhang's Homepage - Weisfeiler-Lehman Neural ...Weisfeiler-Lehman Neural Machine for Link Prediction Muhan Zhang Department of Computer Science and Engineering Washington University

Intel R RealSense SR300 Coded light depth Camera TM

May 10, 2019 Announcements - Yale Cancer Center · 2020. 2. 22. · impair T cell tumor infiltration. Dr. Zhang's research interests focus on the microenvironment of poorly immune

User’s Guide - Amped Wireless · 2017-11-30 · SR300 USER’S GUIDE 2 INTRODUCTION Thank you for purchasing this Amped Wireless product. At Amped Wireless we strive to provide