fpga based system prototype for face location and ...jmram/cvjmr/fpga based conielecomp2010.pdf ·...

5
FPGA Based System Prototype for Face Location and Recording of People Getting on a Public Transport Bus. Jorge Martínez-Carballido, Carlos García-Lucero, Juan Manuel Ramírez-Cortés, Rogerio Enríquez- Caldera Electronics Department, National Institute of Astrophysics, Optics and Electronics {jmc, cgl, jmram,rogerio}@inaoep.mx Abstract Assault and robbery on the public transport is a social problem. Almost every day, people lose possessions and, in some cases, thieves or citizens get hurt or even die, in the process. In Mexico, attempts to reduce or prevent crime include vigilance by armed agents [1 ] and video cameras on public transport [2 ]. This work contributes to crime reduction and/or prevention on public transport by taking photographic records of people as they get on the bus, using an FPGA based system with camera and Secure Digital (SD) storage. An FPGA is used as the processing device which contains all the necessary logic to obtain the images from the camera, process them to locate the face, extract face region and store them into a SD media using an uncompressed BMP file. For face area location single channel image was used to extract regional attributes and their inter- regional relationship, resulting in a face detection algorithm with tolerance to skin tones and light conditions. 1. Introduction The use of technology as a tool for preventing and solving crime is a natural response to the decreasing security. Also, faster and smaller integrated circuits (ICs) which can be used to process ever increasing image resolution and higher processing capability of information are now available at lower cost. Increasing demand of high resolution cameras of cellular phones, personal devices and laptops, lead to economies of scale and decrease of prices. Technology to prevent and decrease assault and robbery on public transport, and public places, is being used around the world. For instance, United Kingdom government has invested billions of pounds on video cameras to be used to monitor the security by police [3 ]. The initiative included: Images database to track and identify offenders. Publish images of robbery cases on internet. National CCTV recordings and pictures database of convicted offenders as well as unidentified suspects. Although a good strategy to reduce crime was planed, only 3% of assaults were solved by the use of CCTV technology [3 ]. Problem consists on people monitoring the cameras because they had little training, and the task of watching video for hours is not effective. Also when something occurs, search for the exact moment of occurrence is a hard work. After training and new strategies, CCTV technology was helpful in solving 15-20% of street robberies [3 ]. In Mexico, the use of technology to prevent crime is very limited and little information can be found. In June, 2009, the State of Puebla Congress’ security commission approved the use of camera installation on public transport [2 ]. This preventing technology is on its experimental phase, and Congress expressed its concern about the high cost of this technology for transport owners. The article didn’t mention about the usage recordings and monitoring tasks that will be carried out. 2. Problem Description Having real time video from transport to observe the events as they take place seems like a good strategy, because police could respond immediately if an assault is happening. But when talking about hundreds of transport units that could have the same technology, meaning that hundreds of videos are 978-1-4244-5353-5/10/$26.00 ©2010 IEEE 164

Upload: lekien

Post on 21-Apr-2018

218 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: FPGA Based System Prototype for Face Location and ...jmram/cvjmr/FPGA based conielecomp2010.pdf · FPGA Based System Prototype for Face Location and Recording of ... VHDL and synthesized

FPGA Based System Prototype for Face Location and Recording of People Getting on a Public Transport Bus.

Jorge Martínez-Carballido, Carlos García-Lucero, Juan Manuel Ramírez-Cortés, Rogerio Enríquez-Caldera

Electronics Department, National Institute of Astrophysics, Optics and Electronics {jmc, cgl, jmram,rogerio}@inaoep.mx

Abstract

Assault and robbery on the public transport is a social problem. Almost every day, people lose possessions and, in some cases, thieves or citizens get hurt or even die, in the process. In Mexico, attempts to reduce or prevent crime include vigilance by armed agents [1] and video cameras on public transport [2].

This work contributes to crime reduction and/or prevention on public transport by taking photographic records of people as they get on the bus, using an FPGA based system with camera and Secure Digital (SD) storage.

An FPGA is used as the processing device which contains all the necessary logic to obtain the images from the camera, process them to locate the face, extract face region and store them into a SD media using an uncompressed BMP file.

For face area location single channel image was used to extract regional attributes and their inter-regional relationship, resulting in a face detection algorithm with tolerance to skin tones and light conditions.

1. Introduction

The use of technology as a tool for preventing and solving crime is a natural response to the decreasing security. Also, faster and smaller integrated circuits (ICs) which can be used to process ever increasing image resolution and higher processing capability of information are now available at lower cost. Increasing demand of high resolution cameras of cellular phones, personal devices and laptops, lead to economies of scale and decrease of prices.

Technology to prevent and decrease assault and robbery on public transport, and public places, is

being used around the world. For instance, United Kingdom government has invested billions of pounds on video cameras to be used to monitor the security by police [3]. The initiative included:

Images database to track and identify offenders. Publish images of robbery cases on internet. National CCTV recordings and pictures database

of convicted offenders as well as unidentified suspects.

Although a good strategy to reduce crime was planed, only 3% of assaults were solved by the use of CCTV technology [3]. Problem consists on people monitoring the cameras because they had little training, and the task of watching video for hours is not effective. Also when something occurs, search for the exact moment of occurrence is a hard work.

After training and new strategies, CCTV technology was helpful in solving 15-20% of street robberies [3].

In Mexico, the use of technology to prevent crime is very limited and little information can be found. In June, 2009, the State of Puebla Congress’ security commission approved the use of camera installation on public transport [2]. This preventing technology is on its experimental phase, and Congress expressed its concern about the high cost of this technology for transport owners. The article didn’t mention about the usage recordings and monitoring tasks that will be carried out.

2. Problem Description

Having real time video from transport to observe

the events as they take place seems like a good strategy, because police could respond immediately if an assault is happening. But when talking about hundreds of transport units that could have the same technology, meaning that hundreds of videos are

978-1-4244-5353-5/10/$26.00 ©2010 IEEE 164

Page 2: FPGA Based System Prototype for Face Location and ...jmram/cvjmr/FPGA based conielecomp2010.pdf · FPGA Based System Prototype for Face Location and Recording of ... VHDL and synthesized

being received for their inspection monitoring task begins to be difficutheir option is to record the video ostoring media for later review/inspcourse means police could not assault is taking place.

As other countries had expermonitoring videos must be trained assigning a police to monitor videosmixed results, but purposely traineguarantee to have a stable performan

This work approach is to use digimedia to obtain still images (framfaces to store these into a storagehuman intervention during the proassault takes place, the face image already been stored on the medprevents the thief from erasing destroying the camera, for its lateAlso a massive publishing of thiprevent the thief from assaulting agapolice a good starting point to locthis delinquent. By no usingintervention during the process, theits operating cost due to the use ofand any kind of monitoring trainineeded.

3. Solution approach

The system has the follow

modules: camera [4], PSRAM [5], S[8], and people detection to the FPRAM, camera bus serial controllerand image processing modules are VHDL and synthesized on a Xil4FT256 FPGA as shown on Figure

Figure 1. System’s modules s

4. Detection

By using a 3 axis accelerometeon the steps of the bus entrance as s2, we can obtain accelerometer sign

in real time, the ult, then perhaps on some kind of

pection which of respond as the

rienced, people for this task, so

s generally gives ed police agents nce. ital cameras as a

mes) from people e media without ocess. When an of the thief has

dia, which also its picture by

er identification. is picture could ain, and give the cate and process g any human e system reduces f less equipment ng is no longer

wing peripheral SD [6] [7], RTC PGA. A Block r, I2C controller implemented in inx XC3S1000-1.

structure

er [9] positioned shown on Figure

nals to detect that

people has stepped on the bus astopped.

Figure 2. Accelerometer position o

Figure 3 shows the area of interedetect from the three axis signals thstepped on to the bus after it hasstopped.

Figure 3. Accelerometer

5. Face location module

First we consider that from the the 1.3 Megapixel camera, only a sof view is where the face of peopbus is possible to appear. Also it isa given bus, the area is different asthe person is on the first or secentrance. Tests with people of were made so that the First Step RSecond Step Region (SSR) could bwith a measuring stick. Figure 4 stests.

after the bus has

on bus entrance

est where one can hat somebody has s decelerated and

signals

e

field of view for sector of this field ple getting on the s possible that for s well as whether cond step of the different heights

Region (FSR) and be defined, along shows one of the

978-1-4244-5353-5/10/$26.00 ©2010 IEEE 165

Page 3: FPGA Based System Prototype for Face Location and ...jmram/cvjmr/FPGA based conielecomp2010.pdf · FPGA Based System Prototype for Face Location and Recording of ... VHDL and synthesized

Figure 4. FSR and SSR test

Figure 4 shows First and Seconwith a rectangle of n by m pixels asapplication. Next to locate a face winterest we took the approach of properties of the sub-image of interof its possible dimensions andinterrelationship to form the sub-ima person dresses with clothes of simskin color on the selected planepotential of finding regions represWith this in mind a filter was used detection.

A binary matrix was used to regions with properties that make tto be a face region.

Figure 5. Binary matrix of local

In the binary matrix a 0 repreregion is not a candidate, and a 1local region is a candidate. Figuimage and its corresponding binary m

images

nd Step Region s obtained for an

within a region of f selecting local rest in a fraction d define their

mage. Now when milar response to e, we have the senting clothing. to improve face

represent local them candidates

properties

esents this local 1 represents this ure 5 shows an matrix.

Figure 6. Binary matrix XOR

After de-noising the binary filtering is needed to exclude achieved by considering physical properties of a face, as comparsection of a body. A sample is of tshown on Figure 7.

Figure 7. Isolated face r

6. FPGA implementation

The tool used to synthesize, plaimplement the VHDL code into thewas the free software from Xilinx11.2. Figure 8 shows the RTL schVHDL TOP module. This figure shof SD, PSRAM, Camera, XOCoordinates modules generatedWebPACK.

Figure 8. TOP module’s RTL

R filtered

matrix, another clothing, this is and dimensional

red to the trunk this application is

region

ce and route, and e Spartan3 FPGA : ISE WebPACK hematic from the hows the location

OR filtering and d by the ISE

Schematic

978-1-4244-5353-5/10/$26.00 ©2010 IEEE 166

Page 4: FPGA Based System Prototype for Face Location and ...jmram/cvjmr/FPGA based conielecomp2010.pdf · FPGA Based System Prototype for Face Location and Recording of ... VHDL and synthesized

7. Results and Tests

Several tests for good locationprototype within the bus where shows the location used for the follo

Figure 9. Prototype location wit

Tests were taken for First and the entrance ladder on the bus. Imagare gray scale, but originals are informat.

Figure 10 show images stored osystem’s prototype, once taken anlocated face on first and seconconfigured on the FPGA.

Figure 10 gray scale images for

8. Conclusions

As final result we have a prototFPGA with a 1.3 pixel camera mmodule that is capable of taking imwhen they get on the bus on first aof the entrance. From the whole 1.3system on the FPGA locates the only the face region found on its coror second step region, this helps tothat have gotten on the bus, saves scard and in the event of assault orthese images could be useful to locapersons involved, thus helping to

n of the system tried, Figure 9

owing tests.

thin the bus

Second steps of ges on this paper n RGB555 BMP

on the SD of the nd extracted the nd step regions

FSR and SSR

type based on a module, SD card mages of people and second steps 3 megapixel, the face and stores rresponding first

o identify people space on the SD r some incident, ate the person or diminish events

by helping authorities to identilegally any person breaking the law

Use of local properties on a singallowed a successful location of tests and its FPGA system implemthan algorithms on literature thatpoint and/or trigonometric operatio[13].

9. References

[1] Saúl Vela. Milenio.com. Marc[Online]. http://www.milenio.com/node[Accessed: September 03, 200

[2] Gerardo Orta. periodicodigita2009. [Online]. http://www.periodicodigital.co?option=com_content&task=v&Itemid=67. [Accessed: Septe

[3] Owen Bowcott. guardina.co.u[Accessed: December 12, 200

[4] Omnivision. (2005, August) OColor CMOS SXGA (1.3 MegConcept Camera Module withTechnology.

[5] Micron Technology, Inc. (200Async/Page/Burst CellularRA

[6] SANDISK CORPORATION. Disk Secure Digital (SD) CardManual, Rev 1.9©.

[7] SD Group. (2000, February) SSpecifications. Part 2 FILE SYSPECIFICATION.

[8] Philips Semiconductors. (1997Clock/calendar with 240 x 8-b

[9] Pavel Lajšner and Radomír KoUsing the MMA7360L ZSTA

[10] Shanq-Jang Ruan, Mon-Chau Yu-Ting Pai, "A SIMPLE ANCOLOR FACE DETECTIONIN COMPLEX BACKGROUNInternational Conference on MExpo, vol. 1-5, Taipei, Taiwan

ify and process w on a bus.

gle channel image faces on all the

mentation simpler t require floating ons [10] [11] [12]

ch 2009.

e/189736. 09]

l.com.mx. June

om.mx/index.phpview&id=92893ember 03, 2009]

uk. May 2008. 08]

OV9650FSL gaPixel). h OmniPixel®

04) AM™ 1.5.

(2003) Sand d Product

SD Memory Card YSTEM

7, Jul) PCF8583 bit.

ozub. (2007) R2 Demo Board.

Shie, Yi-Chi Liu ND ACCURATE N ALGORITHM

ND," in Multimedia and n, 2006, pp.

978-1-4244-5353-5/10/$26.00 ©2010 IEEE 167

Page 5: FPGA Based System Prototype for Face Location and ...jmram/cvjmr/FPGA based conielecomp2010.pdf · FPGA Based System Prototype for Face Location and Recording of ... VHDL and synthesized

1545-1548.

[11] Zhe-Ming Lu, Xiu-Na Xu Wei-Min Zheng, "A Novel Skin Clustering Method for Face Detection," in First International Conference on Innovative Computing, Information and Control, vol. 3, China, 2006, pp. 166-169.

[12] Chang-Tsun Li Wen-Hsiang Lai, "Skin Colour-based Face Detection in Colour Images," in International Conference on Advanced Video and Signal Based Surveillance, UK, 2006, pp. paper56:1-6.

[13] K.D. Adaos, G.P. Alexiou V. Mariatos, "Design and Implementation of a Reconfigurable, Embedded Real-Time Face Detection System," in INTERNATIONAL WORKSHOP ON RAPID SYSTEM PROTOTYPING, Greece, 2007, pp. 65-68.

978-1-4244-5353-5/10/$26.00 ©2010 IEEE 168