asme imece2010 submitted final 2010-6-11

6
1 Copyright © 2010 by ASME “DRAFT” Proceedings of the ASME 2010 International Mechanical Engineering Congress & Exposition IMECE2010 November 12-18, 2010, Vancouver, British Columbia, Canada IMECE2010-39118 MINIA TURE STEREO IMAGING CONVERTER WITH TRANSLATING APERTURE Wook Choi Mechanical and Aerospace Engineering Dept. University of California, Los Angeles (UCLA) Los Angeles, CA, USA [email protected] Vladimir Rubtsov Intelligent Optical systems, Inc. (IOS) Torrence, CA, USA [email protected] Chang-Jin “CJ” Kim Mechanical and Aerospace Engineering Dept. University of California, Los Angeles (UCLA) Los Angeles, CA, USA [email protected] ABSTRACT Depth information from an image can greatly increase the work efficiency when observing or inspecting objects, because the size, distance, and relative locations can be estimated. Various stereo imaging methods are being used to find depth information in a wide range of application fields, typically by  placing multiple optical systems side-by-side to create multiple shifted images. In this study, we develop a miniature stereo image generating device, which can augment an existing single optical system (i.e., a two-dimensional images capturer) with three-dimensional capability. Developed with MEMS technology, the device consists of a single translating aperture, which shifts laterally between two positions (up to 100 µm apart demonstrated) by means of electrostatic comb actuators. Attached at the objective end of conventional (i.e., nonstereo) optical systems, this stereo converter with an aperture 900 µm in diameter is shown to successfully generate slightly different viewing angles, providing stereo images. Being miniaturized, this device is suitable for microscopic or endoscopic applications, where the size of the system is limited or axial depth of focus is relatively large. 1. INTRODUCTION Analyzing two-dimensional (2D) images to extract three- dimensional (3D) information is a complex and time- consuming task mostly relying on limited information such as shading or surface discontinuity [1-2]. The difficulties in finding the 3D information, however, can be greatly reduced if any depth information about an object is provided for the image analysis. To find such depth information, various stereo imaging methods are used these days, clarifying the size, distance, and 3D shapes of objects in a range of application fields. The most common method is using two or more separate imaging systems having certain distances between them [3-5] to provide multiple viewing angles using triangulation methods. However, such multiple imaging systems lead to an increase in size and structural complexity of the overall system. To address this issue, several stereo systems with a single imaging system [6-8] or an off-axis aperture [9-11] have been introduced. We have previously proposed dramatic miniaturization of those stereo systems through use of micro electro-mechanical systems (MEMS) technologies, reporting success with a single glass disc that flips around the neutral angle in the light path [12]. The operating mechanism for the flipping was electrostatic actuation at resonance, consuming low power. The glass parts, including the glass disc, were fabricated by thermal molding into a micromachined silicon mold, followed by lapping and polishing steps. Although the device was successful, we have since been exploring new designs suitable for more conventional MEMS fabrication, which would be more reliable and low-cost. In this study, we report an all-silicon (i.e., no glass parts) device much simpler in structure and fabrication processes. The completed device is attachable to an existing optical system

Upload: annielee7298

Post on 08-Apr-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

8/6/2019 ASME IMECE2010 Submitted Final 2010-6-11

http://slidepdf.com/reader/full/asme-imece2010-submitted-final-2010-6-11 1/6

1 Copyright © 2010 by ASME

“DRAFT” Proceedings of the ASME 2010 International Mechanical Engineering Congress & ExpositionIMECE2010

November 12-18, 2010, Vancouver, British Columbia, Canada

IMECE2010-39118

MINIATURE STEREO IMAGING CONVERTER WITH TRANSLATING APERTURE

Wook ChoiMechanical and Aerospace Engineering Dept.University of California, Los Angeles (UCLA)

Los Angeles, CA, [email protected]

Vladimir RubtsovIntelligent Optical systems, Inc. (IOS)

Torrence, CA, [email protected]

Chang-Jin “CJ” KimMechanical and Aerospace Engineering Dept.University of California, Los Angeles (UCLA)

Los Angeles, CA, [email protected]

ABSTRACT

Depth information from an image can greatly increase the work 

efficiency when observing or inspecting objects, because the

size, distance, and relative locations can be estimated. Various

stereo imaging methods are being used to find depth

information in a wide range of application fields, typically by

 placing multiple optical systems side-by-side to create multiple

shifted images. In this study, we develop a miniature stereo

image generating device, which can augment an existing single

optical system (i.e., a two-dimensional images capturer) with

three-dimensional capability. Developed with MEMS

technology, the device consists of a single translating aperture,

which shifts laterally between two positions (up to 100 µm

apart demonstrated) by means of electrostatic comb actuators.

Attached at the objective end of conventional (i.e., nonstereo)

optical systems, this stereo converter with an aperture 900 µm

in diameter is shown to successfully generate slightly different

viewing angles, providing stereo images. Being miniaturized,

this device is suitable for microscopic or endoscopic

applications, where the size of the system is limited or axial

depth of focus is relatively large.

1. INTRODUCTION

Analyzing two-dimensional (2D) images to extract three-

dimensional (3D) information is a complex and time-

consuming task mostly relying on limited information such as

shading or surface discontinuity [1-2]. The difficulties in

finding the 3D information, however, can be greatly reduced if

any depth information about an object is provided for the image

analysis. To find such depth information, various stereo

imaging methods are used these days, clarifying the size

distance, and 3D shapes of objects in a range of application

fields. The most common method is using two or more separate

imaging systems having certain distances between them [3-5]

to provide multiple viewing angles using triangulation

methods. However, such multiple imaging systems lead to an

increase in size and structural complexity of the overall system

To address this issue, several stereo systems with a single

imaging system [6-8] or an off-axis aperture [9-11] have been

introduced.

We have previously proposed dramatic miniaturization of

those stereo systems through use of micro electro-mechanica

systems (MEMS) technologies, reporting success with a single

glass disc that flips around the neutral angle in the light path

[12]. The operating mechanism for the flipping was

electrostatic actuation at resonance, consuming low power. The

glass parts, including the glass disc, were fabricated by therma

molding into a micromachined silicon mold, followed by

lapping and polishing steps. Although the device was

successful, we have since been exploring new designs suitable

for more conventional MEMS fabrication, which would be

more reliable and low-cost.

In this study, we report an all-silicon (i.e., no glass parts)

device much simpler in structure and fabrication processes. The

completed device is attachable to an existing optical system

8/6/2019 ASME IMECE2010 Submitted Final 2010-6-11

http://slidepdf.com/reader/full/asme-imece2010-submitted-final-2010-6-11 2/6

2 Copyright © 2010 by ASME

(e.g., a camera or a microscope) to add stereo imaging

capability if a correctly sized aperture is used.

2. IMAGING PRINCIPLE

Several off-axis aperture stereo imaging methods have

 been developed as alternatives to multiple-lens stereo imagingmethods to avoid the large size and complexity of the usual

multiple-lens systems. Such off-axis aperture methods include

the use of pinholes (i.e., no lens) located at a certain distance

from the sensor which is widely used in the pinhole stereo

 photography, and the use of multiple apertures off-axis in front

of an imaging system with lenses [10]. However, the pinhole

method generally requires a significant exposure time or strong

illumination, which is not suitable for general imaging

applications, while the multiple-apertures method requires

several imaging filters and post imaging processes to extract

separate images coming from each aperture. To avoid those

limitations and complications, a stereo imaging method with a

single translatable aperture placed in front of an existingimaging system [11] is used in this study.

Fig. 1 shows three objects – a square, a triangle, and a

circle – in front of the objective lens of the imaging system,

with the triangle at the focusing distance of the lens. Because of 

their different physical distances from the lens, their imaging

locations behind the lens (L1, L2, and L3, respectively) are also

different. If all those three objects are lined up along the optical

axis, it is hard to distinguish their relative locations (Fig. 1(a))

simply by observing the images on the sensor. However, when

a screen with an aperture is placed in front of the lens and

translates up and down, the images of the square and the circle

shift down and up, respectively, while the image of the triangle

remains still. That is, objects closer or farther than the lens

focusing distance have their images on the sensor shifted in thesame or the opposite direction, respectively, as the aperture

translation, providing views from different angles as shown in

Fig. 1 (b) and (c).

If a thin lens is assumed as in Fig. 1 and the aperture is very

close to the lens, the image shift at the sensor can be estimated

 by the following equation [9];

 D D F v

h

1111+⎟

 ⎠

 ⎞⎜⎝ 

⎛ −=  

where

Fig. 2 shows the actual image shift test using a translating

aperture. There are two pens placed at different distances; the

 ballpoint pen (without a cap) in the front is closer to the camera

than the marker pen (with a cap) in the back. A screen with an

aperture 1 mm in diameter is placed in front of the camera. The

camera is kept focused on the marker pen in the back during

this test. As the aperture is moved up and down, only the image

of the pen in the front which is not in focus shifts while the

image of the pen in the back on which the camera focuses doesnot move, providing depth information to the observer tha

cannot be easily obtained using a regular 2D camera. This

experiment confirms that when an aperture of adequate size is

used in front of the imaging system, only the image of objects

not in focus shifts as the aperture translates, which can be used

to find the relative locations of each object.

(a)

(b)

(c)

Figure 1. Image shift by aperture translation. (a) Objects at

different locations have different imaging distances behind

the lens. When an aperture is used and translates upward (b)

or downward (c), each object has its image on the sensor 

shifted according to its relative location.

Figure 2. Captured image shift by a camera. A screen with

an aperture 1 mm in diameter is placed in front of the

camera. The camera focuses on the capped marker pen in

the back. As the aperture translates up and down, only the

image of the opened ballpoint pen in the front shifts while

the image of the pen in the back remains still.

d : distance to an object in front of the lens

v: aperture translating distanceh: image shift at the image sensor 

 D: distance to a plane conjugate to sensor plane

 F : focal length of the lens

8/6/2019 ASME IMECE2010 Submitted Final 2010-6-11

http://slidepdf.com/reader/full/asme-imece2010-submitted-final-2010-6-11 3/6

3 Copyright © 2010 by ASME

3. MINIATURIZED DEVICE DESIGN

Fig. 3 illustrates the implementation of the stereo converter 

introduced in the previous section in the form of a miniaturized

stereo converting device in conjunction with a lens system. A

silicon screen with an aperture patterned at the center is located

right in front of an optical system. With the translation motionof this screen, images of objects viewed through the aperture

shift to generate stereo images from what would otherwise be a

  planar 2D image, giving the attached optical system 3D

imaging capabilities. Required aperture size depends on (1) the

imaging applications because the aperture size determines the

images’ depth of focus, (2) the light sensitivity of the imaging

system, and (3) the illumination setup used for the imaging test

  because the brightness of the image through the aperture is

 proportional to the square of the aperture size.

Fig. 4 shows a simplified view of the stereo converter’s

design. The entire converter is made of silicon, and the central

screen with the aperture at the center is suspended by folded

spring structures which guide the screen’s translating motion.

The translation is generated by two sets of electrostatic comb

drive actuators patterned on the screen. By alternately applying

an electric potential to the comb drives, the aperture’s

translation motion can be achieved. Considering the minimal

overall device size for future endoscopic applications, the

folded spring fixtures are designed to have 1 mm of length.

The spring constant of the folded spring fixtures in Y-

direction is given by

3

32

 L

wt  E  K Y 

⋅⋅⋅=  

With the given folded spring length, the translating

distance is determined by the number of the comb fingers at the

actuators and the width of the folded springs, and can be

calculated by

2

3

3

4

w E  g 

 Ln DY 

⋅⋅⋅

⋅⋅=

ε 

 

where

4. DEVICE FABRICATION

The miniaturized 3D image converter is fabricated by

using a series of dry etching methods on a silicon-on-insulator

(SOI) wafer used as a starting material. The fabrication

sequence in detail is shown in Fig. 5. The SOI wafer used for

the device in this study has 50 µm of device silicon, 0.5 µm o

 buried oxide (BOX), and 500 µm of handle silicon layer (Fig

5(a)). To avoid additional processes for metal pads for electric

contact, a highly doped device layer is used. As the first step

Figure 4. Illustration of the stereo-generating silicon device.

A screen with an aperture in the center translates (up anddown in the figure) by the opposing comb drive actuators.

The translation is guided by the attached folded spring

structures.

L

Aperture

Comb driveactuator 

Comb driveactuator 

Y

X

Translating aperture

device

Objective lens

Figure 3. A proposed stereo imaging system. A silicon

microdevice with a translating aperture screen is located

right in front of the optical system to generate image shifts.

ɛ: permittivity constant of air 

n: number of gaps between the comb fingers

 L: length of the folded spring

G: gap between comb fingers

 E : Young’s modulus of the silicon

w: width of the folded springs

V : applied voltage

8/6/2019 ASME IMECE2010 Submitted Final 2010-6-11

http://slidepdf.com/reader/full/asme-imece2010-submitted-final-2010-6-11 4/6

4 Copyright © 2010 by ASME

silicon dioxide layers are deposited on top and bottom of the

wafer by plasma enhanced chemical vapor deposition

(PECVD) method, followed by patterning those layers using

reactive ion etching (RIE) (Fig. 5(b)). After that, the silicon

handle layer is etched deeply (Fig. 5(c)) by using a deep

reactive ion etching (DRIE) or using an anisotropic wet etching

method (i.e., KOH etching). This step is to free the silicondevice layer on the top so that the patterned structure can move

freely, while the remaining thick structures in the handle layer 

will be used to provide mechanical strength to certain areas on

the thin device layer at the points of electric contact. The device

layer with the aperture screen and the comb drive actuator is

then patterned by DRIE, followed by silicon dioxide layer 

removal by RIE (Fig. 5(d)), finalizing the device fabrication.

The fabricated device is not only much simpler in

fabrication process with only two silicon etching steps, but also

much more robust compared to the flipping device with a thick 

glass plate [12] because no heavy structures need to be

suspended by the folded springs, as shown in the final cross

section view (Fig. 5(d)).

5. DEVICE OPERATION

Fig. 6 (a) shows the fabricated silicon stereo converting

device. It is 5 mm in overall diameter and has an aperture 900

µm in diameter. The design with 150 moving comb fingers and

1 mm-long folded spring fixtures, among several variations

has been used for device tests in this study. The gap between

the opposing fingers is 3 µm. The device layer has been

  patterned to form both signal and ground electrodes. For the

secure electrical connections, copper tapes were used to deliver

the driving and ground electric signals directly to each

electrode with silver paint brushed around the contact point between the tapes and the device electrodes.

Fig. 6 (b) and (c) are microscopic views of the white

dotted circle area in Fig. 6 (a), with ruler bars on the left with

10 µm increments. Fig. 6 (b) shows the comb drive actuator

with top stationary and bottom moving comb fingers when no

voltage is applied to the actuator, which is the initial finger

  position as fabricated. Fig. 6 (c) shows the comb driv

actuation when 44 V is applied to the actuator, translating 60

µm of distance when 4 µm-wide folded spring is used.

Using the same device, up to 62 µm of screen translation

could be achieved at 45 V of applied voltage, as shown in Fig

7. When 5 µm-wide springs were used, up to 60 µm of finger

translation could be achieved at 58 V before some fingers

laterally touched each other. Even though 4 µm- and 5 µm-

wide folded spring fixtures were used, their translating motion

 behaved like the ones with the slightly thinner spring widths

This was because of the tapering down of the 50 µm-thick

folded spring structures during the DRIE process, resulting in

the narrower spring bottom width than the top.

Figure 5. Process flow for the silicon stereo converter 

device

Silicon

Oxide

Silicon

(b)

(c)

(a)

(d)

SOI as starting material

Oxide deposition and patterning

(Top and bottom surfaces)

Backside anisotropic etching

Frontside etching and oxide removal

Aperture Comb drive

Figure 6. (a) Photo of the fabricated silicon device 5 mm in

overall diameter. Microscopic views of the white dotted

circle area are shown in (b) and (c) during the comb drive

operation. (b) Initial comb finger location when no voltage

is applied, and (c) 60 µm of aperture translation when 44 V

is applied.

Moving fingers

Stationary fingers(b)  (c) 

   5  m  m 

(a) 

+

+

--

8/6/2019 ASME IMECE2010 Submitted Final 2010-6-11

http://slidepdf.com/reader/full/asme-imece2010-submitted-final-2010-6-11 5/6

5 Copyright © 2010 by ASME

6. VIEWING ANGLE CHANGES

A stereo system on a regular point-and-shoot camera (NV3,

Samsung Electronics) was built by attaching the fabricated

silicon device right in front of the camera lens. Additional

 black screen material is attached in front of the silicon device to

 block unnecessary light coming through the spring and comb

drive areas. Fig. 8 (a) shows a conventional 2D photo of a

LEGO® figure (without using the stereo converter) 4.5 cm in

height placed approximately 10 cm from the camera. The background is a hallway with doors at the end. Fig. 8 (b) and

(c) are photos taken with the camera optically zoomed to the

dotted square area shown in Fig. 8 (a) with the fabricated stereo

converter attached in the front. Keeping the camera focusing on

the doors during the experiment, the aperture screen translated

horizontally 50 µm each direction. As the aperture translated to

the left and right, the image of the LEGO® figure, which was

closer and not in focus, shifted to the right and left,

respectively, while the image of door at the end of the hallway

did not move.

The same silicon stereo converter was used for a

microscopic stereo imaging test as in Fig. 9, which shows the

images of a ruler covering approximately 3 cm captured at thetwo extrema aperture locations. As the aperture makes ±50 µm

horizontal translation, the distance between each marking on

the ruler expanded and contracted due to the slight change in

viewing angle. The microscope focused on the marking in the

middle (at 6.5 cm location), such that the image of that point

remains unmoved. As in these experiments, relative locations

or views from different angles of objects can be found by

observing the directions and the amount of each object’s image

shift.

CONCLUSIONS

Sets of stereo images can be used as a powerful tool to find

3D information when observing or inspecting an object by

  providing depth information. This paper presented

microfabricated stereo imaging generator using a relatively

simple stereo imaging method with a single translatable

aperture. This miniaturized stereo converter was shown to

Figure 9. Recordedmicroscopic images of a

ruler viewed through the

translating aperture device.

As the aperture translates

±50 µm horizontally, the

width of each division on

the ruler changes (left) due

to the viewing angle

change, which is equivalent

to looking at the ruler from

two different angles

(above).

Figure 8. (a) 2D photo showing a LEGO® figure in the front

and a door in the back at the end of the hallway without thestereo converter attached. When the converter is attached

and the aperture translates left and right, the image of the

figure shifts (b) to the right and (c) left, respectively. For (b)

and (c), the camera focuses on the door in the back whose

image remains still.

(a) (b)

(c)

   4 .   5  c  m

3 µm-wide springcalculated

4 µm-wide springcalculated

4 µm-widespring tested

5 µm-wide springtested

5 µm-wide springcalculated

Applied voltage (V)

   A  p  e  r   t  u  r  e   t  r  a  n  s   l  a   t   i  o  n   (  µ  m   )

Figure 7. Aperture translation by comb drive actuators. 4µm and 5 µm-wide folded springs were used for the testing.

8/6/2019 ASME IMECE2010 Submitted Final 2010-6-11

http://slidepdf.com/reader/full/asme-imece2010-submitted-final-2010-6-11 6/6

6 Copyright © 2010 by ASME

successfully generate image shift when attached and operated

in front of imaging systems, even with the limited translating

distances microfabricated silicon actuators can provide. The

  presented technique would be especially beneficial for 

microscopic object inspection or endoscopic applications where

extraction of depth information from 2D images is more

challenging due to the relatively high axial depth of focuswithin the working distance of the device.

ACKNOWLEDGEMENTSThe project has been supported by the Small Business

Innovation Research (SBIR) grants from National Institutes of 

Health (NIH). Authors want to thank Mr. James Jenkins for 

valuable discussions and the input regarding the assembly of 

the paper.

REFERENCES

[1] Zhang, R., Tsai, P., Cryer, J. E., and Shah, M., 1999, “Shape

from shading: A survey”, IEEE Transactions on Pattern

Analysis and Machine Intelligence, vol. 21, pp. 690-706.

[2] Saxena, A., Sun, M., and Ng, A. Y., 2007, “Learning 3-d

scene structure from a single still image”, in Proceedings of 

IEEE International Conference on Computer Vision, pp. 1-8,

Rio de Janeiro, Brazil.

[3] Durrani, A. F. and Preminger, G. M., 1995, “Three-

dimensional video imaging for endoscopic surgery”,

Computers in Biology and Medicine, vol. 25, pp. 237-247.

[4] Schreier, H. W., Garcia, D., and Sutton, M. A., 2004,

“Advances in light microscope stereo vision”, Experimental

Mechanics, vol. 44, pp. 278-288.

[5] Okutomi, M. and Kanade, T., 1993, “A multiple-baseline

stereo”, IEEE transactions on Pattern Analysis and Machine

Intelligence, vol. 15, pp. 353-363.

[6] Goshtasby, A. and Gruver, W. A., 1993, “Design of a single-

lens stereo camera system", Pattern Recognition, vol. 26, pp.

923-937.

[7] Gao, C. and Ahuja, N., 2004, “Single camera stereo using

  planar parallel plate”, in Proceedings of the International

Conference on Pattern Recognition, vol. 4, pp. 108-111,

Cambridge, UK.

[8] Lee, D. H. and Kweon, I. S., 2000, “A novel stereo camera

system by a biprism”, IEEE Transactions on Robotics and

Automation, vol. 16, pp. 528-541.

[9] Adelson, E. H. and Wang, J. Y. A., 1992, “Single lens stereo

with a plenoptic camera”, IEEE Transactions on Pattern

Analysis and Machine Intelligence, vol. 14, pp. 99-106.

[10] Seo, S., 2005, “Stereo-image capturing device”, US Paten

 No. US 6,977,674 B2.

[11] Dou, Q. and Favaro, P., 2008, “Off-axis aperture camera

3d shape reconstruction and image restoration”, in Proceedings

of IEEE Conference on Computer Vision and Pattern

Recognition, pp. 1-7, Anchorage, USA.

[12] Choi, W., Akbarian, M., Rubtsov, V., and Kim, C.-J., 2009

“Microfabricated flipping glass disc for stereo imaging in

endoscopic visual inspection”, in Proceedings of IEEE

International Conference on Micro Electro Mechanica

Systems, pp. 160-163, Sorrento, Italy.