cs 523 (cs 423/ee 533) computer vision lecture 1 introduction to computer vision

CS 523 (CS 423/EE 533)

Computer Vision

Lecture 1INTRODUCTION TO COMPUTER VISION

About the Course

http://vvgl.ozyegin.edu.tr

Objective

Introduction to the theory, tools, and algorithms of 3D computer vision

Instructor

Assist. Prof. M. Furkan Kıraç

E-mail: furkan.kirac@ozyegin.edu.tr

Room: 219

Wednesdays, 10:40-13:30, Room: 241

Grading

Projects: 6x10%

Final Exam: 40%

Syllabus

Short Projects:Late submissions are not accepted. Copying answers from others’ work is not permitted.

Final Exam:At least 3 of the 6 Short Projects must be turned in by the due date in order to qualify for the Final Exam. No make-up will be given for the Final Exam. Students can take the Bütünleme exam if they miss the Final Exam.

Grading

Computer Vision: Algorithms and Applications, Richard Szeliski, Springer, 2010.

Computer Vision: A Modern Approach, David A. Forsyth and Jean Ponce, Prentice-Hall, 2002.

Introductory Techniques for 3D Computer

Vision, Emanuele Trucco and Alessandro Verri, Prentice-Hall 1998.

Recommended Books

Learning OpenCV, Gary Bradski and Adrian Kaehler, O'Reilly, 2008.

OpenCV 2 Computer Vision Application Programming Cookbook, Robert Laganiere, Packt Publishing, 2011.

Mastering OpenCV with Practical Computer Vision Projects, Daniel Lelis Baggio, et al., Packt Publishing, 2012.

OpenCV Resources

Week Lectures

24 September 2014 Lecture 1

1 October 2014 Lecture 2

5 November 2014 Lecture 7

3 December 2014 Lecture 11

31 December 2014 Lecture 15 ?

Applications of Computer Vision

Image Stitching

Image Matching

Object Recognition

3D Reconstruction

Interior Modeling

3D Augmented Reality

3D Camera Tracking

Stereo Conversion for 3DTV

Depth Estimation and View Interpolation for 3DTV

Human Tracking

License Plate Recognition

Human Pose Estimation

Course Outline

3D geometry fundamentals Transformations and projections Camera calibration Feature detection and matching Image stitching Single view geometry Two view geometry Multiple view geometry Stereo vision and depth estimation 3D structure from motion 3D camera tracking

Topics to be covered

Relation to Other Fields

Computer Vision

Figure from "Computer Vision: Algorithms and Applications,” Richard Szeliski, Springer, 2010.

Lights and materials Shading Texture mapping Environment effects Animation 3D scene modeling 3D character modeling (OpenGL)

Computer Graphics

Resampling Enhancement Noise filtering Restoration Reconstruction Segmentation Image compression (MATLAB and OpenCV)

Image Processing Topics

Image Processing

Spatio-temporal sampling Motion estimation Frame-rate conversion Multi-frame noise filtering Multi-frame restoration Super-resolution Video compression (MATLAB & OpenCV)

Video Processing Topics

Video acquisition-display chain

Capture Representation Coding

Transmission Decoding Rendering

Human vs. Computer

Optical illusions

Actual vs. Perceived Intensity (Mach band effect)

Brightness Adaptation of the Eye

Optical illusions

Why is Computer Vision Difficult?

Human perception

Human Visual System

Human Eye

Photoreceptors: Rods & Cones

Rods vs. Cones

RodsPerceive brightness onlyNight vision

ConesPerceive colorDay visionRed, green, and blue cones

Cone Distribution

Blue is less-focused

Visual Threshold drop during Dark Adaptation

Spatial Resolution of the Human Eye Photopic (bright-light) vision:

Approximately 7 million cones Concentrated around fovea

Scotopic (dim-light) vision Approximately 75-150 million rods Distributed over retina

(HDTV: 1920x1080 = 2 million pixels)

Frequency Responses of Cones

Same amount of energy produces different sensations of brightness at different wavelengths

Green wavelength contributes most to the perceived brightness.

Trichromatic Color Mixing

Any color can be obtained by mixing three primary colors Red, Green, Blue (RGB) with the right proportion

valuessTristimulu :,3,2,1

kk TCTC

Image Formation

Human Eye vs. Camera

Camera components Eye components

Lens Lens, cornea

Shutter Iris, pupil

Film Retina

Cable to transfer images Optic nerve to send the incident light information to the brain

Human Vision

Image formation

Pin-Hole Camera Model

Point Spread Effect

Out-of-Focus Blur

Shrinking the Aperture

Converging Lens

Correction with a Converging Lens

Perfectly In-Focus for a Certain Distance Only

“circle of confusion”

Depth-of-Field

“Sharp Image” within Depth-of-Field due to Finite Sensor Size

Focal Length (F) and Depth (Z)

Aperture Size Affects Depth-Of-Field

f / 5.6

f / 32

Aperture

Camera f-number

Exposure Time

Motion Blur Effect due to Finite Exposure Time

Decrease in aperture implies…

Increase in depth-of-field Decrease in motion blur Decrease in exposure

2D Image Representation

Image Capture

(Courtesy Gonzalez & Woods)

Digital Image Capture

Light sensitive diodes convert photons to electrons

Color Image Capture: Single vs. Three CCD Arrays

RGB splitter(three separate imaging sensors, higher resolution)

Bayer filter(cheaper but introduces spatial resolution loss)

Digital Camera Issues

Noise caused by low light

Color color fringing (chromatic aberration) artifacts from Bayer patterns

Blooming charge overflowing into neighboring pixels

In-camera processing over-sharpening can produce halos

Compression creates blocking artefacts

Digitization: Sampling and Quantization

Over Sampling

Over Quantization

Images as Matrices of Integers

126 127 126

125 126 127

123 126 125

128 127 124

123 120 144

121 128 155

126 123 127

120 122 124

119 121 123

122 142 162

130 157 161

145 162 164

0 ≤ s(m,n) ≤ 255 } quantization

0 ≤ m ≤ M-1

0 ≤ n ≤ N-1

MxN 8-bit gray-scale (intensity, luminance) image

sampling

0 → black, 255 → white

Images as Functions

We can think of an image as a function, f, from R2 to R: f( x, y ) gives the intensity at position ( x, y ) Realistically, we expect the image only to be defined

over a rectangle, with a finite range:• f: [a,b]x[c,d] [0,1]

A color image is just three functions pasted together. We can write this as a “vector-valued” function:

( , ) ( , )

f x y g x y

RGB Color Bands (Channels)

Green Blue

YUV Bands

Also called Y Cb Cr Y : Luma

Cb : Chrominance_blueCr : Chrominance_red

U (Cb)

YUV-RGB Conversion

Summary

Human visual system

Pin-hole camera model

Image representation

Summary

How to find camera parameters? Where is the camera, where is it directed at? What is the movement of the camera?

Where are the objects located in 3D? What are the dimensions of objects in 3D? What is the 3D structure of a scene?

How to process stereo video? How to detect and match image features? How to stitch images?

Problems to be Addressed

cs 523 (cs 423/ee 533) computer vision lecture 1 introduction to computer vision

computer vision difficult

computer visionlecture

computer visionabout

final exam

btnleme exam

human tracking

view interpolation

human eyephot

Documents

cs 2770: computer vision

new cs 534: computer vision appearance-based vision ·...

cs 4495 computer vision classification 2 - gatech.edu

cs 1699: intro to computer vision...

welcome to cs 675 – computer vision fall 2014

cs 534: computer vision stereo...

cs 556: computer vision lecture 8

cap 6412 advanced computer vision - cs department

cs 534: computer visionelgammal/classes/cs534/lectures/...cs...

cs 4495 computer vision introduction - college of...

cs 231a computer vision...

cs 231a computer vision sample...

cs 496: computer vision thanks to chris bregler. cs 496:...

cs 231a computer vision midterm - stanford...

cs 4495 computer vision motion and optic flow€¦ · cs...

cs 1699: intro to computer vision...

cs 534: computer vision camera...

computer vision, cs 763 - cse, iit...

cs 1699: intro to computer vision...

cs 4495 computer vision - college of computing