topic 9 (pattern recognition)

20
CSE 473: Digital Image CSE 473: Digital Image Processing and Pattern Processing and Pattern Recognition Recognition Spring 2015 Spring 2015 Course Teacher: Course Teacher: Md. Tarek Habib Md. Tarek Habib Assistant Professor Assistant Professor Department of Computer Science and Department of Computer Science and Engineering Engineering Green University of Bangladesh Green University of Bangladesh Topic – 9: Topic – 9: Pattern Pattern Recognition Recognition

Upload: arch-sidz

Post on 10-Aug-2015

23 views

Category:

Education


7 download

TRANSCRIPT

Page 1: Topic   9 (pattern recognition)

CSE 473: Digital Image Processing CSE 473: Digital Image Processing and Pattern Recognitionand Pattern Recognition

Spring 2015Spring 2015

Course Teacher:Course Teacher:Md. Tarek HabibMd. Tarek HabibAssistant ProfessorAssistant Professor

Department of Computer Science and Department of Computer Science and EngineeringEngineering

Green University of BangladeshGreen University of Bangladesh

Topic – 9: Topic – 9: Pattern RecognitionPattern Recognition

Page 2: Topic   9 (pattern recognition)

IntroductionIntroduction Features, Features Vectors, Features, Features Vectors, and Classifiersand Classifiers Supervised, Unsupervised, Supervised, Unsupervised, and Semi-supervised Learningand Semi-supervised Learning

2

Lecture OutlineLecture Outline

Md. Tarek HabibMd. Tarek Habib

Page 3: Topic   9 (pattern recognition)

Introduction

Pattern recognition is the scientific

discipline whose goal is the classification of

objects into a number of categories or classes. Depending on the application, these objects

can be images or signal waveforms or any type

of measurements that need to be classified. We will refer to these objects using the generic

term patterns. Pattern recognition is an integral part of most

machine intelligence systems built for decision

making.3

Md. Tarek HabibMd. Tarek Habib

Page 4: Topic   9 (pattern recognition)

Introduction

Typical application areas of pattern

recognition are: Machine vision Character recognition (OCR) Computer-aided diagnosis Speech recognition Face recognition Image Data Base retrieval Data mining and knowledge discovery Bioinformatics Biometrics 4

Md. Tarek HabibMd. Tarek Habib

Page 5: Topic   9 (pattern recognition)

Introduction

So… The task: Assign unknown objects –

patterns – into the correct class. This is

known as classification.

5

Md. Tarek HabibMd. Tarek Habib

Page 6: Topic   9 (pattern recognition)

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

The measurements obtained from the

patterns and used for the classification are

known as features.

A number of features xi, i = 1, 2, . . . , l,

form the feature vector

where T denotes transposition.

Each of the feature vectors identifies

uniquely a single pattern (object).6

,,...,, 21T

lxxxx

Md. Tarek HabibMd. Tarek Habib

Page 7: Topic   9 (pattern recognition)

Figure 1.1 shows two images, each having a distinct region inside it. The two regions are also themselves visually different. We could say that the region of Figure 1.1a results from a benign lesion, class A, and that of Figure 1.1b from a malignant one (cancer), class B. We will further assume that these are not the only patterns (images) that are available to us, but we have access to an image database with a number of patterns, some of which are known to originate from class A and some from class B.

7

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

Md. Tarek HabibMd. Tarek Habib

Page 8: Topic   9 (pattern recognition)

8

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

Md. Tarek HabibMd. Tarek Habib

Page 9: Topic   9 (pattern recognition)

The first step is to identify the measurable quantities that make these two regions distinct from each other. Figure 1.2 shows a plot of the mean value of the intensity in each region of interest versus the corresponding standard deviation around this mean. Each point corresponds to a different image from the available database. It turns out that class A patterns tend to spread in a different area from class B patterns. The straight line seems to be a good candidate for separating the two classes. 9

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

Md. Tarek HabibMd. Tarek Habib

Page 10: Topic   9 (pattern recognition)

10

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

Md. Tarek HabibMd. Tarek Habib

Page 11: Topic   9 (pattern recognition)

Let us now assume that we are given a new image with a region in it and that we do not know to which class it belongs. It is reasonable to say that we measure the mean intensity and standard deviation in the region of interest and we plot the corresponding point. This is shown by the asterisk (∗) in Figure 1.2. Then it is sensible to assume that the unknown pattern is more likely to belong to class A than class B. The preceding artificial classification task has outlined the rationale behind a large class of pattern recognition problems.

11

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

Md. Tarek HabibMd. Tarek Habib

Page 12: Topic   9 (pattern recognition)

The straight line in Figure 1.2 is known as the decision line, and it constitutes the classifier whose role is to divide the feature space into regions that correspond to either class A or class B. If a feature vector x, corresponding to an unknown pattern, falls in the class A region, it is classified as class A, otherwise as class B. This does not necessarily mean that the decision is correct. If it is not correct, a misclassification has occurred.

12

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

Md. Tarek HabibMd. Tarek Habib

Page 13: Topic   9 (pattern recognition)

In order to draw the straight line in Figure 1.2 we exploited the fact that we knew the labels (class A or B) for each point of the figure. The patterns (feature vectors) whose true class is known and which are used for the design of the classifier are known as training patterns (training feature vectors).

13

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

Md. Tarek HabibMd. Tarek Habib

Page 14: Topic   9 (pattern recognition)

Figure 1.3 shows the various stages followed for the design of a classification system. As is apparent from the feedback arrows, these stages are not independent. On the contrary, they are interrelated and, depending on the results, one may go back to redesign earlier stages in order to improve the overall performance. Furthermore, there are some methods that combine stages, for example, the feature selection and the classifier design stage, in a common optimization task. 14

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

Md. Tarek HabibMd. Tarek Habib

Page 15: Topic   9 (pattern recognition)

15

Features, Features Features, Features Vectors, and ClassifiersVectors, and Classifiers

Md. Tarek HabibMd. Tarek Habib

Page 16: Topic   9 (pattern recognition)

16

Supervised, Unsupervised, Supervised, Unsupervised, and Semi- Supervised and Semi- Supervised

LearningLearning

Md. Tarek HabibMd. Tarek Habib

In the example of Figure 1.1, we assumed that a set of training data were available, and the classifier was designed by exploiting this a priori known information. This is known as supervised pattern recognition or in the more general context of machine learning as supervised learning.

Page 17: Topic   9 (pattern recognition)

17

Supervised, Unsupervised, Supervised, Unsupervised, and Semi- Supervised and Semi- Supervised

LearningLearning

Md. Tarek HabibMd. Tarek Habib

However, this is not always the case, and there is another type of pattern recognition tasks for which training data, of known class labels, are not available. In this type of problem, we are given a set of feature vectors x and the goal is to unravel the underlying similarities and cluster (group) “similar” vectors together. This is known as unsupervised pattern recognition or unsupervised learning or clustering.

Page 18: Topic   9 (pattern recognition)

18

Supervised, Unsupervised, Supervised, Unsupervised, and Semi- Supervised and Semi- Supervised

LearningLearning

Md. Tarek HabibMd. Tarek Habib

Semi-supervised learning/pattern recognition for designing a classification system shares the same goals as the supervised case, however now, the designer has at his or her disposal a set of patterns of unknown class origin, in addition to the training patterns, whose true class is known. We usually refer to the former ones as unlabeled and the latter as labeled data. Semi-supervised pattern recognition can be of importance when the system designer has access to a rather limited number of labeled data.

Page 19: Topic   9 (pattern recognition)

19

Supervised, Unsupervised, Supervised, Unsupervised, and Semi- Supervised and Semi- Supervised

LearningLearning

Md. Tarek HabibMd. Tarek Habib

In such cases, recovering additional information from the unlabeled samples, related to the general structure of the data at hand, can be useful in improving the system design. Semi-supervised learning finds its way also to clustering tasks. In this case, labeled data are used as constraints in the form of must-links and cannot-links.

Page 20: Topic   9 (pattern recognition)

20

Supervised, Unsupervised, Supervised, Unsupervised, and Semi- Supervised and Semi- Supervised

LearningLearning

Md. Tarek HabibMd. Tarek Habib

In other words, the clustering task is constrained to assign certain points in the same cluster or to exclude certain points of being assigned in the same cluster. From this perspective, semi-supervised learning provides an a priori knowledge that the clustering algorithm has to respect.