svm active learning for image retrieval
DESCRIPTION
SVM Active Learning For Image Retrieval. Morris LeBlanc. Outline. Why Image Retrieval is Hard? Problems with Image Retrieval Support Vector Machines Active Learning Image Processing Texture and Color Relevance Feedback. Why is Image Retrieval Hard?. What is the topic of this image? - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/1.jpg)
Morris LeBlanc
![Page 2: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/2.jpg)
Why Image Retrieval is Hard? Problems with Image Retrieval Support Vector Machines Active Learning Image Processing
◦ Texture and Color Relevance Feedback
![Page 3: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/3.jpg)
What is the topic of this image?
What are right keywords to index this image
What words would you use to retrieve this image?
The Semantic Gap
![Page 4: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/4.jpg)
A picture is worth a thousand words
The meaning of an image is highly individual and subjective
![Page 5: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/5.jpg)
Is a set of related learning methods used for classification and regression
Views data in two sets of vectors in a n-dimensional space
With this we are able to label “relevant” and “non-relevant” images◦Based on distance from a labeled
instance
![Page 6: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/6.jpg)
SVM training process proceeds as follows:1. Choose some working subset of the query images2. Construct classifier – i.e. create a new surface:
Optimize the weights associated with the working subset of images (feature vectors)
Update optimality conditions for images (vectors) not in working subset Broadcast working subset images (vectors) and
weights Update optimality conditions for all images in query
(Map) Reduce to find greatest violating image (vector) not
contained in working subset (Reduce)
![Page 7: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/7.jpg)
Updating SVMs Cont’d
3. Update working subset to include greatest violating image (vector)
4. Iterate until all images (vectors) satisfy optimality conditions
5. Repeat steps 2 through 4 until correct images are returned
![Page 8: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/8.jpg)
This image shows the multiple current version space chosen by the user (wi) and all instances found later. The closet one is what will be shown to the user.
![Page 9: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/9.jpg)
Here, one allows the learner the flexibility to choose the data points that it feels are most relevant for learning a particular task
◦ An analogy is that a standard passive learner is a student that sits and listens to a teacher while an active learner is a student that asks the teacher questions, listens to the answers and asks further questions based upon the teacher's response
![Page 10: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/10.jpg)
Representing the Images
◦Segmentation
◦Low Level Features Color Texture
![Page 11: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/11.jpg)
Information about color or texture or shape which are extracted from an image are known as image features
◦Also a low-level features Red, sandy
◦As opposed to high level features or concepts Beaches, mountains, happy, serene, George Bush
![Page 12: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/12.jpg)
Do we consider the whole image or just part ?
◦Whole image - global features
◦Parts of image - local features
![Page 13: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/13.jpg)
Segment images into parts
Two sorts:◦Tile Based◦Region based
![Page 14: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/14.jpg)
(a) 5 tiles (b) 9 tiles
(c) 5 regions (d) 9 regions
Tiles
Regions
![Page 15: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/15.jpg)
Break image down into simple geometric shapes
Similar Problems to GlobalPlus dangers of breaking up significant
objectsComputational SimpleSome Schemes seem to work well in practice
![Page 16: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/16.jpg)
Break image down into visually coherent areas
Can identify meaningful areas and objects
Computationally intensive Unreliable
![Page 17: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/17.jpg)
Produce a color signature for region/whole image
Typically done using color correllograms or color histograms
![Page 18: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/18.jpg)
Identify a number of buckets in which to sort the available colours (e.g. red green and blue, or up to ten or so colours)
Allocate each pixel in an image to a bucket and count the number of pixels in each bucket.
Use the figure produced (bucket id plus count, normalised for image size and resolution) as the index key (signature) for each image
![Page 19: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/19.jpg)
0
10
20
30
40
50
60
70
80
90
Red Orange
![Page 20: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/20.jpg)
Produce a mathematical characterization of a repeating pattern in the image◦Smooth◦Sandy◦Grainy◦Stripey
![Page 21: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/21.jpg)
Reduces an area/region to a (small - 15 ?) set of numbers which can be used a signature for that region
Proven to work well in practice
Hard for people to understand
![Page 22: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/22.jpg)
Well established technique in text retrieval◦ Experimental results have always shown it to
work well in practice
Unfortunately experience with search engines has show it is difficult to get real searchers to adopt it - too much interaction
![Page 23: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/23.jpg)
User performs an initial query
Selects some relevant results
System then extracts terms from these to augment the initial query
Requeries
![Page 24: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/24.jpg)
Identify the N top-ranked images Identify all terms from the N top-ranked images
Select the feedback terms Merge the feedback terms with the original query
Identify the top-ranked images for the modified queries through relevance ranking
![Page 25: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/25.jpg)
Q’ = aQ + b sum(R) - c sum(S)
◦Q: original query vector◦R: set of relevant document vectors◦S: set of non-relevant image vectors◦a, b, c: constants (Rocchio weights)◦Q’: new query vector
![Page 26: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/26.jpg)
“SVM Active Learning For Image Retrieval” Simon Tong, Stanford University and Edward Chang, UCSB
John Tait, University of Sunderland, UK tait.ppt
http://robotics.stanford.edu/~stong/research.html -Simon Tong’s website
![Page 27: SVM Active Learning For Image Retrieval](https://reader033.vdocuments.us/reader033/viewer/2022051419/56815884550346895dc5e73c/html5/thumbnails/27.jpg)