www.infotech.monash.edu sieve—search images effectively through visual elimination ying liu,...

www.infotech.monash.edu

SIEVE—Search Images Effectively SIEVE—Search Images Effectively through Visual Eliminationthrough Visual Elimination

Ying Liu, Dengsheng Zhang and Guojun Lu

Gippsland School of Info Tech, Monash University,

Churchill, Victoria, 3842 {dengsheng.zhang, guojun.lu}@infotech.monash.edu.au

OutlineOutline

• Motivations• SIEVE• Experiment• Results• Conclusions

Motivations—Semantic GapMotivations—Semantic Gap• Conventional content-based image retrieval (CBIR) systems put

visual features ahead of textual information. • However, there is a gap between visual features and semantic

features (textual information) which cannot be closed easily.

Motivations—Text Search PopularMotivations—Text Search Popular

• CBIR systems are not widely used as text based image search engines.

• However, the textual description in existing search engines may not capture image content and is subjective in nature.

• We propose to integrate the existing text-based image search engine with visual features.

• A post-filtering algorithm is proposed, it is called SIEVE—Search Images Effectively through Visual Elimination.

• Practical fusion methods are also proposed to integrate SIEVE with contemporary text-based search engines.

SIVE—The IdeaSIVE—The Idea

• The idea of using SIEVE is very similar to object classification done by a human being.

• First, objects of interest are roughly distinguished from other very different objects either manually or through certain hand tools.

• Then, the collected objects are subject to visual inspection to confirm each object of interest from unwanted objects.

SIEVE—The ApproachSIEVE—The Approach

• In our approach, text-based image search results for a given query are obtained first.

• SIEVE is then used to filter out those images which are semantically irrelevant to the query.

SIEVE—The SystemSIEVE—The System

SIEVE—Feature Extraction SIEVE—Feature Extraction

• For each image in the list, SIEVE first segments it into different regions.

• Next, color and texture features of each region are extracted.

• The region color feature is the dominant color in HSV space and the region texture feature is the Gabor feature obtained

Segmentation Features

SIEVE—Decision Tree AnalysisSIEVE—Decision Tree Analysis

• Semantic template based decision tree reasoning algorithm is used to derive a set of decision rules to learn a set of concepts in natural scenery images.

• Using these decision rules, the low-level features of a region are mapped to semantic concepts.

SIEVE—Decision Tree AnalysisSIEVE—Decision Tree Analysis

Experiment—Image CollectionsExperiment—Image Collections

• To test the retrieval performance of SIEVE, 10 queries are selected, including mountain, beach, building, firework, flower, forest, snow, sunset, tiger and sea.

• Google image search can return up to thousands of images for a query, however, users are usually only interested in the first few pages.

• Therefore, for each query, the top 100 images are downloaded from the first 5 pages.

Experiment—Learning SemanticsExperiment—Learning Semantics

• For a given query, each image in the returned list is segmented into different regions using JSEG

• Regions with size over 5% of the entire image are selected.

• Then, low-level features of these regions are extracted.

• Next, the semantic based decision tree method is used to learn the concept of each region in an image and decide whether the image is relevant to the query or not.

Experiment—Measurement Experiment—Measurement

• In Web image search scenario, it is not known how many relevant images there are in the database for a given query.

• Bull’s eye measurement is used.

• The bull’s eye measures the retrieval precision among the top K retrieved images.

Results—Retrieval AccuracyResults—Retrieval Accuracy

0 10 20 30 40 50K

Precision

SIEVE Google

Average retrieval precision for 10 image concepts

Results—Results—Retrieval ExamplesRetrieval Examples

Above: Search result by Google Above: Search result by Google using query ‘Tiger’using query ‘Tiger’

Left: Result by SIEVE using the Left: Result by SIEVE using the same query ‘Tiger’same query ‘Tiger’

Above: Search result by Google Above: Search result by Google using query ‘Snow’using query ‘Snow’

Left: Result by SIEVE using the Left: Result by SIEVE using the same query ‘Snow’same query ‘Snow’

Above: Search result by Google Above: Search result by Google using query ‘Firework’using query ‘Firework’

Right: Result by SIEVE using the Right: Result by SIEVE using the same query ‘Firework’same query ‘Firework’

Integration with Search EnginesIntegration with Search Engines

• Scenario 1— SIEVE is installed on the server. User sends an image search query a Web browser. Search engine returns the SIEVED images to the user.

• Scenario 2— SIEVE is integrated with the Web browser as a plug-in. A user query is directed by the SIEVE to search engine. The returned list is subject to SIEVE.

• Scenario 3— SIEVE is used as an application software. SIEVE directs user query to various Web image search engines. The returned lists from search engines are further SIEVED.

IssuesIssues

• Significant time on image segmentation and computing image semantics. This can be solved by indexing images semantics upfront in image search engines.

• Although a limited concept set is used to test its performance, the decision tree can accommodate more semantic concepts, provided their corresponding distinct feature templates are available for inclusion in the training dataset.

• SIEVE can be applied more effectively if images in database are first classified into categories.

ConclusionsConclusions

• An effective method called SIEVE has been proposed to improve text based Web image search.

• Compared with text based image search engine, it shows significant improvement on the tested semantic concepts.

• Compared with conventional CBIR systems, it is much more efficient in dealing huge image database like Web images. Because SIEVE makes use of efficient text based image search engine.

• Future research will extend SIEVE to include large number of image concepts.

www.infotech.monash.edu sieve—search images effectively through visual elimination ying liu,...

visual features

google image search

image content

sievethe system slide

texture features

segmentation features

lowlevel features

existing search engines

Documents

united nations · list of participants ... drug prevention...

introduction to multimedia1 multimedia network. zreference:...

image compression and encryption based on wavelet transform...

wang guojun july2010

robust keyframe-based dense slam with an rgb-d …1 robust...

www.infotech.monash.edu fit 1005 networks & data...

chy loop integrands from holomorphic forms · published:...

metal-isotope-tagged monoclonal antibodies for...

www.infotech.monash.edu fit 1005 networks & data...

www.infotech.monash.edu fit2043 technical documentation for...

efﬁcient endocytic uptake and maturation in drosophila...

a comparative analysis of satellite-based approaches for...

valuation of large variable annuity portfolios with rank...

stm capacity for chinese words and idioms: chunking and...

1 faculty of information technology generic fourier...

kais t scalable key management for secure multicast...

original article rhesus monkey is a new model of … ·...

multi-wavelength pulsed emission from fermi pulsars: vela &...

www.infotech.monash.edu fit 1005 networks & data...

liu guojun middle school zhang shilan. unit 1 school life...