“Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Page 1: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

“Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Daniele Borghesani

International Doctorate school on Information and Communication Technologies

English for academic purposes I

Page 2: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Overview

1. Goals

2. Text segmentation

3. Picture segmentation

4. Results

5. Conclusions

Page 4: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Dataset description

Page 5: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Dataset description

• Holy Bible of Borso d’Este (1450-1471 AD)

• Illuminated manuscript

• Many illustrations (biblical episodes, animals, symbols, court life scenes…)

• 1200+ high-resolution pages

Page 6: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Manual annotation

Page 7: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Our project

Text recognition

Texture analysis

Preprocessing

Illustrations classification

Text

Illustrations

Decorated initials

Decoration

Picture

Annotation database

Images database

Feature annotation

User interface

CBIR

Page 8: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Our project

• Automatic analysis of Bible pages

• Extraction of valuable pictures

• Addition of translations, commentaries, references…

• Finally, a media station with an appealing user interface (museums)

Obscura HP Multi-Touch Video Wall

Page 10: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Text Segmentation

1. Block analysis with autocorrelation

2. Directional histogram

• Sum of pixels along each direction

3. Modeling with mixtures of von Mises distributions

• Well suited to angular data

• Compact representation (5 values for a mixture of two von Mises distributions)
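The three steps above can be sketched in a few lines. This is only an illustrative sketch under stated assumptions, not the implementation used in the project: the function names, the 180-bin resolution, the sampling radius and the pure-Python loops are all choices made here for readability. The "5 values" of the mixture are read as two means, two concentrations and one mixing weight, which is the usual parameterization. Fitting the mixture to a histogram (for example with EM) is not shown.

```python
import math

def autocorrelation(block):
    """2-D autocorrelation of a small grayscale block (list of rows).
    Returns a dict mapping (dy, dx) offsets to unnormalized correlation values."""
    h, w = len(block), len(block[0])
    mean = sum(map(sum, block)) / (h * w)
    z = [[v - mean for v in row] for row in block]
    acf = {}
    for dy in range(-h + 1, h):
        for dx in range(-w + 1, w):
            s = 0.0
            for y in range(max(0, -dy), min(h, h - dy)):
                for x in range(max(0, -dx), min(w, w - dx)):
                    s += z[y][x] * z[y + dy][x + dx]
            acf[(dy, dx)] = s
    return acf

def directional_histogram(acf, radius, n_bins=180):
    """Sum autocorrelation values along rays from the origin.
    Bin b covers the direction pi * b / n_bins (assumed resolution)."""
    hist = []
    for b in range(n_bins):
        theta = math.pi * b / n_bins
        total = 0.0
        for r in range(1, radius + 1):
            key = (round(r * math.sin(theta)), round(r * math.cos(theta)))
            total += acf.get(key, 0.0)
        hist.append(total)
    return hist

def bessel_i0(x, terms=30):
    """Modified Bessel function of the first kind, order 0, via its power series."""
    total, term = 1.0, 1.0
    for k in range(1, terms):
        term *= (x / 2.0) ** 2 / (k * k)
        total += term
    return total

def von_mises_mixture(theta, w, mu1, kappa1, mu2, kappa2):
    """Density of a two-component von Mises mixture: five parameters in total."""
    def vm(mu, kappa):
        return math.exp(kappa * math.cos(theta - mu)) / (2 * math.pi * bessel_i0(kappa))
    return w * vm(mu1, kappa1) + (1 - w) * vm(mu2, kappa2)
```

For a block of horizontal stripes, a rough stand-in for lines of text, `directional_histogram(autocorrelation(block), radius=4)` peaks in the horizontal direction (bin 0). Because the histogram covers only half a turn, angles would typically be doubled before fitting a von Mises model; that step is also an assumption here.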

Page 11: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Text Segmentation

[Figure: two plots, labeled “Text” and “!Text” (non-text); horizontal axes from 1 to 172, vertical axes from 0 to 0.5 and from 0 to 8]

Page 12: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Text Segmentation

Page 14: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Picture Segmentation

3. Preprocessing to focus on the most important blobs of pixels

(1) Original image

(2) Background suppression and labeling (fast)

(3) Morphology

(4) Blob filling
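The four panels above can be mimicked on a tiny grayscale image as follows. This is a minimal sketch under assumptions that the slides do not confirm: the background is taken to be lighter than a fixed threshold, labeling uses 4-connectivity, and the morphology step is a 3x3 closing. All function names are hypothetical.

```python
from collections import deque

def suppress_background(gray, threshold):
    """Binary mask: 1 where the pixel is darker than `threshold` (assumed foreground)."""
    return [[1 if v < threshold else 0 for v in row] for row in gray]

def _neighbors(y, x, h, w):
    """Yield the in-image 4-connected neighbors of (y, x)."""
    for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        ny, nx = y + dy, x + dx
        if 0 <= ny < h and 0 <= nx < w:
            yield ny, nx

def label_blobs(mask):
    """4-connected component labeling by breadth-first search. Returns (labels, count)."""
    h, w = len(mask), len(mask[0])
    labels = [[0] * w for _ in range(h)]
    count = 0
    for y in range(h):
        for x in range(w):
            if mask[y][x] and not labels[y][x]:
                count += 1
                labels[y][x] = count
                queue = deque([(y, x)])
                while queue:
                    cy, cx = queue.popleft()
                    for ny, nx in _neighbors(cy, cx, h, w):
                        if mask[ny][nx] and not labels[ny][nx]:
                            labels[ny][nx] = count
                            queue.append((ny, nx))
    return labels, count

def _morph(mask, op):
    """Apply `op` (max = dilation, min = erosion) over each 3x3 neighborhood."""
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [mask[ny][nx]
                      for ny in range(max(0, y - 1), min(h, y + 2))
                      for nx in range(max(0, x - 1), min(w, x + 2))]
            out[y][x] = op(window)
    return out

def close_mask(mask):
    """Morphological closing: dilation followed by erosion."""
    return _morph(_morph(mask, max), min)

def fill_blobs(mask):
    """Fill holes: background pixels not reachable from the image border become 1."""
    h, w = len(mask), len(mask[0])
    outside = [[False] * w for _ in range(h)]
    queue = deque((y, x) for y in range(h) for x in range(w)
                  if (y in (0, h - 1) or x in (0, w - 1)) and not mask[y][x])
    for y, x in queue:
        outside[y][x] = True
    while queue:
        cy, cx = queue.popleft()
        for ny, nx in _neighbors(cy, cx, h, w):
            if not mask[ny][nx] and not outside[ny][nx]:
                outside[ny][nx] = True
                queue.append((ny, nx))
    return [[0 if outside[y][x] else 1 for x in range(w)] for y in range(h)]
```

On real high-resolution pages an image-processing library would be the practical choice; the pure-Python loops only make each step explicit.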

Page 15: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Picture Segmentation

4. Block analysis

a) SVM learning with a training set of positive and negative samples...

b) SVM classification on the pages…

Features: color (HSV and RGB histograms), texture (gradients), low-frequency coefficients
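The learn-then-classify scheme can be illustrated with a toy example. This is a sketch under several assumptions, not the project's classifier: the slides do not name the SVM kernel or training procedure, so a linear SVM trained by subgradient descent on the hinge loss stands in for it. Only a hue histogram and a crude gradient measure are computed, and the RGB histogram and low-frequency coefficients are omitted. Every function name and parameter value below is hypothetical.

```python
import colorsys

def hue_histogram(pixels, bins=8):
    """Normalized hue histogram of a list of (r, g, b) pixels with values in 0..255."""
    hist = [0.0] * bins
    for r, g, b in pixels:
        h, _, _ = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
        hist[min(int(h * bins), bins - 1)] += 1
    return [c / len(pixels) for c in hist]

def gradient_energy(gray):
    """Mean absolute horizontal and vertical intensity difference (a crude texture cue)."""
    h, w = len(gray), len(gray[0])
    total = sum(abs(gray[y][x + 1] - gray[y][x]) for y in range(h) for x in range(w - 1))
    total += sum(abs(gray[y + 1][x] - gray[y][x]) for y in range(h - 1) for x in range(w))
    return total / (h * (w - 1) + (h - 1) * w)

def block_features(block, bins=8):
    """Feature vector of a block given as rows of (r, g, b) tuples."""
    pixels = [p for row in block for p in row]
    gray = [[sum(p) / 3 for p in row] for row in block]
    return hue_histogram(pixels, bins) + [gradient_energy(gray) / 255]

def train_linear_svm(samples, labels, lam=0.01, lr=0.1, epochs=100):
    """Linear SVM via subgradient descent on the L2-regularized hinge loss.
    `labels` must be +1 (picture) or -1 (not a picture)."""
    w = [0.0] * len(samples[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            w = [wi * (1 - lr * lam) for wi in w]  # regularization shrinkage
            if margin < 1:                         # sample violates the margin
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
                b += lr * y
    return w, b

def classify(w, b, x):
    """Return +1 or -1 according to the side of the separating hyperplane."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1
```

Training on one uniformly red block labeled +1 and one uniformly blue block labeled -1, then classifying a slightly different red block, exercises both stages. Real pages would need many labeled blocks and the full feature set listed above.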

Page 17: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Results

Page 18: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Results

Page 19: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Retrieval by similarity
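One simple way to realize retrieval by similarity is to rank stored pictures by the distance between their feature vectors and that of the query. The slide does not state which distance or index the project uses, so the Euclidean distance, the dictionary layout and the function name below are assumptions made for illustration.

```python
import math

def rank_by_similarity(query, database, top_k=3):
    """Return the `top_k` (name, distance) pairs closest to `query`.
    `database` maps a picture name to its feature vector (hypothetical layout)."""
    scored = [(name, math.dist(query, vec)) for name, vec in database.items()]
    return sorted(scored, key=lambda item: item[1])[:top_k]
```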

Page 20: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Browsing with Sammon Mapping
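Sammon mapping places high-dimensional feature vectors in a low-dimensional layout, for example on a 2-D screen, by minimizing a stress that penalizes changes in pairwise distances, weighting small original distances more heavily. The sketch below only evaluates that stress for a given layout. The optimization that actually finds the layout is not shown, and the use of Euclidean distances and the function name are assumptions.

```python
import math

def sammon_stress(high, low):
    """Sammon stress between points `high` and their low-dimensional images `low`.
    Assumes all points in `high` are distinct, so no original distance is zero."""
    num, denom = 0.0, 0.0
    n = len(high)
    for i in range(n):
        for j in range(i + 1, n):
            d_high = math.dist(high[i], high[j])
            d_low = math.dist(low[i], low[j])
            num += (d_high - d_low) ** 2 / d_high
            denom += d_high
    return num / denom
```

A layout that preserves every pairwise distance has stress 0, and larger values indicate more distortion.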

Page 21: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Conclusions

• We are studying a set of techniques to analyze the Holy Bible of Borso d’Este

• Our goal is to produce a media station, available both locally (museums) and remotely (web app), to “touch” this untouchable masterpiece

Page 22: “Inside the Bible” Segmentation, annotation and retrieval for a new browsing experience

Thank you!