extraction robuste des cases et du texte de bandes dessinées

22
Extraction robuste des cases et du texte de bandes dessinées Christophe Rigaud, Norbert Tsopze, J. C. Burie, J. M. Ogier Laboratoire Informatique, Image et Interaction (L3i) CIFED 2012, Bordeaux, pp.349-360 - 23 Mars 2012

Upload: christophe-rigaud

Post on 29-Jan-2018

537 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Extraction robuste des cases et du texte de bandes dessinées

Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud, Norbert Tsopze, J. C. Burie, J. M. Ogier

Laboratoire Informatique, Image et Interaction (L3i)

CIFED 2012, Bordeaux, pp.349-360 - 23 Mars 2012

Page 2: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 2

SUMMARY

● Project eBDtheque

● Proposed work

● Experiments

● Conclusion

Page 3: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 3

SUMMARY

● Project eBDtheque● Proposed work● Experiments● Conclusion

Page 4: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 4

Project eBDtheque

● Project CPER● European Regional Development Fund, the region Poitou-Charentes, the General

Council of Charente Maritime and the town of La Rochelle.● Team of 12 people● 2011-2013● http://l3i.univ-larochelle.fr/eBDtheque.html

● Comics :● Important heritage to develop with new technologies

● Target: ● Help to automatically convert digitalized comics into digital comics.

– « Find all the panels containing the Eiffel Tower »...

– User and author interactions

Page 5: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 5

Project eBDtheque > Context

Digitalised comicsPatrimonial

CIBDI¹Paper bookMedium dependent

Digital comicsNew technologies

SmartphoneWebSpeech synthesisAnimation (cartoon)Story reconstruction...

eBDtheque

EXTRACTIONMulti segmentation

REPRESENTATIONOntologyBrowsing

¹ CIBDI: Centre International de la Bande Dessinées et de l'Image

Page 6: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 6

Project eBDtheque > Multi-segmentation

Panels

Text

Page

Image credit: Cyb, Cosmozone, Studio Cyborga, Goven, France, 2009

. . .

. . .

Page 7: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 7

SUMMARY

● Project eBDtheque● Proposed work

● Related works● Contribution● Limitations

● Experiments● Conclusion

Page 8: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 8

Proposed work > Related works

● Panel segmentation● Hough line● X-Y cut algorithm¹● Gradient²● Connected component³

● Text segmentation● Connected component

– Speech balloon > text⁴– Text > speech balloon³

¹ Han E., Kim K., Yang H., Jung K., « Frame segmentation used MLP-based X-Y recursive for mobile cartoon content », Proceedings of the 12th, HCI’07, Springer-Verlag, Berlin, Heidelberg, p. 872-881, 2007.² Tanaka T., Shoji K., Toyama F., Miyamichi J., « Layout Analysis of Tree-Structured Scene Frames in Comic Images. », IJCAI’07, p. 2885-2890, 2007.³ Ho A. K. N., Burie J.-C., Ogier J.-M., « Comics page structure analysis based on automatic panel extraction », GREC 2011, Ninth IAPR International Workshop on Graphics Recognition, Seoul, Korea, September, 15-16, 2011 ⁴ Arai K., Tolle H., « Method for Automatic E-Comic Scene Frame Extraction for Reading Comic on Mobile Devices »,

ITNG ’10, IEEE Computer Society, Washington, DC, USA, p. 370-375, 2010.

Page 9: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 9

Proposed work > Contribution

Simultaneous panel + text extraction (CC) => time saving

Image credit: Cyb, Bubblegôm, Studio Cyborga, Goven, France, 2009

Page 10: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 10

Proposed work > Contribution

Simultaneous panel + text extraction (CC) => time saving

Image credit: Cyb, Bubblegôm, Studio Cyborga, Goven, France, 2009

Panels without frame

Page 11: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 11

Proposed work > Contribution

Simultaneous panel + text extraction (CC) => time saving

Panels without frame Out of balloon text

Image credit: Cyb, Bubblegôm, Studio Cyborga, Goven, France, 2009

Page 12: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 12

Proposed work > Content extraction

Image credit: Cyb, Bubblegôm, Studio Cyborga, Goven, France, 2009

Median level = binarisation threshold

BinarisedGrayscale CC bounding boxes

Page 13: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 13

Proposed work > Panel classification

K-means 3 classes

Class “CASE”

Page 14: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 14

Proposed work > Topological filtering

Filtering

K-means 3 classes

Page 15: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 15

Proposed work > Text extraction

K-means 3 classes

Medianheight (mH)

Grouping+filtering(Distance < 2*mH)

Page 16: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 16

Proposed work > Limitations

● 3 classes (panel, text, noise)

● Page/text background grey level

● Overlapping elements

Image credit: Lamisseb, Les noeils Tome 1, Bac@BD, Valence, France, 2011.

Page 17: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 17

SUMMARY

● Project eBDtheque● Proposed work● Experiments

● Dataset● Extraction

● Conclusion

Page 18: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 18

Experiments > Dataset

● Same as Ho et al.¹ for comparison● 7 albums => 355 panels (42 pages A4, 300dpi)● 435 speech balloons + 79 narrative text areas

¹ Ho A. K. N., Burie J.-C., Ogier J.-M., « Comics page structure analysis based on automatic panel extraction », GREC 2011, Nineth IAPR International Workshop on Graphics Recognition, Seoul, Korea, September, 15-16, 2011

Page 19: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 19

Experiments > Panel extraction

Method Tanaka¹ Arai² Ngo Ho³ Proposed Gain

Panel (%) 63.9 75.6 87.3 88.2 +0.9%

Page (%) 42.8 47.6 64.3 66.7 +2.4%

¹ Tanaka, T., Shoji, K., Toyama, F., Miyamichi, J.: Layout analysis of tree-structured scene frames in comic images. In: IJCAI’07. pp. 2885–2890 (2007)² Arai, K., Tolle, H.: Method for automatic e-comic scene frame extraction for reading comic on mobile devices. In: Seventh International Conference on Information Technology: NewGenerations. pp. 370–375. ITNG, IEEE Computer Society, Washington, DC, USA (2010)³ Ngo Ho A. K. N., Burie J.-C., Ogier J.-M., « Comics page structure analysis based on automatic panel extraction », GREC 2011, Nineth IAPR International Workshop on Graphics Recognition, Seoul, Korea, September, 15-16 (2011)

=> panels without frame are extracted

CONDITION:Panel OK if “properly” detectedPage OK if all panels OK

Page 20: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 20

Experiments > Text extraction

Text type TP FN

Speech 78 22

Narrative 53 47

CONDITION:TP: text area detected correctlyFN: text area not detected

TP

FN

FP

TN

Page 21: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 21

Conclusion

● Simultaneous comics panel and text extraction● Robust to page size and resolution variations● All text areas (no panel dependent)● Assume 3 classes and few overlapping elements

● Coming soon: panel content extraction...

Page 22: Extraction robuste des cases et du texte de bandes dessinées

Christophe Rigaud - CIFED 2012, pp.349-360 22

???{christophe.rigaud, norbert.tsopze, jcburie, jmogier}@univ-lr.fr

CIFED 2012, Bordeaux, pp.349-360 - 23 Mars 2012