Digital Comics Image Indexing Based on Deep Learning
The digital comic book market is growing every year now, mixing digitized and digital-born comics. Digitized comics suffer from a limited automatic content understanding which restricts online content search and reading applications. This study shows how to combine state-of-the-art image analysis me...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-07-01
|
Series: | Journal of Imaging |
Subjects: | |
Online Access: | http://www.mdpi.com/2313-433X/4/7/89 |
id |
doaj-1278cfb54c66448ca86ef3b24f5a26ba |
---|---|
record_format |
Article |
spelling |
doaj-1278cfb54c66448ca86ef3b24f5a26ba2020-11-24T23:59:40ZengMDPI AGJournal of Imaging2313-433X2018-07-01478910.3390/jimaging4070089jimaging4070089Digital Comics Image Indexing Based on Deep LearningNhu-Van Nguyen0Christophe Rigaud1Jean-Christophe Burie2Lab L3I, University of La Rochelle, 17000 La Rochelle, FranceLab L3I, University of La Rochelle, 17000 La Rochelle, FranceLab L3I, University of La Rochelle, 17000 La Rochelle, FranceThe digital comic book market is growing every year now, mixing digitized and digital-born comics. Digitized comics suffer from a limited automatic content understanding which restricts online content search and reading applications. This study shows how to combine state-of-the-art image analysis methods to encode and index images into an XML-like text file. Content description file can then be used to automatically split comic book images into sub-images corresponding to panels easily indexable with relevant information about their respective content. This allows advanced search in keywords said by specific comic characters, action and scene retrieval using natural language processing. We get down to panel, balloon, text, comic character and face detection using traditional approaches and breakthrough deep learning models, and also text recognition using LSTM model. Evaluations on a dataset composed of online library content are presented, and a new public dataset is also proposed.http://www.mdpi.com/2313-433X/4/7/89comics analysisimage indexingdeep learningCNNLSTMhandwritten text recognition |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Nhu-Van Nguyen Christophe Rigaud Jean-Christophe Burie |
spellingShingle |
Nhu-Van Nguyen Christophe Rigaud Jean-Christophe Burie Digital Comics Image Indexing Based on Deep Learning Journal of Imaging comics analysis image indexing deep learning CNN LSTM handwritten text recognition |
author_facet |
Nhu-Van Nguyen Christophe Rigaud Jean-Christophe Burie |
author_sort |
Nhu-Van Nguyen |
title |
Digital Comics Image Indexing Based on Deep Learning |
title_short |
Digital Comics Image Indexing Based on Deep Learning |
title_full |
Digital Comics Image Indexing Based on Deep Learning |
title_fullStr |
Digital Comics Image Indexing Based on Deep Learning |
title_full_unstemmed |
Digital Comics Image Indexing Based on Deep Learning |
title_sort |
digital comics image indexing based on deep learning |
publisher |
MDPI AG |
series |
Journal of Imaging |
issn |
2313-433X |
publishDate |
2018-07-01 |
description |
The digital comic book market is growing every year now, mixing digitized and digital-born comics. Digitized comics suffer from a limited automatic content understanding which restricts online content search and reading applications. This study shows how to combine state-of-the-art image analysis methods to encode and index images into an XML-like text file. Content description file can then be used to automatically split comic book images into sub-images corresponding to panels easily indexable with relevant information about their respective content. This allows advanced search in keywords said by specific comic characters, action and scene retrieval using natural language processing. We get down to panel, balloon, text, comic character and face detection using traditional approaches and breakthrough deep learning models, and also text recognition using LSTM model. Evaluations on a dataset composed of online library content are presented, and a new public dataset is also proposed. |
topic |
comics analysis image indexing deep learning CNN LSTM handwritten text recognition |
url |
http://www.mdpi.com/2313-433X/4/7/89 |
work_keys_str_mv |
AT nhuvannguyen digitalcomicsimageindexingbasedondeeplearning AT christopherigaud digitalcomicsimageindexingbasedondeeplearning AT jeanchristopheburie digitalcomicsimageindexingbasedondeeplearning |
_version_ |
1725446753586511872 |