Digital Comics Image Indexing Based on Deep Learning

The digital comic book market is growing every year now, mixing digitized and digital-born comics. Digitized comics suffer from a limited automatic content understanding which restricts online content search and reading applications. This study shows how to combine state-of-the-art image analysis me...

Full description

Bibliographic Details
Main Authors: Nhu-Van Nguyen, Christophe Rigaud, Jean-Christophe Burie
Format: Article
Language:English
Published: MDPI AG 2018-07-01
Series:Journal of Imaging
Subjects:
CNN
Online Access:http://www.mdpi.com/2313-433X/4/7/89
id doaj-1278cfb54c66448ca86ef3b24f5a26ba
record_format Article
spelling doaj-1278cfb54c66448ca86ef3b24f5a26ba2020-11-24T23:59:40ZengMDPI AGJournal of Imaging2313-433X2018-07-01478910.3390/jimaging4070089jimaging4070089Digital Comics Image Indexing Based on Deep LearningNhu-Van Nguyen0Christophe Rigaud1Jean-Christophe Burie2Lab L3I, University of La Rochelle, 17000 La Rochelle, FranceLab L3I, University of La Rochelle, 17000 La Rochelle, FranceLab L3I, University of La Rochelle, 17000 La Rochelle, FranceThe digital comic book market is growing every year now, mixing digitized and digital-born comics. Digitized comics suffer from a limited automatic content understanding which restricts online content search and reading applications. This study shows how to combine state-of-the-art image analysis methods to encode and index images into an XML-like text file. Content description file can then be used to automatically split comic book images into sub-images corresponding to panels easily indexable with relevant information about their respective content. This allows advanced search in keywords said by specific comic characters, action and scene retrieval using natural language processing. We get down to panel, balloon, text, comic character and face detection using traditional approaches and breakthrough deep learning models, and also text recognition using LSTM model. Evaluations on a dataset composed of online library content are presented, and a new public dataset is also proposed.http://www.mdpi.com/2313-433X/4/7/89comics analysisimage indexingdeep learningCNNLSTMhandwritten text recognition
collection DOAJ
language English
format Article
sources DOAJ
author Nhu-Van Nguyen
Christophe Rigaud
Jean-Christophe Burie
spellingShingle Nhu-Van Nguyen
Christophe Rigaud
Jean-Christophe Burie
Digital Comics Image Indexing Based on Deep Learning
Journal of Imaging
comics analysis
image indexing
deep learning
CNN
LSTM
handwritten text recognition
author_facet Nhu-Van Nguyen
Christophe Rigaud
Jean-Christophe Burie
author_sort Nhu-Van Nguyen
title Digital Comics Image Indexing Based on Deep Learning
title_short Digital Comics Image Indexing Based on Deep Learning
title_full Digital Comics Image Indexing Based on Deep Learning
title_fullStr Digital Comics Image Indexing Based on Deep Learning
title_full_unstemmed Digital Comics Image Indexing Based on Deep Learning
title_sort digital comics image indexing based on deep learning
publisher MDPI AG
series Journal of Imaging
issn 2313-433X
publishDate 2018-07-01
description The digital comic book market is growing every year now, mixing digitized and digital-born comics. Digitized comics suffer from a limited automatic content understanding which restricts online content search and reading applications. This study shows how to combine state-of-the-art image analysis methods to encode and index images into an XML-like text file. Content description file can then be used to automatically split comic book images into sub-images corresponding to panels easily indexable with relevant information about their respective content. This allows advanced search in keywords said by specific comic characters, action and scene retrieval using natural language processing. We get down to panel, balloon, text, comic character and face detection using traditional approaches and breakthrough deep learning models, and also text recognition using LSTM model. Evaluations on a dataset composed of online library content are presented, and a new public dataset is also proposed.
topic comics analysis
image indexing
deep learning
CNN
LSTM
handwritten text recognition
url http://www.mdpi.com/2313-433X/4/7/89
work_keys_str_mv AT nhuvannguyen digitalcomicsimageindexingbasedondeeplearning
AT christopherigaud digitalcomicsimageindexingbasedondeeplearning
AT jeanchristopheburie digitalcomicsimageindexingbasedondeeplearning
_version_ 1725446753586511872