Indian Language Identification using Deep Learning

Spoken language is the most regular method of correspondence in this day and age. Endeavours to create language recognizable proof frameworks for Indian dialects have been very restricted because of the issue of speaker accessibility and language readability. However, the necessity of SLID is expand...

Full description

Bibliographic Details
Main Authors: Godbole Shubham, Jadhav Vaishnavi, Birajdar Gajanan
Format: Article
Language:English
Published: EDP Sciences 2020-01-01
Series:ITM Web of Conferences
Subjects:
Online Access:https://www.itm-conferences.org/articles/itmconf/pdf/2020/02/itmconf_icacc2020_01010.pdf
id doaj-6006792edb5045ad8cf923c7ed6933e8
record_format Article
spelling doaj-6006792edb5045ad8cf923c7ed6933e82021-04-02T14:28:11ZengEDP SciencesITM Web of Conferences2271-20972020-01-01320101010.1051/itmconf/20203201010itmconf_icacc2020_01010Indian Language Identification using Deep LearningGodbole Shubham0Jadhav VaishnaviBirajdar GajananDepartment of Electronics Engineering, Ramrao Adik Institute of Technology, NerulSpoken language is the most regular method of correspondence in this day and age. Endeavours to create language recognizable proof frameworks for Indian dialects have been very restricted because of the issue of speaker accessibility and language readability. However, the necessity of SLID is expanding for common and safeguard applications day by day. Feature extraction is a basic and important procedure performed in LID. A sound example is changed over into a spectrogram visual portrayal which describes a range of frequencies in regard with time. Three such spectrogram visuals were generated namely Log Spectrogram, Gammatonegram and IIR-CQT Spectrogram for audio samples from the standardized IIIT-H Indic Speech Database. These visual representations depict language specific details and the nature of each language. These spectrograms images were then used as an input to the CNN. Classification accuracy of 98.86% was obtained using the proposed methodology.https://www.itm-conferences.org/articles/itmconf/pdf/2020/02/itmconf_icacc2020_01010.pdfconvolutional neural network (cnn)spoken indian language identification (slid)log spectrogram, gammatonegramiir-cqt spectrogramartificial neural network (ann)deep learning
collection DOAJ
language English
format Article
sources DOAJ
author Godbole Shubham
Jadhav Vaishnavi
Birajdar Gajanan
spellingShingle Godbole Shubham
Jadhav Vaishnavi
Birajdar Gajanan
Indian Language Identification using Deep Learning
ITM Web of Conferences
convolutional neural network (cnn)
spoken indian language identification (slid)
log spectrogram, gammatonegram
iir-cqt spectrogram
artificial neural network (ann)
deep learning
author_facet Godbole Shubham
Jadhav Vaishnavi
Birajdar Gajanan
author_sort Godbole Shubham
title Indian Language Identification using Deep Learning
title_short Indian Language Identification using Deep Learning
title_full Indian Language Identification using Deep Learning
title_fullStr Indian Language Identification using Deep Learning
title_full_unstemmed Indian Language Identification using Deep Learning
title_sort indian language identification using deep learning
publisher EDP Sciences
series ITM Web of Conferences
issn 2271-2097
publishDate 2020-01-01
description Spoken language is the most regular method of correspondence in this day and age. Endeavours to create language recognizable proof frameworks for Indian dialects have been very restricted because of the issue of speaker accessibility and language readability. However, the necessity of SLID is expanding for common and safeguard applications day by day. Feature extraction is a basic and important procedure performed in LID. A sound example is changed over into a spectrogram visual portrayal which describes a range of frequencies in regard with time. Three such spectrogram visuals were generated namely Log Spectrogram, Gammatonegram and IIR-CQT Spectrogram for audio samples from the standardized IIIT-H Indic Speech Database. These visual representations depict language specific details and the nature of each language. These spectrograms images were then used as an input to the CNN. Classification accuracy of 98.86% was obtained using the proposed methodology.
topic convolutional neural network (cnn)
spoken indian language identification (slid)
log spectrogram, gammatonegram
iir-cqt spectrogram
artificial neural network (ann)
deep learning
url https://www.itm-conferences.org/articles/itmconf/pdf/2020/02/itmconf_icacc2020_01010.pdf
work_keys_str_mv AT godboleshubham indianlanguageidentificationusingdeeplearning
AT jadhavvaishnavi indianlanguageidentificationusingdeeplearning
AT birajdargajanan indianlanguageidentificationusingdeeplearning
_version_ 1721562221434109952