Indian Language Identification using Deep Learning
Spoken language is the most regular method of correspondence in this day and age. Endeavours to create language recognizable proof frameworks for Indian dialects have been very restricted because of the issue of speaker accessibility and language readability. However, the necessity of SLID is expand...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
EDP Sciences
2020-01-01
|
Series: | ITM Web of Conferences |
Subjects: | |
Online Access: | https://www.itm-conferences.org/articles/itmconf/pdf/2020/02/itmconf_icacc2020_01010.pdf |
id |
doaj-6006792edb5045ad8cf923c7ed6933e8 |
---|---|
record_format |
Article |
spelling |
doaj-6006792edb5045ad8cf923c7ed6933e82021-04-02T14:28:11ZengEDP SciencesITM Web of Conferences2271-20972020-01-01320101010.1051/itmconf/20203201010itmconf_icacc2020_01010Indian Language Identification using Deep LearningGodbole Shubham0Jadhav VaishnaviBirajdar GajananDepartment of Electronics Engineering, Ramrao Adik Institute of Technology, NerulSpoken language is the most regular method of correspondence in this day and age. Endeavours to create language recognizable proof frameworks for Indian dialects have been very restricted because of the issue of speaker accessibility and language readability. However, the necessity of SLID is expanding for common and safeguard applications day by day. Feature extraction is a basic and important procedure performed in LID. A sound example is changed over into a spectrogram visual portrayal which describes a range of frequencies in regard with time. Three such spectrogram visuals were generated namely Log Spectrogram, Gammatonegram and IIR-CQT Spectrogram for audio samples from the standardized IIIT-H Indic Speech Database. These visual representations depict language specific details and the nature of each language. These spectrograms images were then used as an input to the CNN. Classification accuracy of 98.86% was obtained using the proposed methodology.https://www.itm-conferences.org/articles/itmconf/pdf/2020/02/itmconf_icacc2020_01010.pdfconvolutional neural network (cnn)spoken indian language identification (slid)log spectrogram, gammatonegramiir-cqt spectrogramartificial neural network (ann)deep learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Godbole Shubham Jadhav Vaishnavi Birajdar Gajanan |
spellingShingle |
Godbole Shubham Jadhav Vaishnavi Birajdar Gajanan Indian Language Identification using Deep Learning ITM Web of Conferences convolutional neural network (cnn) spoken indian language identification (slid) log spectrogram, gammatonegram iir-cqt spectrogram artificial neural network (ann) deep learning |
author_facet |
Godbole Shubham Jadhav Vaishnavi Birajdar Gajanan |
author_sort |
Godbole Shubham |
title |
Indian Language Identification using Deep Learning |
title_short |
Indian Language Identification using Deep Learning |
title_full |
Indian Language Identification using Deep Learning |
title_fullStr |
Indian Language Identification using Deep Learning |
title_full_unstemmed |
Indian Language Identification using Deep Learning |
title_sort |
indian language identification using deep learning |
publisher |
EDP Sciences |
series |
ITM Web of Conferences |
issn |
2271-2097 |
publishDate |
2020-01-01 |
description |
Spoken language is the most regular method of correspondence in this day and age. Endeavours to create language recognizable proof frameworks for Indian dialects have been very restricted because of the issue of speaker accessibility and language readability. However, the necessity of SLID is expanding for common and safeguard applications day by day. Feature extraction is a basic and important procedure performed in LID. A sound example is changed over into a spectrogram visual portrayal which describes a range of frequencies in regard with time. Three such spectrogram visuals were generated namely Log Spectrogram, Gammatonegram and IIR-CQT Spectrogram for audio samples from the standardized IIIT-H Indic Speech Database. These visual representations depict language specific details and the nature of each language. These spectrograms images were then used as an input to the CNN. Classification accuracy of 98.86% was obtained using the proposed methodology. |
topic |
convolutional neural network (cnn) spoken indian language identification (slid) log spectrogram, gammatonegram iir-cqt spectrogram artificial neural network (ann) deep learning |
url |
https://www.itm-conferences.org/articles/itmconf/pdf/2020/02/itmconf_icacc2020_01010.pdf |
work_keys_str_mv |
AT godboleshubham indianlanguageidentificationusingdeeplearning AT jadhavvaishnavi indianlanguageidentificationusingdeeplearning AT birajdargajanan indianlanguageidentificationusingdeeplearning |
_version_ |
1721562221434109952 |