Deep Learning-Based Language Identification in English-Hindi-Bengali Code-Mixed Social Media Corpora

This article addresses word-level language identification in Indian social media corpora drawn from Facebook, Twitter and WhatsApp posts that exhibit code-mixing in English-Hindi, English-Bengali, and a blend of both language pairs. Code-mixing is a fusion of multiple languages pr...

Bibliographic Details
Main Authors: Jamatia Anupam, Das Amitava, Gambäck Björn
Format: Article
Language: English
Published: De Gruyter 2019-07-01
Series: Journal of Intelligent Systems
Online Access: https://doi.org/10.1515/jisys-2017-0440