Automatic Language Identification in Code-Switched Hindi-English Social Media Text

Natural Language Processing (NLP) tools typically struggle to process code-switched data and so linguists are commonly forced to annotate such data manually. As this data becomes more readily available, automatic tools are increasingly needed to help speed up the annotation process and improve consi...

Full description

Bibliographic Details
Main Authors: Li Nguyen, Christopher Bryant, Sana Kidwai, Theresa Biberauer
Format: Article
Language:English
Published: Ubiquity Press 2021-06-01
Series:Journal of Open Humanities Data
Subjects:
Online Access:https://openhumanitiesdata.metajnl.com/articles/44