DCU-UVT: Word-Level Language Classification with Code-Mixed Data

28Citations
Citations of this article
88Readers
Mendeley users who have this article in their library.

Abstract

This paper describes the DCU-UVT team's participation in the Language Identification in Code-Switched Data shared task in the Workshop on Computational Approaches to Code Switching. Word-level classification experiments were carried out using a simple dictionary-based method, linear kernel support vector machines (SVMs) with and without contextual clues, and a k-nearest neighbour approach. Based on these experiments, we select our SVM-based system with contextual clues as our final system and present results for the Nepali-English and Spanish-English datasets.

References Powered by Scopus

Learning to predict code-switching points

116Citations
144Readers

Part-of-Speech tagging for English-Spanish code-switched text

97Citations
123Readers

Normalizing tweets with edit scripts and recurrent neural embeddings

62Citations
160Readers

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Barman, U., Wagner, J., Chrupała, G., & Foster, J. (2014). DCU-UVT: Word-Level Language Classification with Code-Mixed Data. In 1st Workshop on Computational Approaches to Code Switching, Switching 2014 at the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014 - Proceedings (pp. 127–132). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-3915

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 23

64%

Lecturer / Post doc 6

17%

Researcher 6

17%

Professor / Associate Prof. 1

3%

Readers' Discipline

Tooltip

Computer Science 34

79%

Linguistics 6

14%

Business, Management and Accounting 2

5%

Decision Sciences 1

2%

Save time finding and organizing research with Mendeley

Sign up for free