Domain Adaptation for Arabic Cross-Domain and Cross-Dialect Sentiment Analysis from Contextualized Word Embedding

26Citations
Citations of this article
76Readers
Mendeley users who have this article in their library.

Abstract

Finetuning deep pre-trained language models has shown state-of-the-art performances on a wide range of Natural Language Processing (NLP) applications. Nevertheless, their generalization performance drops under domain shift. In the case of Arabic language, diglossia makes building and annotating corpora for each dialect and/or domain a more challenging task. Unsupervised Domain Adaptation tackles this issue by transferring the learned knowledge from labeled source domain data to unlabeled target domain data. In this paper, we propose a new unsupervised domain adaptation method for Arabic cross-domain and cross-dialect sentiment analysis from Contextualized Word Embedding. Several experiments are performed adopting the coarse-grained and the fine-grained taxonomies of Arabic dialects. The obtained results show that our method yields very promising results and outperforms several domain adaptation methods for most of the evaluated datasets. On average, our method increases the performance by an improvement rate of 20.8% over the zero-shot transfer learning from BERT.

References Powered by Scopus

Deep CORAL: Correlation alignment for deep domain adaptation

2325Citations
N/AReaders
Get full text

ASTD: Arabic sentiment tweets dataset

325Citations
N/AReaders
Get full text

Arabic Dialect Identification

202Citations
N/AReaders
Get full text

Cited by Powered by Scopus

DIP: Dual Incongruity Perceiving Network for Sarcasm Detection

38Citations
N/AReaders
Get full text

Aspect-based sentiment analysis: an overview in the use of Arabic language

27Citations
N/AReaders
Get full text

Augmented language model with deep learning adaptation on sentiment analysis for E-learning recommendation

26Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

El Mekki, A., El Mahdaouy, A., Berrada, I., & Khoumsi, A. (2021). Domain Adaptation for Arabic Cross-Domain and Cross-Dialect Sentiment Analysis from Contextualized Word Embedding. In NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 2824–2837). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.naacl-main.226

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 20

71%

Lecturer / Post doc 3

11%

Researcher 3

11%

Professor / Associate Prof. 2

7%

Readers' Discipline

Tooltip

Computer Science 24

80%

Linguistics 4

13%

Neuroscience 1

3%

Social Sciences 1

3%

Save time finding and organizing research with Mendeley

Sign up for free