New under-sampling methods to address the problem of unbalanced sentiment classification: Application on Arabic datasets

TitreNew under-sampling methods to address the problem of unbalanced sentiment classification: Application on Arabic datasets
Publication TypeJournal Article
Year of Publication2016
AuthorsMountassir, A, Benbrahim, H, Berrada, I
JournalInternational Journal of Information and Communication Technology
Volume9
Pagination64-77
Abstract

This paper presents the study we have carried out to address the problem of unbalanced datasets in supervised sentiment classification in an Arabic context. We propose three different methods to under-sample the majority class documents. Our goal is to compare the effectiveness of the proposed methods with the common random under-sampling. We also aim to evaluate the behaviour of the classifier toward different under-sampling rates. We use three different common classifiers, namely Naïve Bayes, support vector machines and k-nearest neighbours. The experiments are carried out on two different Arabic datasets that we have built internally. We show that results obtained on the first dataset, which is slightly skewed, are better than those obtained on the second one which is highly skewed. We conclude also that Naïve Bayes is sensitive to dataset size, the more we reduce the data the more the results degrade. However, support vector machines are highly sensitive to unbalanced datasets. We record an instable behaviour of k-nearest neighbour. The results show also that we can rely on the proposed techniques and that they are typically competitive with random under-sampling. © 2016 Inderscience Enterprises Ltd.

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84979641121&doi=10.1504%2fIJICT.2016.077687&partnerID=40&md5=3261431f79a2a1c6ae94fa16e8ad8b01
DOI10.1504/IJICT.2016.077687
Revues: 

Partenaires

Localisation


Location map

Suivez-nous sur

  

Contactez-nous

ENSIAS

Avenue Mohammed Ben Abdallah Regragui, Madinat Al Irfane, BP 713, Agdal Rabat, Maroc

Résultat de recherche d'images pour "icone fax" Télécopie : (+212) 5 37 77 72 30

    Compteur de visiteurs:312,749
    Education - This is a contributing Drupal Theme
    Design by WeebPal.