The Large Annotated Corpus for the Arabic Language (LACAL)

TitreThe Large Annotated Corpus for the Arabic Language (LACAL)
Publication TypeJournal Article
Year of Publication2022
AuthorsYousfi, A, Boumehdi, A, Laaroussi, S, Makoudi, R, Aouragh, SL, Gueddah, H, Habibi, B, Nejja, M, Said, I
JournalStudies in Computational Intelligence
Volume1061
Pagination205-219
Abstract

Annotated corpora has an important role in the NLP field. They are used in almost all NLP applications: automatic dictionary construction, text analysis, information retrieval, machine translation, etc. Annotated corpora are the basis for training operation in NLP systems. Without these corpora, it is difficult to build an efficient system that takes into account all variations and linguistic phenomena. In this paper, we present the annotated corpus we developed. This corpus contains more than 12 million different words labeled by different types of labels: syntactic, morphological, and semantic. This large corpus adds value to the Arabic NLP field, and will certainly improve the quality of the training phase of Arabic NLP systems. Moreover it can be a suitable corpus to test and evaluate the quality of these systems. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85139393388&doi=10.1007%2f978-3-031-14748-7_12&partnerID=40&md5=5e4ea0e7df29b510ad4dae099d9f1725
DOI10.1007/978-3-031-14748-7_12
Revues: 

Partenaires

Localisation

Suivez-nous sur

         

    

Contactez-nous

ENSIAS

Avenue Mohammed Ben Abdallah Regragui, Madinat Al Irfane, BP 713, Agdal Rabat, Maroc

  Télécopie : (+212) 5 37 68 60 78

  Secrétariat de direction : 06 61 48 10 97

        Secrétariat général : 06 61 34 09 27

        Service des affaires financières : 06 61 44 76 79

        Service des affaires estudiantines : 06 62 77 10 17 / n.mhirich@um5s.net.ma

        Résidences : 06 61 82 89 77

Contacts

    

Education - This is a contributing Drupal Theme
Design by WeebPal.