camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
CAMeLBERT
Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.
Arabic_ALA-LC_Romanization
Romanizing Arabic bibliographic records in the ALA-LC standard.
WIDH_2020_Arabic_Text_Analysis
Material for the Text Analysis of Arabic course taught at the NYU Abu Dhabi Winter Institute in Digital Humanities 2020.
samer-add-on
arabic_error_type_annotation
The Arabic Error Type Annotation tool annotates Arabic error types following the ALC tagset.
palmyra
Gumar-Ngrams
The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.
arabic-gec
Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.
camel_parser
camel_morph
Camel Morph's goal is to build large open-source morphological models for Arabic and its dialects across many genres and domains.
deSeg
Unsupervised, De-lexical, Linguistic Segmentation
gender-reinflection
Code, models, and data for "Gender-Aware Reinflection using Linguistically Enhanced Neural Models". COLING 2020, GeBNLP.
ced_word_alignment
A character edit distance based word aligner.
camel-tools-data
Repo containing data packages and catalogues used by CAMeL Tools.
arafix_ocr
A tool for improving the output of generic Arabic OCR systems using an n-gram based post-correction approach.
TOIA-2.0
muddler
The Muddler derived-file sharing utility.
gender-rewriting
Code, models, and data for "User-Centric Gender Rewriting". NAACL 2022.
seq2seq-transliteration-tool
maknuune_lexicon
CAMeLBERT_morphosyntactic_tagger
Code, models, and data for "Morphosyntactic Tagging with Pre-trained Language Models for Arabic and its Dialects". Findings of ACL, 2022.
camel-guidelines
HierarchicalArabicDialectID
Arabic-ATB-closed-class-list
A Modern Standard Arabic Closed-Class Word List
qalb
Code for "Utilizing Character and Word Embeddings for Text Normalization with Sequence-to-Sequence Models"
gender-rewriting-shared-task
Evaluation code and data for the gender rewriting shared task