  • Stars
    3
  • Rank 3,963,521 (Top 79 %)
  • Language
    Python
  • License
    GNU General Publi...
  • Created about 7 years ago
  • Updated about 7 years ago

Repository Details

This project processes and cleans the Tashkeela corpus (http://sourceforge.net/projects/tashkeela/) and builds a dictionary of Arabic words with diacritics.
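
As a rough illustration of what building such a dictionary can involve, the sketch below maps each undiacritized word form to the set of diacritized forms seen for it. It is not the repository's actual code: the function names `strip_diacritics` and `build_diacritics_dictionary`, the choice of the Unicode range U+064B to U+0652 as "diacritics", and the tokenization rule are all assumptions made for this example.

```python
import re
from collections import defaultdict

# Assumption: "diacritics" means the eight tashkeel marks from
# fathatan (U+064B) through sukun (U+0652).
DIACRITICS = re.compile(r"[\u064B-\u0652]")

# Assumption: a token is a run of Arabic letters (U+0621 to U+064A,
# which also includes tatweel) and the diacritics above.
ARABIC_TOKEN = re.compile(r"[\u0621-\u0652]+")


def strip_diacritics(word):
    """Return the word with all tashkeel marks removed."""
    return DIACRITICS.sub("", word)


def build_diacritics_dictionary(lines):
    """Map each bare word form to the set of diacritized forms found."""
    dictionary = defaultdict(set)
    for line in lines:
        for token in ARABIC_TOKEN.findall(line):
            bare = strip_diacritics(token)
            # Skip tokens that carry no diacritics, and tokens that
            # consist of diacritics only.
            if bare and bare != token:
                dictionary[bare].add(token)
    return dictionary


if __name__ == "__main__":
    # Hypothetical two-line sample standing in for corpus text.
    sample = ["كَتَبَ الوَلَدُ", "كُتِبَ الدَّرْسُ"]
    for bare, forms in build_diacritics_dictionary(sample).items():
        print(bare, sorted(forms))
```

A real cleaning pass would likely also need to handle tatweel, mixed-script tokens, and partially diacritized words; those decisions are left open here.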

More Repositories

1

process-arabic-text

Pre-process Arabic text (remove diacritics, punctuation, and repeated characters)
Python
94
star
2

arabic-sentiment-analysis

Sentiment Analysis in Arabic tweets
Jupyter Notebook
71
star
3

comparable-text-miner

Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary translation, documents alignment, corpus information, text classification, tf-idf computation, text similarity computation, html documents cleaning
Python
33
star
4

tweets-collector

Collect tweets (a tweet corpus) using the Twitter API. Collection can be based on hashtags, keywords, or geographical location
Python
26
star
5

ai-csci4304

AI Course
Jupyter Notebook
19
star
6

arabic-light-stemmer

Arabic light stemmer. Light stemming for Arabic words removes prefixes and suffixes and normalizes words
Java
17
star
7

khoja-stemmer-command-line

A command-line version of the Khoja Stemmer (an Arabic rooting algorithm)
Java
17
star
8

ara-pronunciation-tool

A Python tool that converts diacritized Arabic text to a sequence of phonemes and creates a pronunciation dictionary. This code is based on https://github.com/nawarhalabi/Arabic-Phonetiser
Python
14
star
9

arabic-light-stemming-py

Arabic Light stemming with Python
Python
13
star
10

Arabic-News

Arabic News
Jupyter Notebook
12
star
11

arabic-spell-check-py

Spell checking for Arabic text using Python
Python
12
star
12

NLP-ICTS6361

NLP Course (ICTS6361)
HTML
11
star
13

emotion-lexicon

Arabic - English emotion lexicon
10
star
14

NLP-ICTS6361-2020

NLP Course (ICTS6361) - 2020
Jupyter Notebook
8
star
15

arwikiExtracts

Arabic Wikipedia Extracts
Python
7
star
16

Quran-QA

Quran QA
Jupyter Notebook
7
star
17

work-online-ds

Work online project - data science training
Jupyter Notebook
7
star
18

arabic-hatespeech-data

Arabic hate speech data
7
star
19

WikiDocsAligner

Align Wikipedia articles based on interlanguage links
Python
7
star
20

Deep-Learning-ICTS6361-2023

Deep Learning ICTS6361 2023
6
star
21

fit-bot-android

A chat bot for FIT (Android App)
Java
5
star
22

hate-speech

hate speech project (master)
Jupyter Notebook
5
star
23

anamil-natiqa

Memory match game to learn Arabic sign writing
ASP.NET
5
star
24

machine-translation-master-course

machine translation course
HTML
5
star
25

Arabic-Fakenews

Fake-news
Jupyter Notebook
5
star
26

arabic-dialects-id

Arabic dialects identification system
4
star
27

think-python-workshop

think python workshop
Jupyter Notebook
4
star
28

QAR

QAR: Quick Attendance registration system using NFC technology
Java
4
star
29

bbc-crawler

crawl news documents from BBC Arabic
Python
4
star
30

ShellCMDs4NLP

Useful shell commands for NLP people
Shell
4
star
31

smart-glass

Smart Glass
Python
3
star
32

wdmm1402-2019

Python Programming I
Python
3
star
33

Data-Science-CSCI-3320

Data Science CSCI 3320 BSc course
Jupyter Notebook
3
star
34

exam-db-old

Exam database CMS system
HTML
3
star
35

infant-cry-care

infant cry care
Jupyter Notebook
3
star
36

ar-wiki-topics

Arabic Wikipedia topics model using gensim library
Python
3
star
37

comparableWikiCoprus

Comparable Wikipedia Corpus (aligned documents)
3
star
38

NLP-ICTS6361-2023

NLP-ICTS6361-2023
Jupyter Notebook
3
star
39

Arabic-Proagenda-Detection

Arabic Propaganda Detection
Jupyter Notebook
3
star
40

arb-ipa-dict-man-tool

IPA Arabic phonetic dictionary
Java
3
star
41

egy-arb-dialect-id

Egyptian / Modern Standard Arabic language identification system
Python
3
star
42

emwre-env-prog

3
star
43

WDMM-1402

Multimedia Programming I - WDMM 1402 (Python Programming I)
Python
2
star
44

Math-Statistics-DataScience

Math and Statistics for Data Science
Jupyter Notebook
2
star
45

split-waw-arabic

Separate the conjunction Waw from Arabic words
Python
2
star
46

Users-stories-named-entities

Jupyter Notebook
2
star
47

motazsaad

Config files for my GitHub profile.
2
star
48

410700

Python Programming for Analytics
Jupyter Notebook
2
star
49

e-commerce-course

eCommerce course
2
star
50

examdb

Exam database CMS system
HTML
2
star
51

sphinx-eval

CMU sphinx evaluation scripts
Shell
2
star
52

wac-ds

Jupyter Notebook
2
star
53

Arabic-Stories-Corpus

Arabic Stories Corpus
HTML
2
star
54

login-ds

Jupyter Notebook
2
star
55

names-normalizer-arb

Normalize Names (Arabic and Foreign) in Arabic
Python
2
star
56

WDMM1405

Multimedia Programming II - WDMM1405 (Python Programming II)
HTML
2
star
57

WDMM1405-Flask

WDMM1405 (Python Programming II), Flask Examples source code
JavaScript
2
star
58

Gen-AI-for-Researchers

Gen AI for Researchers
Jupyter Notebook
2
star
59

moodle_testbank_xml2xlsx

Convert Moodle true/false (TF) and multiple-choice (MCQ) questions from Moodle XML format to Excel XLSX
PHP
2
star
60

Arabic-NLP-tools

Arabic NLP tools
1
star
61

grad-sys-ios

Graduation Tracking System iOS
Swift
1
star
62

corpus2json

Convert a text corpus (directories and files) into a JSON file.
Python
1
star
63

osac-corpus

OSAC corpus
1
star
64

grad-sys-web

Graduation Project Tracking system
HTML
1
star
65

mobi-omr

Optical Mark recognition on mobile devices
1
star
66

assembly

Assembly code examples
Assembly
1
star
67

language-modeling

language modeling
Shell
1
star
68

Natural-Language-to-Python

Natural Language to Python code Translation
Jupyter Notebook
1
star
69

QuranApp

QuranApp
HTML
1
star
70

linux-commands-tutorial

Linux commands tutorial
1
star
71

jsc-crawler

crawl news documents from JSC
Python
1
star
72

campain-title-gen

Campaign title generator
Shell
1
star
73

graduation-research

graduation research (CSCI4108) lecture notes
1
star
74

audio-processor

Process audio files (convert to WAV format, split WAV files, ...)
Python
1
star
75

Flask_TF

Flask & Tensorflow
Jupyter Notebook
1
star
76

jsc-news-broadcast

JSC news broadcast (speech corpus)
Python
1
star
77

ibridge-ds

iBridge project data science training
Jupyter Notebook
1
star
78

human-systems-3d-models

Human anatomy systems 3D models
JavaScript
1
star
79

Arabic-Propaganda-Detection-Research

Arabic Propaganda Detection Research
Jupyter Notebook
1
star
80

grad-sys-android

Graduation Tracking System Android
Java
1
star
81

arzWikiExtracts

Egyptian Wikipedia Extracts
1
star