There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Repository Details
Configurable Text normaliser and tokeniser for Arabic texts. It normalises diacritics, Hamaza, digits, etc. It tokenises punctuation, digits, etc from text. It enforces text standard in several aspects: diacritics, letters, and tokenisation.