There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Repository Details
This project was developed to identify the language of text using bigrams extracted from large corpora as reference. A crawler downloaded the data from Wikipedia. The system uses 43 languages to extract the bigrams.