There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Repository Details
This is pipeline module for cerating bengali word clusters. Word clusters can be a very useful feature when it comes to Natular Language processing.It can be used as feature for several NLP tasks, for example, document classification, named entity recognition etc. It use word2vec to generate embedding models, Rtsne algorithm to reduce the dimentions, K-means to create and visualize clusters. This project uses Python to generate word2vec model, R to create and visualize word clusters and JAVA for preprocessing and labeling.