nlp-in-practice
Starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
ROUGE-2.0
ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.
phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English.
opinosis-summarization
This repo contains code and dataset for the Opinosis Summarization Framework.
OpinRank
OpinRank Dataset. Dataset containing user reviews for entities, namely cars and hotels. Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).
spark-examples
Examples of code in Spark.
stop-words
Stop word lists.
hashtags_test
Test hashtags.
Micropinion-Generation-Dataset
Dataset for Micropinion Generation. Dataset is based on user reviews from CNET. The reviews are on products from various categories like TV, cell phones, GPS, etc.
JavaPractice
Practice practice practice. Bubble sort, factorial, powerset, subarray, mergesort, remove duplicates, etc.