There are no reviews yet. Be the first to send feedback to the community and the maintainers!
nlp-in-practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.ROUGE-2.0
ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than Englishopinosis-summarization
This repo contains code and dataset for the Opinosis Summarization FrameworkOpinRank
OpinRank Dataset. Dataset containing user reviews for entities namely cars and hotels. Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews)clinical-concepts
Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical notes.spark-examples
Examples of code in sparkstop-words
Stop word listshashtags_test
Test hashtagsJavaPractice
Practice practice practice. Bubble sort, factorial, powerset, subarray, mergesort, remove duplicates, etc.Love Open Source and this site? Check out how you can help us