An open-source NLP research library, built on PyTorch.
TensorFlow code and pre-trained models for BERT
Fixes contractions such as `you're` to `you are`
Fuzzy String Matching in Python
Topic Modelling for Humans
π LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
NLTK Source
Multilingual text (NLP) processing toolkit
State-of-the-Art Text Embeddings
π« Industrial-strength Natural Language Processing (NLP) in Python
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Lightning Fast Language Prediction π
A tool for extracting plain text from Wikipedia dumps
A little word cloud generator in Python