dair-ai/nlp_fundamentals

Stars
364
Rank 117,101 (Top 3 %)
Language
Jupyter Notebook
License
MIT License
Created almost 5 years ago
Updated about 4 years ago

dair-ai/nlp_fundamentals

dair-ai

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

📘 Contains a series of hands-on notebooks for learning the fundamentals of NLP

Fundamentals of NLP

(Work in Progress!)

Natural language processing (NLP) has made substantial advances in the past few years due to the success of modern techniques that are based on deep learning. With the rise of the popularity of NLP and the availability of different forms of large-scale data, it is now even more imperative to understand the inner workings of NLP techniques and concepts, from first principles, as they find their way into real-world usage and applications that affect society at large. Building intuitions and having a solid grasp of concepts are both important for coming up with innovative techniques, improving research, and building safe, human-centered AI and NLP technologies.

We introduce a new series called Fundamentals of NLP where we aim to teach about important NLP techniques and concepts starting from the first principles. We will introduce the theoretical aspect and motivation of each concept covered throughout the series. Then we will obtain hands-on experience by using bootstrap methods, industry-standard tools, and other open-source libraries to implement the different techniques. Along the way, we will also cover best practices, share important references, point out common mistakes to avoid when training and building NLP models, and discuss what lies ahead.

Join our Slack community to find our more about this and other ongoing projects. Feel free to reach out to me on Twitter for an invite to our Slack group.

Chapters

Chapter 1: Tokenization, Lemmatization, Stemming, and Sentence Segmentation -- Colab notebook, Web version

How to Contribute?

You can check out our Project page to see all the ongoing tasks or issues related to this research project. Lookout for the main nlp_fundamentals tag. Issues with the good first issue tag are good tasks to get started with.
You can also just check the issues tab.
You can ask anything related to this project in our Slack group.
Slack channel: #nlp_fundamentals

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

ML-Papers-Explained

Explanation to key concepts in ML

ML-Course-Notes

🎓 Sharing machine learning course / lecture notes.

Mathematics-for-ML

🧮 A collection of resources to learn mathematics for machine learning

ML-Notebooks

🔥 Machine Learning Notebooks

Jupyter Notebook

Transformers-Recipe

🧠 A study guide to learn about Transformers

nlp_paper_summaries

✍️ A carefully curated list of NLP paper summaries

GNNs-Recipe

🟠 A study guide to learn about Graph Neural Networks (GNNs)

MLOPs-Primer

A collection of resources to learn about MLOPs.

AI-Product-Index

A curated index to track AI-powered products.

d2l-study-group

🧠 Material for the Deep Learning Study Group

nlp_newsletter

📰Natural language processing (NLP) newsletter

awesome-ML-projects-guide

A guide to building awesome machine learning projects.

dair-ai.github.io

Home of DAIR.AI

emotion_dataset

😄 Dataset for Emotion Recognition Research

awesome-research-proposals-guide

A guide to improve your research proposals.

ml-nlp-paper-discussions

📄 A repo containing notes and discussions for our weekly NLP/ML paper discussions.

keep-learning-ml

A club to keep learning about ML

notebooks

🔬 Sharing your data science notebooks with the community has never been this easy.

Jupyter Notebook

covid_19_search_application

Text Similarity Search Application using Modern NLP and Elasticsearch

Jupyter Notebook

odsc_2020_nlp

Repository for ODSC talk related to Deep Learning NLP

research_emotion_analysis

😄 Multilingual emotion analysis research

maven-pe-for-llms-4

Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects

Jupyter Notebook

data_science_writing_primer

Writing Primer for Data Scientists

Jupyter Notebook

arxiv_analysis

A project to help explore research papers and fuel new discovery

Jupyter Notebook

pe-for-llms

Jupyter Notebook

llm-evaluator

Example for Logging LLM Evaluator Prompt Responses

Jupyter Notebook

paper_implementations

A project for implementing ML and NLP papers

maven-pe-for-llms

Jupyter Notebook

nlp-roadmap

A comprehensive roadmap to get informed of the NLP landscape.

ml-discussions

Discussing ML research, engineering, papers, resources, learning paths, best practices, and much more.

maven-pe-for-llms-6

Materials for the Prompt Engineering for LLMs (Cohort 6)

Jupyter Notebook

maven-pe-for-llms-8

Materials for the Prompt Engineering for LLMs (Cohort 8)

Jupyter Notebook

maven-pe-for-llms-7

Code, Demos, and Exercises for Prompt Engineering for LLMs Course

Jupyter Notebook

maven-pe-for-llms-12

Course material for Prompt Engineering for LLMs

Jupyter Notebook

maven-pe-for-llms-9

Materials for Prompt Engineering for LLMs (Cohort 9)

Jupyter Notebook

paper_presentations

All paper presentation material will be added here

nlp_research_highlights

Contains all issues of the NLP Research Highlights series

deep_affective_layer

😄 Building a deep learning based affective computing platform

maven-pe-for-llms-2

Jupyter Notebook

datasets

maven-pe-for-llms-11

Materials for the Prompt Engineering for LLMs Course (Cohort 11)

Jupyter Notebook

.github

meetups

Material for dair.ai meetups

tensorflow_notebooks

A repository containing Deep Learning and Machine Learning related TensorFlow notebooks.

maven-pe-for-llms-10

Materials for Cohort 10

Jupyter Notebook