• Stars
    star
    364
  • Rank 117,101 (Top 3 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created almost 5 years ago
  • Updated about 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

๐Ÿ“˜ Contains a series of hands-on notebooks for learning the fundamentals of NLP

Fundamentals of NLP

(Work in Progress!)

Natural language processing (NLP) has made substantial advances in the past few years due to the success of modern techniques that are based on deep learning. With the rise of the popularity of NLP and the availability of different forms of large-scale data, it is now even more imperative to understand the inner workings of NLP techniques and concepts, from first principles, as they find their way into real-world usage and applications that affect society at large. Building intuitions and having a solid grasp of concepts are both important for coming up with innovative techniques, improving research, and building safe, human-centered AI and NLP technologies.

We introduce a new series called Fundamentals of NLP where we aim to teach about important NLP techniques and concepts starting from the first principles. We will introduce the theoretical aspect and motivation of each concept covered throughout the series. Then we will obtain hands-on experience by using bootstrap methods, industry-standard tools, and other open-source libraries to implement the different techniques. Along the way, we will also cover best practices, share important references, point out common mistakes to avoid when training and building NLP models, and discuss what lies ahead.

Join our Slack community to find our more about this and other ongoing projects. Feel free to reach out to me on Twitter for an invite to our Slack group.

Chapters

Chapter 1: Tokenization, Lemmatization, Stemming, and Sentence Segmentation -- Colab notebook, Web version

How to Contribute?

  • You can check out our Project page to see all the ongoing tasks or issues related to this research project. Lookout for the main nlp_fundamentals tag. Issues with the good first issue tag are good tasks to get started with.
  • You can also just check the issues tab.
  • You can ask anything related to this project in our Slack group.
  • Slack channel: #nlp_fundamentals

More Repositories

1

Prompt-Engineering-Guide

๐Ÿ™ Guides, papers, lecture, notebooks and resources for prompt engineering
MDX
47,520
star
2

ML-YouTube-Courses

๐Ÿ“บ Discover the latest machine learning / AI courses on YouTube.
14,690
star
3

ml-visuals

๐ŸŽจ ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
13,103
star
4

ML-Papers-of-the-Week

๐Ÿ”ฅHighlighting the top ML papers every week.
9,856
star
5

ML-Papers-Explained

Explanation to key concepts in ML
7,016
star
6

ML-Course-Notes

๐ŸŽ“ Sharing machine learning course / lecture notes.
5,980
star
7

Mathematics-for-ML

๐Ÿงฎ A collection of resources to learn mathematics for machine learning
4,399
star
8

ML-Notebooks

๐Ÿ”ฅ Machine Learning Notebooks
Jupyter Notebook
3,270
star
9

Transformers-Recipe

๐Ÿง  A study guide to learn about Transformers
1,521
star
10

nlp_paper_summaries

โœ๏ธ A carefully curated list of NLP paper summaries
1,476
star
11

GNNs-Recipe

๐ŸŸ  A study guide to learn about Graph Neural Networks (GNNs)
1,095
star
12

MLOPs-Primer

A collection of resources to learn about MLOPs.
925
star
13

AI-Product-Index

A curated index to track AI-powered products.
755
star
14

d2l-study-group

๐Ÿง  Material for the Deep Learning Study Group
388
star
15

nlp_newsletter

๐Ÿ“ฐNatural language processing (NLP) newsletter
300
star
16

awesome-ML-projects-guide

A guide to building awesome machine learning projects.
242
star
17

dair-ai.github.io

Home of DAIR.AI
HTML
208
star
18

emotion_dataset

๐Ÿ˜„ Dataset for Emotion Recognition Research
197
star
19

awesome-research-proposals-guide

A guide to improve your research proposals.
185
star
20

ml-nlp-paper-discussions

๐Ÿ“„ A repo containing notes and discussions for our weekly NLP/ML paper discussions.
149
star
21

keep-learning-ml

A club to keep learning about ML
89
star
22

notebooks

๐Ÿ”ฌ Sharing your data science notebooks with the community has never been this easy.
Jupyter Notebook
37
star
23

covid_19_search_application

Text Similarity Search Application using Modern NLP and Elasticsearch
Jupyter Notebook
29
star
24

odsc_2020_nlp

Repository for ODSC talk related to Deep Learning NLP
23
star
25

research_emotion_analysis

๐Ÿ˜„ Multilingual emotion analysis research
Python
19
star
26

maven-pe-for-llms-4

Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects
Jupyter Notebook
17
star
27

data_science_writing_primer

Writing Primer for Data Scientists
Jupyter Notebook
17
star
28

arxiv_analysis

A project to help explore research papers and fuel new discovery
Jupyter Notebook
16
star
29

pe-for-llms

Jupyter Notebook
14
star
30

llm-evaluator

Example for Logging LLM Evaluator Prompt Responses
Jupyter Notebook
14
star
31

paper_implementations

A project for implementing ML and NLP papers
13
star
32

maven-pe-for-llms

Jupyter Notebook
12
star
33

nlp-roadmap

A comprehensive roadmap to get informed of the NLP landscape.
9
star
34

ml-discussions

Discussing ML research, engineering, papers, resources, learning paths, best practices, and much more.
8
star
35

maven-pe-for-llms-6

Materials for the Prompt Engineering for LLMs (Cohort 6)
Jupyter Notebook
8
star
36

maven-pe-for-llms-8

Materials for the Prompt Engineering for LLMs (Cohort 8)
Jupyter Notebook
8
star
37

maven-pe-for-llms-7

Code, Demos, and Exercises for Prompt Engineering for LLMs Course
Jupyter Notebook
6
star
38

maven-pe-for-llms-12

Course material for Prompt Engineering for LLMs
Jupyter Notebook
6
star
39

maven-pe-for-llms-9

Materials for Prompt Engineering for LLMs (Cohort 9)
Jupyter Notebook
6
star
40

paper_presentations

All paper presentation material will be added here
5
star
41

nlp_research_highlights

Contains all issues of the NLP Research Highlights series
5
star
42

deep_affective_layer

๐Ÿ˜„ Building a deep learning based affective computing platform
3
star
43

maven-pe-for-llms-2

Jupyter Notebook
3
star
44

datasets

AI Datasets
3
star
45

maven-pe-for-llms-11

Materials for the Prompt Engineering for LLMs Course (Cohort 11)
Jupyter Notebook
3
star
46

.github

2
star
47

meetups

Material for dair.ai meetups
2
star
48

tensorflow_notebooks

A repository containing Deep Learning and Machine Learning related TensorFlow notebooks.
1
star
49

maven-pe-for-llms-10

Materials for Cohort 10
Jupyter Notebook
1
star