• Stars
    362
  • Rank 113,944 (Top 3 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created over 4 years ago
  • Updated over 3 years ago

Repository Details

📘 Contains a series of hands-on notebooks for learning the fundamentals of NLP

Fundamentals of NLP

(Work in Progress!)

Natural language processing (NLP) has made substantial advances in the past few years thanks to the success of modern techniques based on deep learning. As NLP grows in popularity and large-scale data becomes available in many forms, it is more important than ever to understand the inner workings of NLP techniques and concepts from first principles, because they are finding their way into real-world applications that affect society at large. Building intuitions and having a solid grasp of concepts are both important for coming up with innovative techniques, improving research, and building safe, human-centered AI and NLP technologies.

We introduce a new series called Fundamentals of NLP, in which we aim to teach important NLP techniques and concepts starting from first principles. We will introduce the theoretical aspects and motivation of each concept covered throughout the series. Then we will gain hands-on experience by using bootstrap methods, industry-standard tools, and other open-source libraries to implement the different techniques. Along the way, we will also cover best practices, share important references, point out common mistakes to avoid when training and building NLP models, and discuss what lies ahead.

Join our Slack community to find out more about this and other ongoing projects. Feel free to reach out to me on Twitter for an invite to our Slack group.

Chapters

Chapter 1: Tokenization, Lemmatization, Stemming, and Sentence Segmentation -- Colab notebook, Web version
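
As a rough illustration of the four ideas named in Chapter 1, here is a minimal pure-Python sketch. It is not taken from the notebook: the function names, the suffix list, and the tiny `LEMMAS` table are all assumptions made for illustration, and the rules are deliberately naive. In practice you would more likely reach for a library such as NLTK or spaCy.

```python
import re


def segment_sentences(text):
    """Split text into sentences at '.', '!' or '?' followed by whitespace (naive rule)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Split a sentence into word tokens and single-character punctuation tokens."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def naive_stem(token):
    """Strip a few common English suffixes; a toy stand-in for a real stemmer."""
    word = token.lower()
    for suffix in ("ing", "ed", "es", "s"):
        # Keep at least three characters so short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


# Hypothetical lookup table: real lemmatizers rely on a full dictionary
# and usually on part-of-speech information as well.
LEMMAS = {"was": "be", "were": "be", "mice": "mouse", "better": "good"}


def naive_lemmatize(token):
    """Map a token to its dictionary form if it is in the toy table."""
    word = token.lower()
    return LEMMAS.get(word, word)


if __name__ == "__main__":
    text = "The cats were running. Mice jumped!"
    for sentence in segment_sentences(text):
        tokens = tokenize(sentence)
        print(tokens)
        print([naive_stem(t) for t in tokens])
        print([naive_lemmatize(t) for t in tokens])
```

Note how the toy stemmer turns "running" into "runn", which is not a word, while the lemmatizer maps "were" to the dictionary form "be". That contrast between crude suffix stripping and dictionary lookup is the main difference between stemming and lemmatization.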

How to Contribute?

  • You can check out our Project page to see all the ongoing tasks or issues related to this research project. Look out for the main nlp_fundamentals tag. Issues with the good first issue tag are good tasks to get started with.
  • You can also just check the issues tab.
  • You can ask anything related to this project in our Slack group.
  • Slack channel: #nlp_fundamentals

More Repositories

1. Prompt-Engineering-Guide (MDX, 42,643 stars)
   🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
2. ML-YouTube-Courses (14,210 stars)
   📺 Discover the latest machine learning / AI courses on YouTube.
3. ml-visuals (11,353 stars)
   🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
4. ML-Papers-of-the-Week (8,525 stars)
   🔥 Highlighting the top ML papers every week.
5. ML-Papers-Explained (6,643 stars)
   Explanation to key concepts in ML
6. ML-Course-Notes (5,857 stars)
   🎓 Sharing machine learning course / lecture notes.
7. Mathematics-for-ML (4,127 stars)
   🧮 A collection of resources to learn mathematics for machine learning
8. ML-Notebooks (Jupyter Notebook, 3,202 stars)
   🔥 Machine Learning Notebooks
9. Transformers-Recipe (1,488 stars)
   🧠 A study guide to learn about Transformers
10. nlp_paper_summaries (1,468 stars)
    ✍️ A carefully curated list of NLP paper summaries
11. GNNs-Recipe (1,046 stars)
    🟠 A study guide to learn about Graph Neural Networks (GNNs)
12. MLOPs-Primer (923 stars)
    A collection of resources to learn about MLOPs.
13. AI-Product-Index (744 stars)
    A curated index to track AI-powered products.
14. d2l-study-group (387 stars)
    🧠 Material for the Deep Learning Study Group
15. nlp_newsletter (300 stars)
    📰 Natural language processing (NLP) newsletter
16. awesome-ML-projects-guide (237 stars)
    A guide to building awesome machine learning projects.
17. dair-ai.github.io (HTML, 189 stars)
    Home of DAIR.AI
18. emotion_dataset (185 stars)
    😄 Dataset for Emotion Recognition Research
19. awesome-research-proposals-guide (175 stars)
    A guide to improve your research proposals.
20. ml-nlp-paper-discussions (151 stars)
    📄 A repo containing notes and discussions for our weekly NLP/ML paper discussions.
21. keep-learning-ml (89 stars)
    A club to keep learning about ML
22. notebooks (Jupyter Notebook, 37 stars)
    🔬 Sharing your data science notebooks with the community has never been this easy.
23. covid_19_search_application (Jupyter Notebook, 30 stars)
    Text Similarity Search Application using Modern NLP and Elasticsearch
24. odsc_2020_nlp (24 stars)
    Repository for ODSC talk related to Deep Learning NLP
25. research_emotion_analysis (Python, 18 stars)
    😄 Multilingual emotion analysis research
26. data_science_writing_primer (Jupyter Notebook, 17 stars)
    Writing Primer for Data Scientists
27. maven-pe-for-llms-4 (Jupyter Notebook, 16 stars)
    Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects
28. arxiv_analysis (Jupyter Notebook, 16 stars)
    A project to help explore research papers and fuel new discovery
29. pe-for-llms (Jupyter Notebook, 14 stars)
30. llm-evaluator (Jupyter Notebook, 14 stars)
    Example for Logging LLM Evaluator Prompt Responses
31. paper_implementations (14 stars)
    A project for implementing ML and NLP papers
32. maven-pe-for-llms (Jupyter Notebook, 12 stars)
33. nlp-roadmap (9 stars)
    A comprehensive roadmap to get informed of the NLP landscape.
34. ml-discussions (8 stars)
    Discussing ML research, engineering, papers, resources, learning paths, best practices, and much more.
35. maven-pe-for-llms-6 (Jupyter Notebook, 7 stars)
    Materials for the Prompt Engineering for LLMs (Cohort 6)
36. paper_presentations (6 stars)
    All paper presentation material will be added here
37. maven-pe-for-llms-7 (Jupyter Notebook, 5 stars)
    Code, Demos, and Exercises for Prompt Engineering for LLMs Course
38. nlp_research_highlights (5 stars)
    Contains all issues of the NLP Research Highlights series
39. maven-pe-for-llms-8 (Jupyter Notebook, 5 stars)
    Materials for the Prompt Engineering for LLMs (Cohort 8)
40. deep_affective_layer (4 stars)
    😄 Building a deep learning based affective computing platform
41. maven-pe-for-llms-2 (Jupyter Notebook, 3 stars)
42. .github (2 stars)
43. meetups (2 stars)
    Material for dair.ai meetups
44. tensorflow_notebooks (1 star)
    A repository containing Deep Learning and Machine Learning related TensorFlow notebooks.