  • Stars: 133
  • Rank: 272,530 (Top 6%)
  • Language: Jupyter Notebook
  • License: MIT License
  • Created over 5 years ago
  • Updated 5 months ago

Repository Details

Natural Language Processing For Everyone

Code and slides to accompany the online series of webinars: https://data4sci.com/nlp by Data For Science.

Run the code in Binder.

The rise of online social platforms has resulted in an explosion of written text in the form of blogs, posts, tweets, wiki pages, and more. This new wealth of data provides a unique opportunity to explore natural language in its many forms, both as a way of automatically extracting information from written text and as a way of artificially producing text that looks natural.

In this class we introduce attendees to natural language processing from scratch. Each concept is explained through coding examples that use nothing more than plain Python and NumPy. In this way, attendees learn the underlying concepts and techniques in depth instead of just learning how to use a specific NLP library.

Schedule

1. Text Representation

  • Represent words and numbers
  • Use One-Hot Encoding
  • Implement Bag of Words
  • Apply stopwords
  • Understand TF/IDF
  • Understand Stemming
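
As a rough illustration of the first session's topics, the sketch below builds one-hot vectors, a bag of words with stopword removal, and TF-IDF weights in plain Python. The toy corpus, the stopword list, and all function names are assumptions made up for this example, not code from the course notebooks, and the TF-IDF formula shown is only one common variant.

```python
import math
from collections import Counter

# Hypothetical toy corpus and stopword list, chosen only for illustration.
documents = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the cat chased the dog",
]
STOPWORDS = {"the", "on"}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return [w for w in text.lower().split() if w not in STOPWORDS]


def bag_of_words(tokens):
    """Map each word to the number of times it occurs."""
    return Counter(tokens)


def tf_idf(docs):
    """Return one {word: weight} dict per document.

    Uses tf = count / document length and idf = log(N / document frequency).
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents does each word appear?
    doc_freq = Counter(w for tokens in tokenized for w in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = bag_of_words(tokens)
        weights.append({
            w: (c / len(tokens)) * math.log(n_docs / doc_freq[w])
            for w, c in counts.items()
        })
    return weights


# One-hot encoding: each vocabulary word gets a vector with a single 1.
vocabulary = sorted({w for d in documents for w in tokenize(d)})
one_hot = {
    w: [int(i == j) for j in range(len(vocabulary))]
    for i, w in enumerate(vocabulary)
}

weights = tf_idf(documents)
```

In this corpus "mat" occurs in a single document while "cat" occurs in two, so "mat" receives the larger weight in the first document. Stemming is not covered by this sketch.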

2. Topic Modeling

  • Find topics in documents
  • Perform Explicit Semantic Analysis
  • Understand document clustering
  • Implement Latent Semantic Analysis
  • Implement Non-negative Matrix Factorization
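
One way to picture Latent Semantic Analysis is as a truncated singular value decomposition of a term-document matrix. The sketch below uses NumPy's `np.linalg.svd` for this; the count matrix, the term list, and the helper names `lsa` and `cosine` are hypothetical and are not taken from the course material.

```python
import numpy as np

# Hypothetical term-document counts: rows are terms, columns are documents.
# Documents 0 and 1 are about pets, documents 2 and 3 about finance.
terms = ["cat", "dog", "pet", "stock", "market", "trade"]
counts = np.array([
    [3, 2, 0, 0],
    [2, 3, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 2, 1],
    [0, 0, 1, 2],
    [0, 0, 1, 1],
], dtype=float)


def lsa(matrix, n_topics):
    """Truncated SVD that keeps the n_topics largest singular values."""
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    term_topics = u[:, :n_topics]                          # terms x topics
    doc_topics = (s[:n_topics, None] * vt[:n_topics]).T    # documents x topics
    return term_topics, doc_topics


def cosine(a, b):
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


term_topics, doc_topics = lsa(counts, n_topics=2)

# Documents on the same subject end up close together in topic space.
same_topic = cosine(doc_topics[0], doc_topics[1])
different_topic = cosine(doc_topics[0], doc_topics[2])
```

With these counts the two pet documents point in the same direction in topic space, while a pet document and a finance document are close to orthogonal. The signs of the individual topic axes are arbitrary, so only similarities between vectors are meaningful.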

3. Sentiment Analysis

  • Quantify words and feelings
  • Use negations and modifiers
  • Understand corpus-based approaches
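
A minimal sketch of lexicon-based sentiment scoring with negations and modifiers might look like the following. The word lists, their numeric values, and the `sentiment` function are assumptions invented for illustration; real lexicons are far larger, and this sketch ignores punctuation.

```python
# Hypothetical lexicon, modifiers, and negations, for illustration only.
VALENCE = {"good": 1.0, "great": 2.0, "bad": -1.0, "terrible": -2.0}
MODIFIERS = {"very": 1.5, "slightly": 0.5}
NEGATIONS = {"not", "never", "no"}


def sentiment(text):
    """Sum word valences, scaling by modifiers and flipping sign on negation."""
    score = 0.0
    scale = 1.0
    for word in text.lower().split():
        if word in NEGATIONS:
            scale *= -1.0
        elif word in MODIFIERS:
            scale *= MODIFIERS[word]
        elif word in VALENCE:
            score += scale * VALENCE[word]
            # Negations and modifiers apply to the next sentiment word only.
            scale = 1.0
    return score
```

For example, `sentiment("not very bad")` combines the negation (-1), the modifier (1.5), and the valence of "bad" (-1.0) into a mildly positive score, while a sentence with one strongly positive and one strongly negative word cancels out.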

4. Applications

  • Understand Word2vec word embeddings
  • Define GloVe
  • Apply language detection
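
Language detection can be approximated by comparing character n-gram profiles. The sketch below is one simple approach under stated assumptions: the two training sentences, the trigram size, and the function names are all hypothetical, and a usable detector would need far more text per language. Word embeddings such as word2vec and GloVe are not covered here.

```python
from collections import Counter

# Hypothetical, tiny training samples; accents are omitted for simplicity.
SAMPLES = {
    "english": "the quick brown fox jumps over the lazy dog and then the dog sleeps",
    "spanish": "el rapido zorro marron salta sobre el perro perezoso y luego el perro duerme",
}


def char_ngrams(text, n=3):
    """Count overlapping character n-grams, padding with spaces at both ends."""
    padded = f" {text.lower()} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count profiles."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = sum(c * c for c in a.values()) ** 0.5
    norm_b = sum(c * c for c in b.values()) ** 0.5
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


PROFILES = {lang: char_ngrams(text) for lang, text in SAMPLES.items()}


def detect_language(text):
    """Return the language whose profile is most similar to the text."""
    grams = char_ngrams(text)
    return max(PROFILES, key=lambda lang: cosine(grams, PROFILES[lang]))
```

Calling `detect_language("the dog jumps")` selects the English profile because most of its trigrams appear in the English sample, and `detect_language("el perro salta")` selects the Spanish one for the same reason.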

Slides: http://data4sci.com/landing/nlp/

More Repositories

1. Causality (Jupyter Notebook, 463 stars)
2. Epidemiology101 (Jupyter Notebook, 282 stars): Epidemic Modeling for Everyone
3. Timeseries (Jupyter Notebook, 245 stars): Timeseries for everyone
4. DeepLearning (Jupyter Notebook, 136 stars): Deep Learning From Scratch
5. Probability (Jupyter Notebook, 114 stars): Applied Probability Theory for Everyone
6. Networks (Jupyter Notebook, 112 stars): Network and Graph Algorithms From Scratch
7. DataViz (Jupyter Notebook, 101 stars): Data Visualization With Matplotlib and Seaborn
8. CausalInference (Jupyter Notebook, 93 stars)
9. Excel (Jupyter Notebook, 62 stars): Transforming Excel Analysis into Python and pandas Data Models
10. AdvancedTimeseries (Jupyter Notebook, 48 stars)
11. Graphs4Sci (Jupyter Notebook, 44 stars)
12. AdvancedNLP (Jupyter Notebook, 38 stars)
13. RNN (Jupyter Notebook, 23 stars): Recurrent Neural Networks for Timeseries
14. InteractiveViz (Jupyter Notebook, 22 stars)
15. Timeseries_LL (Jupyter Notebook, 22 stars): Time Series Modeling: ML and Deep Learning Approaches with Python
16. OpenAI (Jupyter Notebook, 19 stars)
17. PyData2020 (Jupyter Notebook, 18 stars): Why And What If: Causal Inference for Everyone
18. ChatGPT (Jupyter Notebook, 14 stars)
19. LangChain (Jupyter Notebook, 14 stars)
20. blockchain-data (Jupyter Notebook, 11 stars): Blockchain Technology For Data Scientists
21. epidemik (Python, 11 stars): Compartmental Epidemic Models in Python
22. Seaborn (Jupyter Notebook, 10 stars): Data Visualization With Seaborn
23. DataMining (Jupyter Notebook, 9 stars)
24. Probability-And-Statistics (Jupyter Notebook, 8 stars)
25. PyData2021 (Jupyter Notebook, 8 stars)
26. G4DS (Jupyter Notebook, 8 stars)
27. LLM4DS (Jupyter Notebook, 7 stars)
28. NLP_LL (Jupyter Notebook, 5 stars)
29. Viz4Sci (Jupyter Notebook, 4 stars)
30. CoVID19 (Jupyter Notebook, 3 stars): A Data Driven Approach to Understanding COVID-19 with NetworkX
31. PyGotham2021 (Jupyter Notebook, 2 stars)