• Stars
    star
    775
  • Rank 58,632 (Top 2 %)
  • Language
    Jupyter Notebook
  • Created almost 5 years ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Set of real world data science tasks completed using the Python Pandas library

Pandas-Data-Science-Tasks

Set of real world data science tasks completed using the Python Pandas library.

Setup

To access all of the files I recommend you fork this repo and then clone it locally. Instructions on how to do this can be found here: https://help.github.com/en/github/getting-started-with-github/fork-a-repo

The other option is to click the green "clone or download" button and then click "Download ZIP". You then should extract all of the files to the location you want to edit your code.

Installing Jupyter Notebook: https://jupyter.readthedocs.io/en/latest/install.html
Installing Pandas library: https://pandas.pydata.org/pandas-docs/stable/install.html

Background Information:

This repo goes with my video on "Solving real world data science videos with Python Pandas!". Here is some information on that video.

In this video we use Python Pandas & Python Matplotlib to analyze and answer business questions about 12 months worth of sales data. The data contains hundreds of thousands of electronics store purchases broken down by month, product type, cost, purchase address, etc.

We start by cleaning our data. Tasks during this section include:

  • Drop NaN values from DataFrame
  • Removing rows based on a condition
  • Change the type of columns (to_numeric, to_datetime, astype)

Once we have cleaned up our data a bit, we move the data exploration section. In this section we explore 5 high level business questions related to our data:

  • What was the best month for sales? How much was earned that month?
  • What city sold the most product?
  • What time should we display advertisemens to maximize the likelihood of customerโ€™s buying product?
  • What products are most often sold together?
  • What product sold the most? Why do you think it sold the most?

To answer these questions we walk through many different pandas & matplotlib methods. They include:

  • Concatenating multiple csvs together to create a new DataFrame (pd.concat)
  • Adding columns
  • Parsing cells as strings to make new columns (.str)
  • Using the .apply() method
  • Using groupby to perform aggregate analysis
  • Plotting bar charts and lines graphs to visualize our results
  • Labeling our graphs

Check out the first video I did on Pandas:
https://youtu.be/vmEHCJofslg

Check out the videos I did on Matplotlib:
https://youtu.be/DAQNHzOcO5A
https://youtu.be/0P7QnIQDBJY

More Repositories

1

pandas

Data & Code for my video on the Pandas library of Python
Jupyter Notebook
902
star
2

NumPy

Jupyter Notebook & Data Associated with my Tutorial video on the Python NumPy Library
Jupyter Notebook
644
star
3

matplotlib_tutorial

Source code to go along with my tutorial to learn data visualization with the matplotlib library of Python
Jupyter Notebook
280
star
4

GUI

Source Code to go along with my video on how to program a gui in python using Tkinter
Python
214
star
5

sklearn

Data & Code associated with my tutorial on the sci-kit learn machine learning library in python
Jupyter Notebook
203
star
6

Connect4-Python

Connect 4 programmed in python using pygame
Python
183
star
7

generate-analytics-report

Generating Covid-19 Analytics Report PDFS with Python
Python
165
star
8

pycon2020

Natural Language Processing (NLP) in Python tutorial given for PyCon 2020 remote conference.
Jupyter Notebook
110
star
9

disney-data-science-tasks

Creation of a Disney Movie Dataset & Analysis using Python
Jupyter Notebook
87
star
10

web-scraping

Beautiful Soup web scraping tutorial
Jupyter Notebook
72
star
11

Basic-Python-Game

Source code that goes along with my video "How to program your first game! (in python)"
Python
65
star
12

neural-nets

Source code to go with my introductory video on neural nets in python
Jupyter Notebook
60
star
13

Alexa-Python

Python Files to be run from aws lambda to interact with Amazon Alexa :)
Python
38
star
14

Data-Science-Project-Ideas

Some code to go along with my video on data science project ideas
Jupyter Notebook
37
star
15

keithgalli.github.io

GitHub site where I plan to host some tutorial resources
HTML
34
star
16

Turtle-Python

Source Code from my YouTube video going over the turtle graphics library of python 3
Python
33
star
17

Data-Science-Tips

A repo where I'm going to start uploading short Python tip videos & exercises.
HTML
29
star
18

python-classes-tutorial

Code associated with my video "Everything you need to know about classes in Python": https://youtu.be/tmY6FEF8f1o
Python
25
star
19

scheduling-code

Python
24
star
20

lego-analysis

Jupyter Notebook
15
star
21

Podcast-Downloader

All code and resources for the multi-video research project we worked on studying the language used in tech.
Jupyter Notebook
13
star
22

Python-Tutorial-3

Code associated with the 3rd tutorial in my beginner python series
Python
12
star
23

TeamTrees

Visualizations of trees in Python created for the #TeamTrees movement to plant 20 million trees by 2020.
Python
12
star
24

rockpaperscissors

Source code for the two videos I did on programming rock, paper, scissors in python. First file is traditional implementation. Second file uses no if statements.
Python
11
star
25

Masterschool

Various resources that I have shared with my analytics bootcamp community
Jupyter Notebook
9
star
26

auto-publish-youtube-video

Source code that runs on Lambda to automatically publish a YouTube video when a certain subscriber count is reached
Python
8
star
27

game

Game code separated in several different parts so that we can progressively get more and more complex
Python
6
star
28

python-api-example

Python
5
star
29

MemoryAllocator

My Implementation of the C libraries memory allocation functions
C
5
star
30

colorization

Python
4
star
31

Leiserchess

Final Project for 6172 project in Fall 2015
C
4
star
32

regular-expressions

Code & text examples for my video tutorials on regular expressions (regex).
3
star
33

battlecode2015

Java
2
star