• Stars
    star
    169
  • Rank 224,453 (Top 5 %)
  • Language
    Jupyter Notebook
  • Created almost 9 years ago
  • Updated over 8 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Course materials for teaching data visualization in Python.

Python 2.7 Python 3.5 License

Data visualization with Python course

© 2016 Randal S. Olson

In this repository, you will find course materials for teaching data visualization with Python. I'll be developing these teaching materials over time to provide a hands-on course that will take you from beginner to intermediate level of programming data visualizations.

Some of these materials are based on the video course I developed with O'Reilly called Data Visualization Basics with Python.

Course Prerequisites

I designed this course with the expectation that you will have a basic working knowledge of Python, pandas, and the Jupyter Notebook. If you're unfamiliar with any of these topics, I have provided links to free online learning materials for them below.

Python has many excellent tutorials and books to learn from, many of which are listed in the /r/LearnPython wiki.

pandas has several tutorials covering its myriad features.

The Jupyter Notebook has thorough documentation for how to install and use it.

Table of Contents

Section 1: Introduction

This section simply introduces the course, the presenter(s), and what to expect from the course. There are no instructional materials for this section.

Section 2: Basics of data visualization

This section covers the basics of data visualization and design, such as choosing the right chart type for your message, good practices, and common pitfalls in data visualization. You will learn the fundamental lessons that anyone aspiring to become a great data visualization practitioner must learn.

Section 3: Data visualization in Python - matplotlib

This section dives into coding data visualizations with Python, in particular with the matplotlib library. You will learn how to code bar charts, line charts, scatter plots, and many more basic chart types.

More sections are currently being developed, including sections on statistical visualization with Seaborn and interactive visualization with Bokeh.

Contributing

If you would like to contribute to this course, please file an issue to discuss what you'd like to contribute. We're generally open to contributions, but we also have a specific scope and purpose for this course that we would like to stick to.

License and Usage Terms

Instructional Material

All instructional material in this repository is made available under the Creative Commons Attribution license. The following is a human-readable summary of (and not a substitute for) the full legal text of the CC BY 4.0 license.

You are free to:

  • Share—copy and redistribute the material in any medium or format
  • Adapt—remix, transform, and build upon the material

for any purpose, even commercially.

The licensor cannot revoke these freedoms as long as you follow the license terms.

Under the following terms:

  • Attribution—You must give appropriate credit (mentioning that your work is derived from work that is © Randal S. Olson and, where practical, linking to http://www.randalolson.com/), provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.

No additional restrictions—You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.

Notices:

  • You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation.
  • No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.

Software

Except where otherwise noted, the example programs and other software provided in this repository are made available under the OSI-approved MIT license.

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

More Repositories

1

Data-Analysis-and-Machine-Learning-Projects

Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Jupyter Notebook
6,107
star
2

TwitterFollowBot

A Python bot that automates several actions on Twitter, such as following users and favoriting tweets.
Python
1,309
star
3

datacleaner

A Python tool that automatically cleans data sets and readies them for analysis.
Python
1,054
star
4

reddit-analysis

A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies.
Python
285
star
5

optimal-roadtrip-usa

Contains maps for the article, "Computing the optimal road trip across the U.S." and similar articles
HTML
230
star
6

sklearn-benchmarks

A centralized repository to report scikit-learn model performance across a variety of parameter settings and data sets.
Jupyter Notebook
210
star
7

reddit-twitter-bot

Looks up posts from reddit and automatically posts them on Twitter.
Python
137
star
8

name-age-calculator

Analyzes a name and guesses the age range of a person with that name.
HTML
43
star
9

redditviz

An interactive map of reddit: the "front page of the internet"
CSS
38
star
10

MarkovNetwork

Python implementation of Markov Networks for neural computing.
Python
36
star
11

ipython-notebook-workshop

Beginner's IPython Notebook Tutorial
19
star
12

baby-name-explorer

HTML
17
star
13

network-analysis-scripts

A bunch of useful scripts for analyzing networks.
Python
13
star
14

active-categorical-classifier

A tool that evolves small brains capable of scanning and classifying an image.
Jupyter Notebook
12
star
15

k-fold-cv-benchmark

Python
9
star
16

optimized-us-capitol-road-trip

HTML
9
star
17

crowd-machines

Jupyter Notebook
8
star
18

xrff2csv

A Python tool that converts XRFF files to CSV format.
Python
7
star
19

edd

A tool that evolves small brains capable of scanning and classifying an image.
C++
7
star
20

rhiever.github.io

Dr. Randal Olson's personal website
HTML
5
star
21

Collective-Cognition-Increases-Accuracy

Code for the model in the paper, "Accurate decisions in an uncertain world: collective cognition increases true positives while decreasing false positives."
Python
5
star
22

rhiever-bot

Bot that monitors /r/MUWs and runs the MUW script.
Python
4
star
23

big-ten-twitter-network

Interactive visualization of the Big Ten football teams on Twitter
JavaScript
3
star
24

biped-hyperneat

ODE implementation of a walking biped robot with HyperNEAT evolving the neural controller
PHP
3
star
25

dissertation-topic-network

Dissertation topic network
3
star
26

big-data-hw

2
star
27

Intro-to-Evolutionary-Modeling

Material for teaching biologists to work with digital evolutionary models.
2
star
28

rmagic-tutorial

A brief tutorial showing how Rmagic can be used in IPython Notebook.
2
star
29

marriage-divorce-stats

144 years of marriage and divorce in 1 chart
HTML
1
star
30

EvoRoboCodeGECCO2013

Description of our EvoRoboCode competition submission to GECCO 2013.
1
star
31

drug-alcohol-mentions

1
star
32

2014-01-30-mit

Software Carpentry bootcamp at Massachusetts Institute of Technology on January 30-31, 2014
Python
1
star
33

betting-game

Game Theory: betting game
C++
1
star
34

temp-repo

HTML
1
star
35

eos-old

Evolution of Swarming Platform
C++
1
star
36

ipython-example

Example notebook showing how to do statistics in IPython Notebook.
Python
1
star
37

AMT-biped-analysis

1
star
38

eos-active-perception

EOS with agents who have to actively perceive the environment with a fine-grained retina.
C++
1
star