  • Stars: 3,899
  • Rank: 10,806 (Top 0.3%)
  • Language: Jupyter Notebook
  • License: GNU General Publi...
  • Created about 10 years ago
  • Updated 9 months ago

Repository Details

Text and supporting code for Think Stats, 2nd Edition

ThinkStats2

Order Think Stats from Amazon.com.

Download this book in PDF.

Read this book online.

Read the related blog, Probably Overthinking It.

Think Stats is an introduction to Statistics and Data Science for Python programmers. If you have basic skills in Python, you can use them to learn concepts in probability and statistics and practical skills for working with data.

  • This book emphasizes simple techniques you can use to explore real data sets and answer interesting questions.

  • It includes case studies using datasets from the National Institutes of Health and other sources.

  • Many of the exercises use short programs to run experiments and help readers develop understanding.

This book is available under a Creative Commons license, which means that you are free to copy, distribute, and modify it, as long as you attribute the source and don't use it for commercial purposes.

Working with the code

The easiest way to work with this code is to run it on Colab, which is a free service that runs Jupyter notebooks in a web browser. For every chapter, I provide two notebooks: one contains the code from the chapter and the exercises; the other also contains the solutions.

Chapter 1:

Examples from the chapter

Solutions to exercises

Chapter 2:

Examples from the chapter

Solutions to exercises

Chapter 3:

Examples from the chapter

Solutions to exercises

Chapter 4:

Examples from the chapter

Solutions to exercises

Chapter 5:

Examples from the chapter

Solutions to exercises

Chapter 6:

Examples from the chapter

Solutions to exercises

Chapter 7:

Examples from the chapter

Solutions to exercises

Chapter 8:

Examples from the chapter

Solutions to exercises

Chapter 9:

Examples from the chapter

Solutions to exercises

Chapter 10:

Examples from the chapter

Solutions to exercises

Chapter 11:

Examples from the chapter

Solutions to exercises

Chapter 12:

Examples from the chapter

Solutions to exercises

Chapter 13:

Examples from the chapter

Solutions to exercises

Chapter 14:

Examples from the chapter

Solutions to exercises

If you want to run these notebooks on your own computer, you can download them individually from GitHub or download the entire repository in a Zip file.
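The Zip-file route can also be scripted. The following is a minimal sketch, not the author's own tooling: the repository owner (`AllenDowney`), repository name (`ThinkStats2`), and branch (`master`) are assumptions not stated in the text above, and the helper names `archive_url` and `download_and_extract` are hypothetical. It relies on GitHub's branch-archive URL pattern and on the Python standard library only.

```python
import io
import urllib.request
import zipfile
from pathlib import Path


def archive_url(owner: str, repo: str, branch: str) -> str:
    """Build the URL GitHub serves for a Zip snapshot of one branch."""
    return f"https://github.com/{owner}/{repo}/archive/refs/heads/{branch}.zip"


def download_and_extract(url: str, dest: Path) -> list:
    """Download a Zip archive into memory and unpack it under dest.

    Returns the names of the archive members that were extracted.
    """
    with urllib.request.urlopen(url) as response:
        payload = response.read()
    with zipfile.ZipFile(io.BytesIO(payload)) as archive:
        archive.extractall(dest)
        return archive.namelist()


# Assumed values: owner, repository name, and branch are illustrative guesses.
URL = archive_url("AllenDowney", "ThinkStats2", "master")

# Usage (needs network access, so it is left as a comment here):
#   names = download_and_extract(URL, Path("."))
#   print(f"Extracted {len(names)} entries")
```

If the default branch has a different name, only the third argument to `archive_url` needs to change.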

I developed this book using Anaconda, which is a free Python distribution that includes all the packages you'll need to run the code (and lots more). I found Anaconda easy to install. By default it does a user-level installation, so you don't need administrative privileges. You can download it here.

More Repositories

1. ThinkDSP [Jupyter Notebook, 3,476 stars]. Think DSP: Digital Signal Processing in Python, by Allen B. Downey.
2. ThinkPython2 [TeX, 2,378 stars]. LaTeX source and supporting code for Think Python, 2nd edition, by Allen Downey.
3. ThinkBayes [TeX, 1,627 stars]. Code repository for Think Bayes.
4. ThinkBayes2 [Jupyter Notebook, 1,617 stars]. Text and code for the forthcoming second edition of Think Bayes, by Allen Downey.
5. ThinkPython [PostScript, 930 stars]. Code examples and exercise solutions from Think Python by Allen Downey, published by O'Reilly Media.
6. ModSimPy [HTML, 797 stars]. Text and supporting code for Modeling and Simulation in Python.
7. ThinkComplexity2 [Jupyter Notebook, 706 stars]. Book and code for Think Complexity, 2nd edition.
8. ThinkOS [TeX, 526 stars]. Text and supporting code for Think OS: A Brief Introduction to Operating Systems, by Allen Downey.
9. ThinkDataStructures [TeX, 510 stars]. LaTeX source and supporting code for Think Data Structures: Algorithms and Information Retrieval in Java.
10. ThinkJavaCode [Java, 362 stars]. Supporting code for Think Java by Allen Downey and Chris Mayfield.
11. ElementsOfDataScience [Jupyter Notebook, 334 stars]. An introduction to data science in Python, for people with no programming experience.
12. BayesMadeSimple [Jupyter Notebook, 330 stars]. Code for a tutorial on Bayesian Statistics by Allen Downey.
13. LittleBookOfSemaphores [TeX, 237 stars]. LaTeX source and supporting code for The Little Book of Semaphores, by Allen Downey.
14. CompStats [Jupyter Notebook, 215 stars]. Code for a workshop on statistical inference using computational methods in Python.
15. empiricaldist [Jupyter Notebook, 148 stars]. Python library that represents empirical distribution functions.
16. BiteSizeBayes [Jupyter Notebook, 124 stars]. An introduction to Bayesian statistics using Python and (coming soon) R.
17. DSIRP [Jupyter Notebook, 121 stars]. Data Structures and Information Retrieval in Python.
18. ThinkCPP [PostScript, 111 stars]. Text and code for Think C++ by Allen Downey.
19. ExercisesInC [C, 100 stars]. Exercises for people learning the C programming language.
20. ThinkComplexity [PostScript, 96 stars]. Code for Allen Downey's book Think Complexity, published by O'Reilly Media.
21. Swampy [Python, 85 stars]. Code for Swampy, a set of modules used in Think Python, first edition.
22. AstronomicalData [Jupyter Notebook, 85 stars]. An introduction to working with astronomical data in Python.
23. PhysicalModelingInMatlab [TeX, 83 stars]. Text and code for Physical Modeling in MATLAB.
24. ProbablyOverthinkingIt [Jupyter Notebook, 82 stars]. Supplementary material for my book, Probably Overthinking It.
25. ThinkPythonItalian [TeX, 81 stars]. LaTeX source for the Italian Translation of Think Python.
26. DataExploration [Python, 71 stars]. Supporting code for a video series on best practices for exploratory data analysis.
27. BayesianDecisionAnalysis [Jupyter Notebook, 63 stars]. Repository for a workshop on Bayesian Decision Analysis.
28. ExploratoryDataAnalysis [Jupyter Notebook, 63 stars]. Repository for an online class on Exploratory Data Analysis in Python.
29. ThinkJava [TeX, 56 stars]. LaTeX source for Think Java, 1st edition, by Allen Downey and Chris Mayfield.
30. SurvivalAnalysisPython [Jupyter Notebook, 48 stars]. Explorations of survival analysis in Python.
31. BayesForUndergrads [46 stars]. Materials for a workshop on developing undergraduate classes on Bayesian statistics.
32. DataScience [HTML, 42 stars]. Site for a Data Science class taught by Allen Downey.
33. ComplexityScience [Jupyter Notebook, 35 stars]. Repository for a workshop on Complexity Science.
34. ThinkX [Python, 30 stars].
35. BayesSeminar [Jupyter Notebook, 30 stars]. Bayesian statistics seminars.
36. ThinkStats3 [29 stars]. Code and LaTeX source for Think Stats, third edition.
37. BayesianInferencePyMC [Jupyter Notebook, 26 stars]. Workshop on Bayesian inference using PyMC.
38. PoliticalAlignmentCaseStudy [Jupyter Notebook, 23 stars]. Notebooks and data for a case study on political alignment, outlook, and beliefs.
39. ElementsOfDataScienceBook [TeX, 22 stars]. Repository for the manuscript of Elements of Data Science.
40. thinkjavasolutions5 [Java, 21 stars]. Automatically exported from code.google.com/p/thinkjavasolutions
41. blair-walden-project [19 stars]. The Blair Walden Project: in 1845 Henry David Thoreau went to live in the woods... a year later his journal was found.
42. Portfolio [HTML, 18 stars]. Portfolio of Allen Downey at Olin College.
43. ThinkPythonSolutions [Python, 17 stars]. Automatically exported from code.google.com/p/thinkpythonsolutions
44. ProbablyOverthinkingIt2 [Jupyter Notebook, 16 stars]. New repo for projects related to my blog, Probably Overthinking It.
45. MarriageNSFG [Stata, 15 stars]. Repository for a project using NSFG data to explore marriage patterns in the US.
46. clink [C, 12 stars]. A network measurement tool, described at http://allendowney.com/research/clink/
47. RecidivismCaseStudy [Jupyter Notebook, 11 stars]. Case study on evaluating statistical tools that predict recidivism.
48. SignalsAndSystemsAndDynamics [MATLAB, 10 stars]. Code and examples for an experimental class on signals, systems, and dynamics.
49. GssReligion [Jupyter Notebook, 10 stars]. Code and data for measuring and predicting religious affiliation using GSS data.
50. ModSim [Jupyter Notebook, 10 stars]. Modeling and Simulation in Python and MATLAB/Octave.
51. GunControlGenerational [Jupyter Notebook, 9 stars]. Data and analysis related to generational changes in attitudes toward gun control.
52. ThinkPerl6 [TeX, 9 stars]. Text and supporting code for Think Perl 6 by Laurent Rosenfeld with Allen Downey.
53. DataQnA [Jupyter Notebook, 9 stars]. Data Q&A: Questions and answers about data and statistics.
54. ModSimMatlab [Makefile, 8 stars]. Text and supporting code for Modeling and Simulation.
55. JavaOOP [Java, 8 stars]. Supporting code for the OOP in Java independent study.
56. DSIRPSolutions [Jupyter Notebook, 8 stars]. Solutions to the exercises in Data Structures and Information Retrieval in Python (DSIRP).
57. SoftwareSystems [C, 8 stars]. Repo for software related to Software Systems at Olin College.
58. ThinkBayes2Translations [Jupyter Notebook, 8 stars]. Translations of Think Bayes.
59. JupyterAsciidocTemplate [Jupyter Notebook, 7 stars]. Template for converting Jupyter notebooks to an asciidoc book.
60. internet-religion [Python, 6 stars]. Data and code for an analysis of Internet use and religious affiliation using data from the GSS.
61. plastex-oreilly [TeX, 6 stars]. Branch of plastex that generates DocBook 4.5 that meets O'Reilly style guidelines.
62. AtmoChem [Jupyter Notebook, 6 stars]. Atmospheric chemistry data and analysis.
63. TheShakes [Jupyter Notebook, 5 stars].
64. complexity [PostScript, 5 stars]. Automatically exported from code.google.com/p/complexity
65. PythonCounterPmf [Jupyter Notebook, 5 stars]. Examples using Python's Counter collection to implement a probability mass function (PMF).
66. FirstLateNSFG [Jupyter Notebook, 4 stars]. Data and analysis for "Are first babies more likely to be late?"
67. PythonFun [Jupyter Notebook, 4 stars].
68. ThinkJavaSequel [4 stars]. Text and supporting code for Think DS: Data Structures in Java, by Allen Downey.
69. matlabsolutions [MATLAB, 4 stars]. Automatically exported from code.google.com/p/matlabsolutions
70. ThinkOCaml [PostScript, 4 stars]. Automatically exported from code.google.com/p/thinkocaml
71. Notebooks [4 stars]. A repo for IPython notebooks.
72. ISSPRegression [Makefile, 4 stars]. Exploration of the data from the Crowdsourced Replication Initiative.
73. thinkjava5 [TeX, 3 stars]. Automatically exported from code.google.com/p/thinkapjava
74. plastex-docbook [Python, 3 stars]. DocBook renderer plugin templates and classes for the plasTeX engine.
75. GssExtract [Jupyter Notebook, 3 stars].
76. SoftwareDesign [Python, 3 stars]. Directories and unit tests for exercises in Software Design at Olin College.
77. InspectionParadox [Jupyter Notebook, 2 stars]. Code and data for an article on length-biased sampling and the inspection paradox.
78. OlinPyShop [2 stars]. Code for Python workshops from Olin College.
79. TeamAllocation [Python, 2 stars]. Code for making team allocations under constraints.
80. QEACode [2 stars]. Code for Quantitative Engineering Analysis (QEA) class at Olin College.
81. thinkpythonchinese [TeX, 2 stars]. Automatically exported from code.google.com/p/thinkpythonchinese
82. simulating [2 stars].
83. LongTailedDistributions [2 stars]. Data and code from a series of papers about long-tailed distributions in the Internet.
84. AfroBarometer [Jupyter Notebook, 1 star].
85. python-in-hydrology [1 star]. Automatically exported from code.google.com/p/python-in-hydrology
86. a-bad-synthesizer [Python, 1 star]. Arduino-based analog-digital synthesizer.
87. 2019-08-27-needham [Python, 1 star].