• Stars
    star
    609
  • Rank 71,131 (Top 2 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created over 5 years ago
  • Updated about 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A Case Study Approach to Successful Data Science Projects Using Python, Pandas, and Scikit-Learn

GitHub issues GitHub forks GitHub stars PRs Welcome

Data Science Projects with Python

Data Science Projects with Python is designed to give you practical guidance on industry-standard data analysis and machine learning tools in Python, with the help of realistic data. The course will help you understand how you can use pandas and Matplotlib to critically examine a dataset with summary statistics and graphs, and extract the insights you seek to derive. You will continue to build on your knowledge as you learn how to prepare data and feed it to machine learning algorithms, such as regularized logistic regression and random forest, using the scikit-learn package. You’ll discover how to tune the algorithms to provide the best predictions on new and, unseen data. As you delve into later chapters, you’ll be able to understand the working and output of these algorithms and gain insight into not only the predictive capabilities of the models but also their reasons for making these predictions.

Data Science Projects with Python by Stephen Klosterman

What you will learn

  • Install the required packages to set up a data science coding environment
  • Load data into a Jupyter notebook running Python
  • Use Matplotlib to create data visualizations
  • Fit a model using scikit-learn
  • Use lasso and ridge regression to regularize the model
  • Fit and tune a random forest model and compare performance with logistic regression
  • Create visuals using the output of the Jupyter notebook
  • Use k-fold cross-validation to select the best combination of hyperparameters

Hardware requirements

For an optimal student experience, we recommend the following hardware configuration:

  • Processor: Intel Core i5 or equivalent
  • Memory: 4 GB RAM or higher
  • Storage: 35 Gb or higher

Software requirements

  • OS: Windows 7 SP1 64-bit, Windows 8.1 64-bit or Windows 10 64-bit, Ubuntu Linux, or the latest version of OS X
  • Browser: Google Chrome/Mozilla Firefox Latest Version
  • Notepad++/Sublime Text as IDE (Optional, as you can practice everything using Jupyter notecourse on your browser)
  • Python 3.4+ (latest is recommended) installed (from https://python.org)
  • Python libraries as needed (Jupyter, Numpy, Pandas, Matplotlib, BeautifulSoup4, and so on)

More Repositories

1

SQL-for-Data-Analytics

Perform fast and efficient data analysis with the power of SQL
Jupyter Notebook
253
star
2

Data-Science-for-Marketing-Analytics

Achieve your marketing goals with the data analytics power of Python
Jupyter Notebook
200
star
3

Data-Wrangling-with-Python

Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Jupyter Notebook
118
star
4

Python-API-Development-Fundamentals

Develop a full-stack web application with Python and Flask
Python
117
star
5

Data-Visualization-with-Python

Discover how Matplotlib and Seaborn can help clearly communicate and present your newly acquired insight
Jupyter Notebook
101
star
6

Interactive-Data-Visualization-with-Python

Present your data as an effective and compelling story
Jupyter Notebook
95
star
7

Applied-Deep-Learning-with-Python

Applied Deep Learning with Python, published by Packt
HTML
75
star
8

Data-Structures-and-Algorithms-in-Java

Sharpen your problem solving skills by learning core computer science concepts in a pain-free manner
Java
70
star
9

CPP-Data-Structures-and-Algorithm-Design-Principles

Leverage the power of modern C++ to build robust and scalable applications
C++
69
star
10

Advanced-CPlusPlus

Master the technique of confidently writing robust C++ code
C++
52
star
11

Data-Science-with-Python

Combine Python with machine learning principles to discover hidden patterns in raw data
Jupyter Notebook
49
star
12

Natural-Language-Processing-Fundamentals

Use Python and NLTK to build out your own text classifiers and solve common NLP problems
Jupyter Notebook
45
star
13

Beginning-Application-Development-with-TensorFlow-and-Keras

Learn to design, develop, train, and deploy TensorFlow and Keras models as real-world applications
Jupyter Notebook
42
star
14

Kubernetes-Design-Patterns-and-Extensions-eLearning

Master container cluster management skills and transform the way you develop and deploy applications
Go
40
star
15

Professional-JavaScript

Fast-track your web development career using the powerful features of advanced JavaScript
JavaScript
39
star
16

Deep-Learning-for-Natural-Language-Processing

Solve your natural language processing problems with smart deep neural networks
Jupyter Notebook
35
star
17

Big-Data-Analysis-with-Python

Combine Spark and Python to process large datasets and unlock the power of parallel computing and machine learning
Jupyter Notebook
28
star
18

Advanced-JavaScript

Discover how you can accelerate your web development with all of the powerful features and benefits of modern JavaScript
JavaScript
26
star
19

Cpp-Fundamentals

Hit the ground running with C++
C++
23
star
20

Applied-Deep-Learning-with-PyTorch

Demystify neural networks with PyTorch
Jupyter Notebook
22
star
21

Applied-Unsupervised-Learning-with-Python

Discover hidden patterns and relationships in unstructured data with Python
Jupyter Notebook
22
star
22

Artificial-Intelligence-and-Machine-Learning-Fundamentals

Get started with the development of real-world applications that are powered by the latest AI advances
Python
21
star
23

Spring-Boot-2-Fundamentals

Learn how you can quickly build and deploy production-ready microservices within the Java and JVM ecosystem
Java
19
star
24

Command-Line-Fundamentals

Learn to use the Unix command-line tools and Bash shell scripting
Shell
19
star
25

Machine-Learning-Fundamentals

Use Python and scikit-learn to get up and running with the hottest developments in machine learning
Jupyter Notebook
18
star
26

Applied-Data-Science-with-Python-and-Jupyter

Use powerful industry-standard tools to unlock new, actionable insights from your data
HTML
17
star
27

Applied-Deep-Learning-with-Keras

Solve complex real-life problems with the simplicity of Keras
Jupyter Notebook
17
star
28

Professional-Azure-SQL-Database-Administration-Second-Edition

Equip yourself with the skills required to manage and maintain data on the Cloud.
C#
16
star
29

Advanced-Serverless-Architectures-with-Microsoft-Azure

Design complex serverless systems quickly with the scalability and benefits of Azure
C#
13
star
30

MongoDB-Express-Angular-and-Node.js-Fundamentals

A practical guide to the tried-and-true production-ready MEAN stack, with additional tips, tricks, and best practices
CSS
13
star
31

Beginning-Frontend-Development-with-React

Simplify your frontend development workflow and enhance the user experience of your applications with React
JavaScript
13
star
32

Machine-Learning-with-AWS

Learn how you can use the power of cloud services for your own machine learning and artificial intelligence projects
Python
13
star
33

Data-Science-Projects-With-Python-Second-Edition

Jupyter Notebook
12
star
34

Beginning-Jenkins

Accelerate deliverables, manage builds, and automate pipelines with Continuous Integration
Python
12
star
35

BeginningAPIDevelopmentwithNode.js

Build highly scalable, developer-friendly APIs for the modern web with JavaScript and Node.js
JavaScript
12
star
36

Serverless-Architectures-with-AWS

Discover how you can migrate from traditional deployments to serverless architectures with AWS
JavaScript
12
star
37

Beginning-DevOps-with-Docker

Automate the deployment of your environment with the power of the Docker toolchain
Dockerfile
12
star
38

Data-Science-for-Marketing-Analytics-elearning

Achieve your marketing goals with the data analytics power of Python
Jupyter Notebook
11
star
39

Artificial-Intelligence-and-Machine-Learning-Fundamentals-eLearning

Learn to develop real-world applications powered by the latest advances in intelligent systems
Python
10
star
40

Server-Side-Enterprise-Development-with-Angular

Use Angular Universal to pre-render your web pages, improving SEO and application UX
TypeScript
10
star
41

The-Modern-Cpp-Challenger-eLearning

Become an expert C++ programmer by solving real-world programming problems
C++
9
star
42

BuildingGoogleCloudPlatformSolutions

Build, deploy, maintain, and monitor applications on the Google Cloud Platform (GCP)
JavaScript
9
star
43

Applied-Supervised-Learning-with-R

Use machine learning libraries of R to build models that solve problems and predict business trends
Jupyter Notebook
9
star
44

Java-Fundamentals

A fast-paced and pragmatic introduction to one of the world's most popular programming languages
Java
9
star
45

Julia-1-Programming-Complete-Reference-Guide

Discover Julia, a high performance language for technical computing
Jupyter Notebook
9
star
46

Beginning-React

Simplify your frontend development workflow and enhance the user experience of your applications with React
JavaScript
8
star
47

Kubernetes-Workshop

Learn how to build and run cloud-native applications running atop Kubernetes.
Go
8
star
48

Introduction-to-DevOps-with-Kubernetes

Build scalable cloud-native applications using DevOps patterns created with Kubernetes
HTML
8
star
49

Beginning-PHP

Master the latest features of PHP 7 and fully embrace modern PHP development
PHP
8
star
50

MySQL-Workshop

A simple, low tech approach to quickly get you working with MySQL with real world examples from those who use it every day.
TSQL
7
star
51

The-Cpp-Workshop

Cut through the noise and get real results in this hands-on, step-by-step approach to learning C++.
C++
7
star
52

The-Java-Workshop

Java
7
star
53

Professional-SQL-Server-High-Availability-and-Disaster-Recovery

Implement tried-and-true high availability and disaster recovery solutions with SQL Server
PowerShell
7
star
54

Professional-Scala

Combine object-oriented and functional programming to build high-performance applications
XSLT
7
star
55

Beginning-GraphQL

Fetch data faster and more efficiently whilst improving the overall performance of your web application
JavaScript
7
star
56

The-Python-Workshop

Become a proficient, productive and powerful programmer with Python
Jupyter Notebook
7
star
57

Cpp-Fundamentals-eLearning

Hit the ground running with C++
C++
7
star
58

Applied-Data-Visualization-with-R-and-ggplot2

Use ggplot2 and R to help clearly visualize, communicate and present your data to a wide range of audiences
R
7
star
59

Applied-Unsupervised-Learning-with-R

Uncover hidden relationships and patterns with k-means clustering, hierarchical clustering, and PCA
R
7
star
60

Big-Data-Processing-with-Apache-Spark-eLearning

Efficiently tackle large datasets and perform big data analysis with Spark and Python
Python
7
star
61

R-Programming-Fundamentals

Learn how to clean, analyse and present your datasets within the R ecosystem
R
6
star
62

BuildingMicroserviceswithJavaScript

Learn quick and practical methods for developing microservices
HTML
6
star
63

The-Data-Science-Workshop

Jupyter Notebook
6
star
64

Beginning-Data-Science-with-Python-and-Jupyter

Use powerful industry-standard tools within Jupyter and the Python ecosystem to unlock new, actionable insights from your data
HTML
6
star
65

Python-Fundamentals

A practical guide for learning Python, complete with real-world projects for you to explore
Python
6
star
66

Data-Science-Projects-with-Python-eLearning

A case study approach to successful data science projects using Python pandas and scikit learn
Jupyter Notebook
6
star
67

Big-Data-Analysis-with-Python-eLearning

Combine Spark and Python to process large datasets and unlock the power of parallel computing and machine learning
Jupyter Notebook
6
star
68

Applied-Supervised-Learning-with-Python

Code files for Supervised Learning with Python, published by Packt
Jupyter Notebook
5
star
69

Ultimate-Godot-Game-Developer-Projects-eLearning

Master game building by creating cross-platform 2D and 3D games with Godot
GDScript
5
star
70

Beginning-Modern-CSharp-and-DotNet-Development-eLearning

Build your understanding of the C# language and the .NET Core Framework to create robust cross-platform applications
C#
5
star
71

Beginning-ASP_DOT_NET

Discover how you can build and optimize web applications with ASP.NET Core
C#
5
star
72

Natural-Language-Processing-Fundamentals-eLearning

Build intelligent applications that can interpret the human language to deliver impactful results
Jupyter Notebook
5
star
73

Selenium-Fundamentals-eLearning

Perform browser and Web UI automation using Selenium
HTML
5
star
74

Professional-Scala-eLearning

Combine object-oriented and functional programming to build high-performance applications
XSLT
5
star
75

Beginning-Vue.js_eLearning

Build modern interactive JavaScript applications with this progressive front-end web development framework
Vue
4
star
76

The-PHP-Workshop

Cut through the noise and get real results in this hands-on, step-by-step approach to learning PHP
PHP
4
star
77

Beginning-Server-Side-Application-Development-with-Angular

Discover how to rapidly prototype SEO-friendly web applications with Angular Universal
TypeScript
4
star
78

The-React-Workshop

JavaScript
4
star
79

Building-Computer-Vision-Projects-with-OpenCV-4-and-CPP

Develop real-world computer vision and image processing applications with the latest features of OpenCV 4
Jupyter Notebook
4
star
80

Data-Visualization-with-Python-eLearning

Create an impact with meaningful data insights using interactive and engaging visuals
Jupyter Notebook
4
star
81

Beginning-Selenium

Speed up your internal testing with web browser and user interface automation powered by Selenium
4
star
82

Splunk-7-Essentials-eLearning

Build an elaborate Splunk enterprise environment that will extract powerful insights from your machine-generated big data
PowerShell
3
star
83

Beginning-ASP.Net-eLearning

Discover how you can build and optimize web applications with ASP.NET Core
C#
3
star
84

Keras-2.X-Projects-eLearning

Leverage the power of Keras to build and train state-of-the-art deep learning models
Jupyter Notebook
3
star
85

Professional-Node.js

Create modular and scalable applications with ease by using the best features of Node.js
HTML
3
star
86

The-Ruby-Workshop

Cut through the noise and get real results in this hands-on, step-by-step approach to learning Ruby
Ruby
3
star
87

Serverless-Architectures-with-Kubernetes

Create production-ready Kubernetes clusters and run serverless applications on them
Go
3
star
88

PythonFundamentals

Python
3
star
89

Ultimate-Java-Masterclass-eLearning

Meet one of the most flexible programming languages that can help you build great cross-platform applications
Java
3
star
90

Build-modern-web-applications-with-.NET

Learn the basics of .NET using C# and ASP.NET MVC and apply the tools and techniques to quickly and efficiently build robust applications.
JavaScript
3
star
91

Cloud-Native-Continuous-Integration-and-Delivery

Build, test, and deploy cloud-native applications in the cloud-native way
Go
3
star
92

Beginning-Java-Data-Structures-and-Algorithms

Beginning Java Data Structures and Algorithms
Java
3
star
93

Big-Data-Processing-with-Apache-Spark

Efficiently tackle large datasets and big data analysis challenges using Apache Spark and Python
Python
3
star
94

Complete-Guide-to-Kotlin

Kotlin
3
star
95

Datasets-of-Master-Data-Science-with-Python

Datasets of Master Data Science with Python, published by Packt. For the complete code files, visit https://github.com/TrainingByPackt/Data-Science-with-Python
3
star
96

Beginning-Machine-Learning-with-AWS-eLearning

Explore the power of cloud services for your machine learning and artificial intelligence projects
Python
3
star
97

Python-Fundamentals-eLearning

A practical guide for learning Python, complete with real-world projects for you to explore
Python
3
star
98

Beginning-Vue.js

Build modern interactive JavaScript applications with this progressive front-end web development framework
JavaScript
3
star
99

Cloud-Native-Applications-in-Java-eLearning

Build highly scalable microservice-based applications with Java for the cloud
Python
2
star
100

Intelligent-Projects-Using-Python-eLearning

9 real-world AI projects leveraging machine learning and deep learning with TensorFlow and Keras
Python
2
star