• Stars
    star
    1
  • Language
    Jupyter Notebook
  • Created over 5 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Data Preprocessing is a data mining technique that involves transforming raw data into an understandable format. Real-world data is often incomplete, inconsistent, and/or lacking in certain behaviors or trends, and is likely to contain many errors. Data Preprocessing is a proven method of resolving such issues

More Repositories

1

07.-Pandas

Work All Scenario Of Pandas Library For Data Analysis
Jupyter Notebook
3
star
2

19.-Analyzing-Police-Activity-With-Python-And-Pandas

Now that you have learned the foundations of pandas, this course will give you the chance to apply that knowledge by answering interesting questions about a real dataset! You will explore the Stanford Open Policing Project dataset and analyze the impact of gender on police behavior. During the course, you will gain more practice cleaning messy data, creating visualizations, combining and reshaping datasets, and manipulating time series data. Analyzing Police Activity with pandas will give you valuable experience analyzing a dataset from start to finish, preparing you for your data science career!
Jupyter Notebook
3
star
3

29.-Logistic-Regression

Jupyter Notebook
2
star
4

14.-Seaborn

Data Visualization With python most attractive library seaborn
Jupyter Notebook
1
star
5

20.-Python-For-Data-Analysis-Pandas

Jupyter Notebook
1
star
6

02.-Python-For-Data-Analysis---Numpy

Jupyter Notebook
1
star
7

27.-Data-Capstone-Project

HTML
1
star
8

21.-Panda-Excercises

Jupyter Notebook
1
star
9

24.-Pandas-Built-in-Data---Viz

Jupyter Notebook
1
star
10

04.-Data-Visualization-With-Seaborn

Jupyter Notebook
1
star
11

22.-Data-Visualization-With-Matplotlib

Jupyter Notebook
1
star
12

13.-Data-Science-For-All

Including all the modules of python
Jupyter Notebook
1
star
13

03.-What-Your-Name-Telling-You

In this project, you will explore the baby names dataset compiled by the Social Security Administration. The data (name, year of birth, sex and number) are from a 100 percent sample of Social Security card applications after 1879. This dataset is a minehouse and contains lots of interesting stories to explore. In this project, you will explore some of these interesting stories around trendiness in names, and estimating a person's age from their name.
Jupyter Notebook
1
star
14

26.-Graphical-Plotting

Jupyter Notebook
1
star
15

25.-Plotly-And-Cufflinks

Jupyter Notebook
1
star
16

28.-Linear-Regression

Jupyter Notebook
1
star
17

23.-Data-Visualization-With-Seaborn

Jupyter Notebook
1
star
18

15.-Starting-With-Statistical-Numpy

Jupyter Notebook
1
star
19

08.-Matplotlib

Practices
Jupyter Notebook
1
star
20

06.-Prediction-Model

National Stock Exchange- Tata Global Price Close Model Prediction
Jupyter Notebook
1
star
21

01-Python-Crash-Course

Jupyter Notebook
1
star
22

17.-Machine-Learning---Part-02---Regression-Model

In the previous chapter, you made use of image and political datasets to predict binary as well as multiclass outcomes. But what if your problem requires a continuous outcome? Regression, which is the focus of this chapter, is best suited to solving such problems. You will learn about fundamental concepts in regression and apply them to predict the life expectancy in a given country using Gapminder data.
Jupyter Notebook
1
star
23

18.-Merging-DataFrame-With-Pandas

As a Data Scientist, you'll often find that the data you need is not in a single file. It may be spread across a number of text files, spreadsheets, or databases. You want to be able to import the data of interest as a collection of DataFrames and figure out how to combine them to answer your central questions. This course is all about the act of combining, or merging, DataFrames, an essential part of any working Data Scientist's toolbox. You'll hone your pandas skills by learning how to organize, reshape, and aggregate multiple data sets to answer your specific questions.
Jupyter Notebook
1
star
24

05.-Customizing-Numpy-Framework

NumPy is a Python package to efficiently do data science. Learn to work with the NumPy array, a faster and more powerful alternative to the list, and take your first steps in data exploration.
Jupyter Notebook
1
star
25

09.-Introduction-To-Python-For-Data-Science

Python is a general-purpose programming language that is becoming more and more popular for doing data science. Companies worldwide are using Python to harvest insights from their data and get a competitive edge. Unlike any other Python tutorial, this course focuses on Python specifically for data science. In our Intro to Python class, you will learn about powerful ways to store and manipulate data as well as cool data science tools to start your own analyses.
Jupyter Notebook
1
star
26

16.-Statistical-Plots-With-Seaborn

This is a high-level tour of the Seaborn plotting library for producing statistical graphics in Python. The tour covers Seaborn tools for computing and visualizing linear regressions as well as tools for visualizing univariate distributions (e.g., strip, swarm, and violin plots) and multivariate distributions (e.g., joint plots, pair plots, and heatmaps). This also includes a discussion of grouping categories in plots.
Jupyter Notebook
1
star
27

11.-Intermediate-Python-For-Data-Science

The intermediate python course is crucial to your data science curriculum. Learn to visualize real data with matplotlib's functions and get to know new data structures such as the dictionary and the Pandas DataFrame. After covering key concepts such as boolean logic, control flow and loops in Python, you're ready to blend together everything you've learned to solve a case study using hacker statistics.
Jupyter Notebook
1
star