• Stars
    star
    6
  • Rank 2,539,965 (Top 51 %)
  • Language
    Jupyter Notebook
  • Created over 5 years ago
  • Updated over 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Final project of my immersive course in Data Science at General Assembly. It consisted of a lot of Web Scraping + several regression techniques to predict current value of football players. Conclusions inside.

More Repositories

1

Networks_Analysis_plus_Recommendations_system

This project explores the classic MovieLens dataset, first from a networks perspective, analyzing the relationship between users and movies. Later, in the main part of the project, we built and evaluate several Recommendations Systems.
Jupyter Notebook
17
star
2

SQL_practice_and_application

Some simple notebooks to show basic SQL skills through Pandas
Jupyter Notebook
11
star
3

Simple_web_scraper

Simple web scraper using a little bit of regex to obtain a list of books and authors
Jupyter Notebook
10
star
4

NLP_with_20newsgroups

In this brief project we're gonna explore a few NLP tools using a Sklearn dataset and the following modelling techniques: bag of words, Hashing and TF-IDF vectorizer.
Jupyter Notebook
10
star
5

Dealing_with_class_imbalance

In this repository you'll find a theoretical introduction to the problem of class imbalance, as well as a notebook with examples about how to use some of the algorithms mentioned in the theoretical guide.
Jupyter Notebook
8
star
6

Spark_theoretical_practical_application

In this repository I'll be exploring in deep three labs from my Immersive Course in Data Science about Spark including some basic map reduce and SQL-Spark operations, as well as a bit of modelling through Spark
Jupyter Notebook
6
star
7

Basic_skills_with_python

First project ever done with Python about data structures, functions and some statistics/probability to describe and refine a Pokemon gameplay.
Jupyter Notebook
4
star
8

Random_forest_theoretical_practical_application

This brief project explores first the theoretical background behind Random Forest, followed by its application with the Boston Housing dataset
Jupyter Notebook
3
star
9

From_job_posts_to_salaries_classification

For this project I explored different machine learning classification models to predict four salary categories for Data Science job posts using publications from Indeed.co.uk. The goal was to obtain an accuracy of 0.8 or higher on both the train and test group, which implies predicting correctly at least 80% of the total population.
Jupyter Notebook
3
star
10

Predicting_house_prices_with_ames_dataset

This projected aimed to estimate the sale price of properties based on their "fixed" characteristics, such as neighborhood, lot size, number of stories, etc. In second place, I tried to estimate the value of possible changes and renovations to properties from the variation in sale price not explained by the fixed characteristics. The goal was to estimate the potential return on investment when making specific improvements to properties. This project uses the Ames housing data recently made available on Kaggle.
Jupyter Notebook
2
star
11

SES_Uruguay

This repository and notebook try to analyze the Socioeconomic Status in Uruguay bases on data from a national survey
Jupyter Notebook
1
star
12

Several_things

This is my repository to save ideas, bunches of code, things I don't wanna lose, etc.
Jupyter Notebook
1
star