gonzaferreiro/Several_things

Stars
1
Language
Jupyter Notebook
Created over 5 years ago
Updated about 5 years ago

gonzaferreiro/Several_things

gonzaferreiro

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

This is my repository to save ideas, bunches of code, things I don't wanna lose, etc.

Networks_Analysis_plus_Recommendations_system

This project explores the classic MovieLens dataset, first from a networks perspective, analyzing the relationship between users and movies. Later, in the main part of the project, we built and evaluate several Recommendations Systems.

Jupyter Notebook

SQL_practice_and_application

Some simple notebooks to show basic SQL skills through Pandas

Jupyter Notebook

Simple_web_scraper

Simple web scraper using a little bit of regex to obtain a list of books and authors

Jupyter Notebook

NLP_with_20newsgroups

In this brief project we're gonna explore a few NLP tools using a Sklearn dataset and the following modelling techniques: bag of words, Hashing and TF-IDF vectorizer.

Jupyter Notebook

Dealing_with_class_imbalance

In this repository you'll find a theoretical introduction to the problem of class imbalance, as well as a notebook with examples about how to use some of the algorithms mentioned in the theoretical guide.

Jupyter Notebook

Market_value_football_players

Final project of my immersive course in Data Science at General Assembly. It consisted of a lot of Web Scraping + several regression techniques to predict current value of football players. Conclusions inside.

Jupyter Notebook

Spark_theoretical_practical_application

In this repository I'll be exploring in deep three labs from my Immersive Course in Data Science about Spark including some basic map reduce and SQL-Spark operations, as well as a bit of modelling through Spark

Jupyter Notebook

Basic_skills_with_python

First project ever done with Python about data structures, functions and some statistics/probability to describe and refine a Pokemon gameplay.

Jupyter Notebook

Random_forest_theoretical_practical_application

This brief project explores first the theoretical background behind Random Forest, followed by its application with the Boston Housing dataset

Jupyter Notebook

From_job_posts_to_salaries_classification

For this project I explored different machine learning classification models to predict four salary categories for Data Science job posts using publications from Indeed.co.uk. The goal was to obtain an accuracy of 0.8 or higher on both the train and test group, which implies predicting correctly at least 80% of the total population.

Jupyter Notebook

Predicting_house_prices_with_ames_dataset

This projected aimed to estimate the sale price of properties based on their "fixed" characteristics, such as neighborhood, lot size, number of stories, etc. In second place, I tried to estimate the value of possible changes and renovations to properties from the variation in sale price not explained by the fixed characteristics. The goal was to estimate the potential return on investment when making specific improvements to properties. This project uses the Ames housing data recently made available on Kaggle.

Jupyter Notebook

SES_Uruguay

This repository and notebook try to analyze the Socioeconomic Status in Uruguay bases on data from a national survey

Jupyter Notebook