There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Networks_Analysis_plus_Recommendations_system
This project explores the classic MovieLens dataset, first from a networks perspective, analyzing the relationship between users and movies. Later, in the main part of the project, we built and evaluate several Recommendations Systems.SQL_practice_and_application
Some simple notebooks to show basic SQL skills through PandasSimple_web_scraper
Simple web scraper using a little bit of regex to obtain a list of books and authorsNLP_with_20newsgroups
In this brief project we're gonna explore a few NLP tools using a Sklearn dataset and the following modelling techniques: bag of words, Hashing and TF-IDF vectorizer.Dealing_with_class_imbalance
In this repository you'll find a theoretical introduction to the problem of class imbalance, as well as a notebook with examples about how to use some of the algorithms mentioned in the theoretical guide.Market_value_football_players
Final project of my immersive course in Data Science at General Assembly. It consisted of a lot of Web Scraping + several regression techniques to predict current value of football players. Conclusions inside.Spark_theoretical_practical_application
In this repository I'll be exploring in deep three labs from my Immersive Course in Data Science about Spark including some basic map reduce and SQL-Spark operations, as well as a bit of modelling through SparkBasic_skills_with_python
First project ever done with Python about data structures, functions and some statistics/probability to describe and refine a Pokemon gameplay.Random_forest_theoretical_practical_application
This brief project explores first the theoretical background behind Random Forest, followed by its application with the Boston Housing datasetFrom_job_posts_to_salaries_classification
For this project I explored different machine learning classification models to predict four salary categories for Data Science job posts using publications from Indeed.co.uk. The goal was to obtain an accuracy of 0.8 or higher on both the train and test group, which implies predicting correctly at least 80% of the total population.Predicting_house_prices_with_ames_dataset
This projected aimed to estimate the sale price of properties based on their "fixed" characteristics, such as neighborhood, lot size, number of stories, etc. In second place, I tried to estimate the value of possible changes and renovations to properties from the variation in sale price not explained by the fixed characteristics. The goal was to estimate the potential return on investment when making specific improvements to properties. This project uses the Ames housing data recently made available on Kaggle.SES_Uruguay
This repository and notebook try to analyze the Socioeconomic Status in Uruguay bases on data from a national surveyLove Open Source and this site? Check out how you can help us