Kaggle - Regression
"Those who cannot remember the past are condemned to repeat it." -- George Santayana
This is a compiled list of Kaggle competitions and their winning solutions for regression problems.
The purpose to complie this list is for easier access and therefore learning from the best in data science.
Literature review is a crucial yet sometimes overlooked part in data science. To avoid reinventing the wheels and get inspired on how to preprocess, engineer, and model the data, it's worth spend 1/10 to 1/5 of the project time just researching how people deal with similar problems/datasets.
Time spent on literature review is time well spent.
This is only one list of the whole compilation. For other lists of competitions and solutions, please refer to:
- Kaggle - Classification
- Kaggle - Sequence
- Kaggle - Image
- Kaggle - Miscellaneous
Hope the compilation can save you efforts and offer you insights. Enjoy!
======
Grupo Bimbo Inventory Demand
Wed 8 Jun 2016 - Tue 30 Aug 2016
Maximize sales and minimize returns of bakery goods
======
Kobe Bryant Shot Selection
Fri 15 Apr 2016 β Mon 13 Jun 2016
Which shots did Kobe sink?
======
Home Depot Product Search Relevance
Mon 18 Jan 2016 β Mon 25 Apr 2016
Predict the relevance of search results on homedepot.com
======
Rossmann Store Sales
Wed 30 Sep 2015 β Mon 14 Dec 2015
Forecast sales using store, promotion, and competitor data
======
How Much Did It Rain? II
Thu 17 Sep 2015 β Mon 7 Dec 2015
Predict hourly rainfall using data from polarimetric radars
======
Caterpillar Tube Pricing
Mon 29 Jun 2015 β Mon 31 Aug 2015
Model quoted prices for industrial tube assemblies
======
Liberty Mutual Group: Property Inspection Prediction
Mon 6 Jul 2015 β Fri 28 Aug 2015
Quantify property hazards before time of inspection
======
ECML/PKDD 15: Taxi Trip Time Prediction (II)
Fri 24 Apr 2015 β Wed 1 Jul 2015
Predict the total travel time of taxi trips based on their initial partial trajectories
======
Bike Sharing Demand
Wed 28 May 2014 β Fri 29 May 2015
Forecast use of a city bikeshare system
======
Walmart Recruiting II: Sales in Stormy Weather
Wed 1 Apr 2015 β Mon 25 May 2015
Walmart challenges participants to accurately predict the sales of 111 potentially weather-sensitive products (like umbrellas, bread, and milk) around the time of major weather events at 45 of their retail locations.
======
How Much Did It Rain?
Fri 9 Jan 2015 β Fri 15 May 2015
Predict probabilistic distribution of hourly rain given polarimetric radar measurements
======
Restaurant Revenue Prediction
Mon 23 Mar 2015 β Mon 4 May 2015
Predict annual restaurant sales based on objective measurements
======
Finding Elo
Mon 20 Oct 2014 β Mon 23 Mar 2015
Predict a chess player's FIDE Elo rating from one game
======
Africa Soil Property Prediction Challenge
Wed 27 Aug 2014 β Tue 21 Oct 2014
Predict physical and chemical properties of soil using spectral measurements
======
Liberty Mutual Group - Fire Peril Loss Cost
Tue 8 Jul 2014 β Tue 2 Sep 2014
Predict expected fire losses for insurance policies
======
Walmart Recruiting - Store Sales Forecasting
Thu 20 Feb 2014 β Mon 5 May 2014
In this recruiting competition, job-seekers are provided with historical sales data for 45 Walmart stores located in different regions. Each store contains many departments, and participants must project the sales for each department in each store.
======
PAKDD 2014 - ASUS Malfunctional Components Prediction
Sun 26 Jan 2014 β Tue 1 Apr 2014
Predict malfunctional components of ASUS notebooks
======
Loan Default Prediction - Imperial College London
Fri 17 Jan 2014 β Fri 14 Mar 2014
Constructing an optimal portfolio of loans
======
See Click Predict Fix
Sun 29 Sep 2013 β Wed 27 Nov 2013
Predict which 311 issues are most important to citizens
======
AMS 2013-2014 Solar Energy Prediction Contest
Mon 8 Jul 2013 β Fri 15 Nov 2013
Forecast daily solar energy with an ensemble of weather models
======
The Big Data Combine Engineered by BattleFin
Fri 16 Aug 2013 β Tue 1 Oct 2013
Predict short term movements in stock prices using news and sentiment data provided by RavenPack
======
See Click Predict Fix - Hackathon
Sat 28 Sep 2013 β Sun 29 Sep 2013
Predict which 311 issues are most important to citizens
======
RecSys2013: Yelp Business Rating Prediction
Wed 24 Apr 2013 β Sat 31 Aug 2013
RecSys Challenge 2013: Yelp business rating prediction
======
Yelp Recruiting Competition
Wed 27 Mar 2013 β Sun 30 Jun 2013
The goal of this competition is to estimate the number of Useful votes a review will receive.
======
dunnhumby & hack/reduce Product Launch Challenge
Sat 11 May 2013 β Sat 11 May 2013
The success or failure of a new product launch is often evident within the first few weeks of sales. Can you predict a product's destiny?
======
ICDAR2013 - Handwriting Stroke Recovery from Offline Data
Wed 20 Mar 2013 β Sat 20 Apr 2013
Predict the trajectory of a handwritten signature
======
Blue Book for Bulldozers
Fri 25 Jan 2013 β Wed 17 Apr 2013
Predict the auction sale price for a piece of heavy equipment to create a "blue book" for bulldozers.
======
Job Salary Prediction
Wed 13 Feb 2013 β Wed 3 Apr 2013
Predict the salary of any UK job ad based on its contents.
======
Observing Dark Worlds
Fri 12 Oct 2012 β Sun 16 Dec 2012
Can you find the Dark Matter that dominates our Universe? Winton Capital offers you the chance to unlock the secrets of dark worlds.
======
U.S. Census Return Rate Challenge
Fri 31 Aug 2012 β Sun 11 Nov 2012
Predict census mail return rates.
======
Global Energy Forecasting Competition 2012 - Wind Forecasting
Thu 6 Sep 2012 β Wed 31 Oct 2012
A wind power forecasting problem: predicting hourly power generation up to 48 hours ahead at 7 wind farms
======
Global Energy Forecasting Competition 2012 - Load Forecasting
Sat 1 Sep 2012 β Wed 31 Oct 2012
A hierarchical load forecasting problem: backcasting and forecasting hourly loads (in kW) for a US utility with 20 zones.
======
Raising Money to Fund an Organizational Mission
Wed 18 Jul 2012 β Tue 18 Sep 2012
Help worthy organizations more efficiently target and recruit loyal donors to support their causes.
======
Online Product Sales
Fri 4 May 2012 β Tue 3 Jul 2012
Predict the online sales of a consumer product based on a data set of product features.
======
Psychopathy Prediction Based on Twitter Usage
Mon 14 May 2012 β Fri 29 Jun 2012
Identify people who have a high degree of Psychopathy based on Twitter usage.
======
Benchmark Bond Trade Price Challenge
Fri 27 Jan 2012 β Mon 30 Apr 2012
Develop models to accurately predict the trade price of a bond.
======
EMC Data Science Global Hackathon (Air Quality Prediction)
Sat 28 Apr 2012 β Sun 29 Apr 2012
Build a local early warning systems to accurately predict dangerous levels of air pollutants on an hourly basis.
======
Algorithmic Trading Challenge
Fri 11 Nov 2011 β Sun 8 Jan 2012
Develop new models to accurately predict the market response to large trades.
======
Allstate Claim Prediction Challenge
Wed 13 Jul 2011 β Wed 12 Oct 2011
A key part of insurance is charging each customer the appropriate price for the risk they represent.
======
dunnhumby's Shopper Challenge
Fri 29 Jul 2011 β Fri 30 Sep 2011
Going grocery shopping, we all have to do it, some even enjoy it, but can you predict it? dunnhumby is looking to build a model to better predict when supermarket shoppers will next visit the store and how much they will spend.
======
Wikipedia's Participation Challenge
Tue 28 Jun 2011 β Tue 20 Sep 2011
This competition challenges data-mining experts to build a predictive model that predicts the number of edits an editor will make five months from the end date of the training dataset.
======
Mapping Dark Matter
Mon 23 May 2011 β Thu 18 Aug 2011
Measure the small distortion in galaxy images caused by dark matter
======
Deloitte/FIDE Chess Rating Challenge
Mon 7 Feb 2011 β Wed 4 May 2011
This contest, sponsored by professional services firm Deloitte, will find the most accurate system to predict chess outcomes, and FIDE will also bring a top finisher to Athens to present their system
======
RTA Freeway Travel Time Prediction
Tue 23 Nov 2010 β Sun 13 Feb 2011
This competition requires participants to predict travel time on Sydney's M4 freeway from past travel time observations.
======
Tourism Forecasting Part Two
Mon 20 Sep 2010 β Sun 21 Nov 2010
Part two requires competitors to predict 793 tourism-related time series. The winner of this competition will be invited to contribute a discussion paper to the International Journal of Forecasting.
======
Chess ratings - Elo versus the Rest of the World
Tue 3 Aug 2010 β Wed 17 Nov 2010
This competition aims to discover whether other approaches can predict the outcome of chess games more accurately than the workhorse Elo rating system.
======
Tourism Forecasting Part One
Mon 9 Aug 2010 β Sun 19 Sep 2010
Part one requires competitors to predict 518 tourism-related time series. The winner of this competition will be invited to contribute a discussion paper to the International Journal of Forecasting.
======
World Cup 2010 - Take on the Quants
Thu 3 Jun 2010 β Fri 11 Jun 2010
Quants at Goldman Sachs and JP Morgan have modeled the likely outcomes of the 2010 World Cup. Can you do better?
======
World Cup 2010 - Confidence Challenge
Thu 3 Jun 2010 β Fri 11 Jun 2010
The Confidence Challenge requires competitors to assign a level of confidence to their World Cup predictions.
======