There are no reviews yet. Be the first to send feedback to the community and the maintainers!
DocumentUnderstanding
Research papers and code on information extraction from image/pdfDataQuality
Tutorial and examples of Data Quality in Big Data SystemR2Time
R connector for OpenTSDB: Analyzing large time-series data in R environment using data-intensive capabilities.ICDM2015
3rd Prize In the ICDM 2015 Drawbridge Cross-Device ChallengenifiIoT
kaggleCompetition1
Machine learning competition samplesRTNiFiStreamProcessors
IoT MQTT sensor stream capture for Apache NiFichatbot
Tensorflow seq2seq chatbot in pythonScriptsDebian
Scripts for Ubuntu/DebianRhipe-1
Cool datamining projectopentsdb_spark
Scalable and distributed OpenTSDB spark connector using python and pandasRestaurantRevenuePrediction
Prediction Revenue for new location based on location and demographyML-examples
Different machine learning examplescoursera-exploratory-data-analysis
coursera-exploratory-data-analysis-courseKagglePokerRule
ProductClassification
Otto Group Product Classification Challengebnpanel
Automatically exported from code.google.com/p/bnpanelSparkExample
Spark examplesCoursera-Practical-Machine-Learning
PDHC
Secure deletion in Hadoop ClusterkaggleCompetition
Different kaggle competitions using machine learning techniques.walmart-recuriting-sales
Walmart Recruiting - Sales in Stormy WeatherFailurePrediction
Failure Prediction in Hadoop ClusterRCAanalysis
RFKaggleTitanic
Predict survival on the Titanic (using Random Forests) in RgoogleTraceAnalysis
Google released cluster dataset on year 2011. Based on this dataset we perform several different analysis on Spark.ANN-ML
Neural network for data analysisMR-TSDB
This framework uses MapReduce programming model to read data from OpenTSDBtwitter_context
Extracting context from twitter APILove Open Source and this site? Check out how you can help us