• Stars
    star
    1
  • Language
    Scala
  • Created over 8 years ago
  • Updated over 8 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Taken the dataset from KDD 2012 cup which is of around 10gb. We have placed the training file on hadoop cluster. Initially, we have used pig to transform the data. Later, we have used Spark MLlib for dimensionality reduction and model building. Finally, evaluated the models using some evaluation metrics.