Kaushik Rajan (@KaushikRajanRK)
  • Stars
    star
    2
  • Global Rank 2,219,069 (Top 77 %)
  • Followers 2
  • Registered almost 7 years ago
  • Most used languages
    R
    50.0 %
    Java
    50.0 %
  • Location 🇮🇪 Ireland
  • Country Total Rank 3,835
  • Country Ranking
    R
    142
    Java
    793

Top repositories

1

Data-Mining-on-Customer-Churn-Classification

Implemented multiple classifiers to classify if a customer will leave or stay with the company based on multiple independent variables. Tools used: > RStudio for Exploratory data analysis, Data Pre-processing and building the models > Tableau and RStudio for Visualization > LATEX for documentation Machine learning models used: > Random Forest > C5.0 > Decision tree > Neural Network > K-Nearest Neighbour > Naive Bayes > Support Vector Machine Methodology: CRISP-DM
R
1
star
2

Analysis-of-Crime-Big-Data-using-MapReduce

Analyzed Crime Big data of Washington DC to solve the following business queries: > Which hour has the highest crime count? > Which shift has the highest crime count? > Year wise crime count > Hour wise crime count > Crime count by an offense > Average of Shift wise crime count The data was initially stored in MySql which was then moved to HDFS using SQOOP, from where 4 MapReduce operations are doing using JAVA in Eclipse IDE. The outputs of the queries are then moved to HBase using SQOOP. Two more MapReduce operations are done using PIG, the output of which is also moved to HBase using SQOOP. All the outputs were then moved to the local system and are visualized using RStudio and Tableau. Tools used: > MySQL, HDFS and HBase to store the data > SCOOP to move the data from one database to another > JAVA (Eclipse IDE) and PIG to run the MapReduce queries > RStudio for data pre-processing and visualization > Tableau for visualization > LATEX for Documentation
Java
1
star