Sahil Bhange (@sahilbhange)
  • Stars
    star
    57
  • Global Rank 311,131 (Top 11 %)
  • Followers 21
  • Following 17
  • Registered about 7 years ago
  • Most used languages
    Python
    46.2 %
    R
    30.8 %
    Scala
    7.7 %
  • Location πŸ‡ΊπŸ‡Έ United States
  • Country Total Rank 63,906
  • Country Ranking
    Scala
    1,098
    R
    3,839

Top repositories

1

hive-sql-slowly-changing-dimension

Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered hive table performance comparison
Python
16
star
2

Facebook-Data-Extraction

#DataPipeLine #ETL - Created is a Facebook data extraction utility to extract the publicly available data on Facebook. Used Facebook Graph API and Python to extract the data and loaded the data into the CSV files for further analysis.
Python
14
star
3

spark-slowly-changing-dimension

Spark implementation of Slowly Changing Dimension type 2
Scala
11
star
4

YouTube-comments-Spam-Detector

YouTube Spam comments classifier using Naive Bayes and SVM
Jupyter Notebook
3
star
5

Spark-Practice-Repository

Apache Spark practice (Core API, Data Frames and Spark SQL) using Python
Python
2
star
6

Defensive-Forecasting

Defensive Forecasting is an online forecasting technique for Binary Labels
Python
2
star
7

No-Show-Patients-Analysis

Model to predict patients who will likely to miss the booked appointment using logistic regression and random forest machine learning techniques
R
2
star
8

Machine-Learning-Assignments-1

Machine learning course assignments
R
1
star
9

Kiva-Loan-Data-Warehouse

#Pyspark#HDFS#Spark#DataAnalysis - Kiva loan data mart will be used to transform and analyse Kiva loan data and to help understand lenders targeted loan community
Python
1
star
10

Stayzilla-Operation-Failure-Analysis

The project is to analyze the operation failure of the Stayzilla.com (an Airbnb like startup in India)
R
1
star
11

Exploratory-and-Descriptive-Analysis-on-Loan-Dataset

The project is to conduct a set of exploratory analysis and performing various machine learning techniques to predict loan borrower’s default rate with that we have tried various data visualization techniques to show data distribution
R
1
star
12

Intro-to-Data-Science-Assignment

Introduction to Data Science course Assignments
Jupyter Notebook
1
star
13

Text-Language-Detector

Language Detector program using Google Translator API
Python
1
star