• Stars
    star
    1
  • Language
    Scala
  • Created over 6 years ago
  • Updated over 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

This project is an analysis of usefulness of Stack Overflow using K-means Cluster Algorithm(A BSP algorithm)

More Repositories

1

hadoop_cricket_analysis

Cricket Data Analytics Using HDFS and Mapreduce APIs
Java
4
star
2

Kafka_Spark_Streaming

This repository consists of code written in scala which takes in streaming data from kafka consumer client and runs using spark-submit to catch on the streaming data
Scala
2
star
3

Analysis-of-Hollywood-Movies-Using-HDFS-and-Mapreduce-APIs

This project is an analysis of the number of hollywood movies made from 1913 to 2014 using as HDFS as file distribution system and using Mapreduce Framework as execution engine.
Java
2
star
4

CSIR_scrapping

It includes data scrapping work using BeautifulSoup(Python) in CSIR-CDRI internship.It also includes work involved of data cleansing and visualization of data given in form of excel sheets which is first cleansed using xlrd module and then visulaized using Matplotlib.
Jupyter Notebook
1
star
5

Peer_to_Peer_Chatbox

This project is a peer to peer chatbox using a mysql database and is built in Java using Swing for developing GUI.
Java
1
star
6

UpGrad_Big_Data_Task

Java
1
star
7

Kafka_Consumer_Producer_Scripts

This repository consists of Kafka custom Consumer and Producer clients written using Kafka APIs in Java
Java
1
star
8

anand

CSS
1
star
9

Competitive_Programming

It consists of some competitive programming questions and questions for practice solved before
Java
1
star
10

Analysis_of_Time_usage_using-_SparkSQL

This project is an analysis of time usage using SparkSQL and other Spark APIs
Scala
1
star
11

Decision-Tree-Through-Spark

This project is used to predict weather forecast(low humidity days - susceptitbility for forest fire) through implementation of a Decision-Tree Supervised learning algorithm on Spark Execution Engine over Databricks Cluster (its Community Cloud Service)
Jupyter Notebook
1
star
12

Clustering_Weather_Dataset

Cluster Analysis on a Weather Dataset to identify different Weather patterns using K-Means Clustering Algorithm Using Spark Execution Engine over a Databricks Cluster
Jupyter Notebook
1
star
13

EEFL_Hackathon_CCMS

This repository consists of challenges 1 and 2 prototype model for the CCMS .
Jupyter Notebook
1
star
14

loadshedder

loadshedder
Go
1
star
15

zookeeper_Znode_Operations_Using_Java_APIs

This repository consists of set of operations performed on Znodes using Zookeeper APIs.
Java
1
star