• Stars
    star
    35
  • Rank 751,178 (Top 15 %)
  • Language
  • License
    MIT License
  • Created over 5 years ago
  • Updated over 4 years ago

Repository Details

Cloudera_Material: Study material to help people prepare for the Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.

More Repositories

1

Udacity-Data-Engineering-Projects

A few projects related to data engineering, including data modeling, cloud infrastructure setup, data warehousing, and data lake development.
Python
1,488
star
2

goodreads_etl_pipeline

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Python
1,286
star
3

Optimizing-Public-Transportation

A real-time event pipeline built around the Kafka ecosystem for the Chicago Transit Authority.
Python
29
star
4

Big_Data_Project

Fake news detection: feature extraction using vectorizers such as Count Vectorizer, TF-IDF Vectorizer, and Hash Vectorizer, followed by an ensemble model that classifies whether a news item is fake.
Python
17
star
5

Spark_Packaged_project

This project contains PySpark jobs that create data pipelines and shows how to distribute the project package on a cluster.
Python
6
star
6

SF-Crime-Statistics

A Kafka and Spark Streaming integration project: SF crime statistics with Spark Streaming.
Python
3
star
7

IPL-analysis-with-Python-Pandas

This project provides an analysis of IPL (Indian Premier League) stats from 2008 to 2017.
Jupyter Notebook
2
star
8

Uppaal_Model_Checking

Model Checking For Automated Machine Learning Models
q
2
star
9

Yelp_Project

This project creates a data lake for the Yelp dataset, then uses it to build an analytical sandbox for data science and a data warehouse for reporting.
Jupyter Notebook
2
star
10

SOEN_6441

A multiplayer board Risk Game.
Java
1
star
11

Black-Friday-Sales-Analysis

This project gives insight into a few statistics related to Black Friday sales.
Jupyter Notebook
1
star