  • Stars: 13
  • Rank: 1,512,713 (Top 30%)
  • Language: Python
  • Created over 8 years ago
  • Updated about 8 years ago

Repository Details

Spark (PySpark) script that applies dynamic time warping to energy usage data, using the Python fastdtw package.
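The repository's script is not reproduced on this page, so the following is only a minimal sketch of what dynamic time warping computes. It is the classic exact O(n*m) dynamic-programming version, not the fastdtw package, which computes an approximation of the same quantity. The function name dtw_distance and the sample readings are hypothetical and chosen purely for illustration.

```python
def dtw_distance(a, b):
    """Exact dynamic time warping distance between two numeric sequences.

    Illustrative sketch only; this is not the repository's code and it
    does not use fastdtw.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])  # pointwise distance
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = step + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# Hypothetical energy readings (made-up values): usage_b is usage_a with
# its first reading held for one extra time step.
usage_a = [1.0, 2.0, 3.0, 2.0, 1.0]
usage_b = [1.0, 1.0, 2.0, 3.0, 2.0, 1.0]

print(dtw_distance(usage_a, usage_b))
```

Because the second series is just a time-stretched copy of the first, the warping path absorbs the delay and the distance is zero, whereas a pointwise comparison of the two series would not even be defined for unequal lengths.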

More Repositories

1. Spark
   Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References
   Jupyter Notebook, 71 stars

2. SparkHBaseExample
   Spark code to analyze HBase Snapshots
   Scala, 35 stars

3. network_topology_analysis
   Code to collect and analyze traceroute data within a network topology
   Scala, 25 stars

4. HDP_Tuning_Unofficial
   Collection of HDP Tuning Tricks & Tips (unofficial guide)
   Python, 17 stars

5. SparkPhoenix
   Spark Example using Phoenix to interact with HBase
   Scala, 14 stars

6. Datasets
   Interesting Public Datasets
   11 stars

7. Apache_NiFi
   Code, projects, and references for Apache NiFi
   Python, 10 stars

8. Google-Cloud-Scripts
   Google Cloud Platform Scripts
   Python, 9 stars

9. docker_containers
   Docker Containers with HDP Services/Code (Spark, Kafka, NiFi, Solr, TensorFlow...)
   JavaScript, 6 stars

10. DL_Image_Classification
    Deep Learning Image Classification - Scripts and Links
    JavaScript, 5 stars

11. iaa-2023
    Institute for Advanced Analytics, 2023
    Jupyter Notebook, 5 stars

12. Apache_Hive
    Apache Hive (SQL on Hadoop) Syntax, Cheatsheet, and Projects
    Python, 4 stars

13. iaa-2022
    Institute for Advanced Analytics, 2022
    Jupyter Notebook, 4 stars

14. iaa_2020
    Institute for Advanced Analytics - 2020
    Python, 3 stars

15. hive_udf
    Apache Hive - UDF Example with Python
    Python, 3 stars

16. Hortonworks_Hackathon_Ad_Server
    Hortonworks Hackathon - Ad Server Assets
    JavaScript, 3 stars

17. python
    Python Scripts, Tricks, and References
    Python, 2 stars

18. nfl_predictions
    NFL Predictions (WebApp with PySpark)
    JavaScript, 2 stars

19. iaa_2021
    Institute for Advanced Analytics 2021
    Jupyter Notebook, 2 stars

20. Disaster_Recovery
    Resources, tricks, and recommendations for DR (Disaster Recovery) Hadoop clusters
    Shell, 1 star

21. Sqoop
    Sqoop - Bulk Load Data into HDFS, Hive, HBase, etc.
    1 star

22. sas_esp
    SAS Event Stream Processing
    Python, 1 star

23. Apache-Ranger
    Hadoop Security and Policy Management - Syntax, Tricks, and Resources
    Python, 1 star

24. cloud-endpoints
    Cloud Run API Backend
    Shell, 1 star

25. gcp_dataflow
    Google Dataflow - Scripts and References
    Python, 1 star

26. gcp-data-streaming
    Google Cloud Data Streaming Architecture
    Python, 1 star

27. HBase_Phoenix
    Apache HBase & Phoenix Scripts and Code Examples
    Shell, 1 star

28. sas
    SAS Scripts
    SAS, 1 star

29. Google-ML
    Google Machine Learning Script and Assets
    Python, 1 star

30. video-processing
    GCP Video Processing with Speech to Text
    HTML, 1 star

31. ML-Model-Deployment
    Scripts, Tips, and Tricks for Deploying ML and Deep Learning Models into Production
    Shell, 1 star

32. Hortonworks_Installation
    Hortonworks DataFlow (HDF) Installation/Config, Scripts, and Tricks
    Shell, 1 star

33. GPUs_Tensorflow
    GPU Scripts to set up and run TensorFlow (and other DL/ML libraries)
    Shell, 1 star

34. hortonworks_hdf_workshop
    Hortonworks HDF Workshop Vagrant Image
    1 star

35. video_analysis
    Real-time Video Analysis, Object Detection
    Python, 1 star

36. Cloud-DevOps
    Google Cloud DevOps Scripts and References
    Python, 1 star

37. genai-text-to-3d-mesh
    Kubernetes deployment for text to 3D mesh using OpenAI Point-E
    HCL, 1 star

38. Apache-Atlas
    Hadoop Data Lineage and Metadata - Configuration, Scripts, and Tricks
    Python, 1 star