• Stars
    star
    5
  • Rank 2,861,937 (Top 57 %)
  • Language
    Python
  • License
    Creative Commons ...
  • Created over 1 year ago
  • Updated over 1 year ago

Repository Details

Various ways to deploy PySpark code to EMR, and how the EMR CLI can make it all as easy as a single command.

More Repositories

1

metabase-athena-driver

An Amazon Athena driver for Metabase 0.32 and later
Clojure
225
star
2

athena-sqlite

A SQLite driver for S3 and Amazon Athena 😳
Python
96
star
3

mwhich

Generic API to search for movies or TV shows across Netflix, Hulu, iTunes, and Amazon Video on Demand
Ruby
73
star
4

faker-cli

Command-line interface to quickly generate fake CSV and JSON data
Python
72
star
5

duckdb-athena-extension

An experimental Athena extension for DuckDB 🐀
Rust
49
star
6

modern-data-lake-storage-layers

Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work
Jupyter Notebook
47
star
7

demo-code

Bits of code I use during live demos
Jupyter Notebook
28
star
8

damons-data-lake

All the code related to building my own data lake
CSS
22
star
9

athena-federation-python-sdk

Unofficial Python SDK for Athena Federation
Python
16
star
10

golang-sse-demo

A brief demo of real-time plotting with Plotly, Go, and server-sent events
Go
15
star
11

ci-cd-serverless-spark

Demo for GitHub Universe 2022
Python
12
star
12

emr-serverless-sql-cli

An experimental tool for running SQL on EMR Serverless
Python
8
star
13

go-meerkat

Meerkat API documentation and Go client
Go
8
star
14

dm-whacker

A bookmarklet to automatically delete Twitter Direct Messages
JavaScript
8
star
15

s3-diff-uploader

Python code to demonstrate differential uploading of files to S3.
Python
6
star
16

zoomit

Launch Zoom meetings in a single click 🖱
Go
5
star
17

metabase-trino-driver

Trino Driver for Metabase
Clojure
5
star
18

firejab

A simple Campfire to Jabber bridge
Ruby
5
star
19

s3mpty

A batteries-included tool for deleting the contents of versioned S3 buckets.
Go
5
star
20

redpill

A simple script to get my base OS X system up and running
Ruby
5
star
21

macdownloads

Repository of Downloads for OS X
4
star
22

notatsxsw

A combination of jealousy and rage resulted in a Google AppEngine proxy that would filter out SxSW tweets.
Python
4
star
23

ideas

Damon's Ideas
3
star
24

is-remote

A journal of my adventures in remote work
3
star
25

tweepml

TweepML is an XML format used to represent a list of Tweeps (Twitter users)
Ruby
3
star
26

ugrep

Hacked up shell script to grep in UTF-16 files
Shell
3
star
27

emr-job-templates

A sample repository of production-ready Spark code for use with Amazon EMR.
Python
2
star
28

syslog-to-athena

Use Fluentd to send syslogs to Athena for great querying
Dockerfile
2
star
29

metabase-datasette-driver

A Datasette driver for Metabase
Clojure
2
star
30

athena-query-stats

Query your Athena query history using Athena 🙆‍♂️
Python
2
star
31

slugplot

Weather visualization to show change in average temperature over time.
Jupyter Notebook
2
star
32

athena-excel

Python
2
star
33

jupyter-static-website

A way to continuously deploy Jupyter notebooks to a static website backed by S3.
Jupyter Notebook
2
star
34

emr-eks-airflow2-plugin

An experimental Airflow 2.0 plugin for EMR on EKS
Python
2
star
35

spark-local-environment

An example of using EMR Serverless container image for local environment
Dockerfile
2
star
36

log4j-us

Dynamic log4j generator
HTML
1
star
37

ziply-dsl-monitor

My DSL was severely broken...so I graphed it.
HTML
1
star
38

forklift

Forklift your cargo into different places 🚚
Go
1
star
39

sample-code

Various code bits I run into
Java
1
star
40

emr-eks-terraform

Example of deploying EMR on EKS with Terraform
HCL
1
star
41

choirmaster

Go-based poller for dynamic data sources to make them sing with choir.io
Go
1
star
42

airflow-example-dags

Example dags for airflow experimentation
Python
1
star
43

athena-gmail

Athena Gmail connector
Python
1
star
44

spark-tweeter

I know ... you always wanted your Spark jobs to be able to tweet, right?
Go
1
star
45

byteable-calc

A byte-size HTML/JS calculator for making big numbers human-readable
HTML
1
star
46

cargo-crates

An easy way to build data extractors in Docker.
Python
1
star