• Stars
    star
    4
  • Rank 3,296,530 (Top 66 %)
  • Language
    Python
  • Created almost 8 years ago
  • Updated almost 8 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

How to use zip and gzip files in Apache Spark

More Repositories

1

jupyter-cadquery

An extension to render cadquery objects in JupyterLab via pythreejs
Python
257
star
2

three-cad-viewer

A CAD viewer component based on three.js
JavaScript
178
star
3

vscode-ocp-cad-viewer

A viewer for OCP based Code-CAD (CadQuery, build123d) integrated into VS Code
Python
96
star
4

spark-yarn-rest-api

Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation
Python
24
star
5

Spark-ETL-Atlas

A small project to show how to add lineage to Atlas when using Spark as ETL tool
Jupyter Notebook
12
star
6

cadquery-massembly

A manual assembly system based on mates
Python
11
star
7

cadquery-jupyter-extension

An extension to view X3DOM content created by CadQuery 2.x
Jupyter Notebook
9
star
8

ssh_ipykernel

A remote jupyter kernel via ssh
Python
9
star
9

cad-viewer-widget

An ipywidget based interface to the Javascript three-cad-viewer
Python
8
star
10

vscode-cadquery-viewer

A viewer for CadQuery integrated into VS Code based on three-cad-viewer
TypeScript
7
star
11

Spark-Masterclass

Shell
7
star
12

db-12-vscode

Send code blocks (Python, SQL, Scala, R) to a Databricks cluster
TypeScript
6
star
13

ranger-options

Helper to set some Apache Ranger 0.6 options
Python
5
star
14

Alg123d

Python
4
star
15

zeppelin-ipython-shim

A thin layer to enable visualisation libraries like Bokeh to draw into the browser using the IPython display system calls
Python
4
star
16

audio-fingerprinting-dejavu

Audio fingerprinting algorithm based on https://github.com/worldveil/dejavu ported to scala
Scala
4
star
17

kubernetes-on-bare-metal

Shell
3
star
18

zeppelin2md

Convert Zeppelin Notebooks to Github Markdown as Readme.md
Python
2
star
19

20newsgroups-spark

2
star
20

db-12-kernel

Python
2
star
21

tiny-hadoop

A pseudo-distributed HDFS with YARN, Spark 2.3 and Oozie in a docker container
Shell
2
star
22

advanced-angular-for-pyspark

Adding a way to call javascript methods from Python in Apache Zeppelin
Python
2
star
23

nvd3-stat

Python
1
star
24

mlflow-experiments

Jupyter Notebook
1
star
25

kerberos-hdfs

A podman pod with kdc, kereberized hdfs and spark
Shell
1
star
26

spark-app.g8

Scala
1
star
27

jupyter-cadquery-widgets

Widgets for jupyter-cadquery
JavaScript
1
star
28

ocp-tessellate

Tessellate OCP (https://github.com/cadquery/OCP) objects to use with threejs
Python
1
star
29

cadquery-vtk-viewer

Experimental vtk custom Juypter widget
JavaScript
1
star
30

zeppelin-visualizations

Python
1
star
31

Log-Over-Http

A logger to post log messages via HTTP to a central web server that renders it (useful for distributed programs, e.g. Soark on YARN)
Scala
1
star