• Stars
    5
  • Rank 2,861,937 (Top 57 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created 9 months ago
  • Updated 8 months ago

Repository Details

More Repositories

1

lighthouse

Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.
Scala
60
star
2

tpcds-dbt-duckdb

This repository contains the tpcds queries together with the code required to run this benchmark for dbt and duckdb
HCL
17
star
3

webinar-containers

HCL
14
star
4

elections2024-website

TypeScript
12
star
5

python-and-spark-for-data-analysis

A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course given by Patrick Varilly to one of our clients in December 2015
Jupyter Notebook
11
star
6

sync-upgrade

Python
5
star
7

spark_on_azure_batch_demo

Python
5
star
8

dihub-python-for-data-scientists-2015

Presentation, notebooks and supporting files for Meetup "Python for Data Scientists", given by Patrick Varilly at the European Data Innovation Hub in Brussels on Thu 17 Sep 2015.
Jupyter Notebook
5
star
9

nmbs-realtime-feed

Python
4
star
10

train-occupancy

Predicts NMBS train occupancy based on irail data and survey data from a mobile app
Python
3
star
11

conveyor-samples

Samples on how to use Conveyor.
Jupyter Notebook
3
star
12

iceberg-ingestion

Public repository containing sample code for how to improve ETL ingestion processes with Apache Iceberg
Python
3
star
13

pyhouse

Port of Lighthouse for pyspark
Python
3
star
14

nifi-dataminded-bundle

Java
3
star
15

wharlord

Scala
2
star
16

terraform-provider-conveyor

2
star
17

platform-quack-quack-ka-ching

The duck escapes with the credits.
2
star
18

homebrew-conveyor-formulas

Brew tap repository for Conveyor
Python
2
star
19

dbt-testing-hackaton

Python
1
star
20

climate

This is a demo repo that reads xco2 data from the ACOS GOSAT project from NASA and loads it into elasticsearch
Python
1
star
21

conveyor-roadmap

This is the public roadmap for Conveyor.
1
star
22

academy_airflow

Exercises for the Data Minded Academy course on Apache Airflow
Python
1
star
23

academy_linux

Shell
1
star
24

academy_git

Python
1
star
25

gcp-foundation-example

HCL
1
star
26

datafy-cobrademo

A demo of running Cobra in Datafy
Python
1
star
27

conveyor-templates

Cookiecutter templates used by Conveyor.
Python
1
star