Data Minded (@datamindedbe)

Top repositories

1

lighthouse

Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.
Scala
60
star
2

tpcds-dbt-duckdb

This repository contains the tpcds queries together with the code required to run this benchmark for dbt and duckdb
HCL
17
star
3

webinar-containers

HCL
14
star
4

elections2024-website

TypeScript
12
star
5

python-and-spark-for-data-analysis

A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course given by Patrick Varilly to one of our clients in December 2015
Jupyter Notebook
11
star
6

sync-upgrade

Python
5
star
7

spark_on_azure_batch_demo

Python
5
star
8

dihub-python-for-data-scientists-2015

Presentation, notebooks and supporting files for Meetup "Python for Data Scientists", given by Patrick Varilly at the European Data Innovation Hub in Brussels on Thu 17 Sep 2015.
Jupyter Notebook
5
star
9

llm-hackathon

Jupyter Notebook
5
star
10

nmbs-realtime-feed

Python
4
star
11

train-occupancy

Predicts NMBS train occupancy based on irail data and survey data from a mobile app
Python
3
star
12

conveyor-samples

Samples on how to use Conveyor.
Jupyter Notebook
3
star
13

iceberg-ingestion

Public repository containing sample code for how to improve ETL ingestion processes with Apache Iceberg
Python
3
star
14

pyhouse

Port of Lighthouse for pyspark
Python
3
star
15

nifi-dataminded-bundle

Java
3
star
16

wharlord

Scala
2
star
17

terraform-provider-conveyor

2
star
18

platform-quack-quack-ka-ching

The duck escapes with theย credits.
2
star
19

homebrew-conveyor-formulas

Brew tap repository for Conveyor
Python
2
star
20

dbt-testing-hackaton

Python
1
star
21

climate

This is a demo repo that reads xco2 data from the ACOS GOSAT project from NASA and loads it into elasticsearch
Python
1
star
22

conveyor-roadmap

This is the public roadmap for Conveyor.
1
star
23

academy_airflow

Exercises for the Data Minded Academy course on Apache Airflow
Python
1
star
24

academy_linux

Shell
1
star
25

academy_git

Python
1
star
26

gcp-foundation-example

HCL
1
star
27

datafy-cobrademo

A demo of running Cobra in Datafy
Python
1
star
28

conveyor-templates

Cookiecutter templates used by Conveyor.
Python
1
star