Data Minded (@datamindedbe)

Top repositories

1

lighthouse

Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.
Scala
60
star
2

tpcds-dbt-duckdb

This repository contains the tpcds queries together with the code required to run this benchmark for dbt and duckdb
HCL
15
star
3

webinar-containers

HCL
13
star
4

elections2024-website

TypeScript
12
star
5

python-and-spark-for-data-analysis

A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course given by Patrick Varilly to one of our clients in December 2015
Jupyter Notebook
11
star
6

sync-upgrade

Python
5
star
7

spark_on_azure_batch_demo

Python
5
star
8

dihub-python-for-data-scientists-2015

Presentation, notebooks and supporting files for Meetup "Python for Data Scientists", given by Patrick Varilly at the European Data Innovation Hub in Brussels on Thu 17 Sep 2015.
Jupyter Notebook
5
star
9

llm-hackathon

Jupyter Notebook
5
star
10

nmbs-realtime-feed

Python
4
star
11

train-occupancy

Predicts NMBS train occupancy based on irail data and survey data from a mobile app
Python
3
star
12

conveyor-samples

Samples on how to use Conveyor.
Jupyter Notebook
3
star
13

pyhouse

Port of Lighthouse for pyspark
Python
3
star
14

nifi-dataminded-bundle

Java
3
star
15

wharlord

Scala
2
star
16

terraform-provider-conveyor

2
star
17

homebrew-conveyor-formulas

Brew tap repository for Conveyor
Python
2
star
18

iceberg-ingestion

Public repository containing sample code for how to improve ETL ingestion processes with Apache Iceberg
Python
2
star
19

dbt-testing-hackaton

Python
1
star
20

climate

This is a demo repo that reads xco2 data from the ACOS GOSAT project from NASA and loads it into elasticsearch
Python
1
star
21

academy_linux

Shell
1
star
22

conveyor-roadmap

This is the public roadmap for Conveyor.
1
star
23

academy_airflow

Exercises for the Data Minded Academy course on Apache Airflow
Python
1
star
24

academy_git

Python
1
star
25

gcp-foundation-example

HCL
1
star
26

datafy-cobrademo

A demo of running Cobra in Datafy
Python
1
star
27

conveyor-templates

Cookiecutter templates used by Conveyor.
Python
1
star