• Stars
    star
    32
  • Rank 779,532 (Top 16 %)
  • Language
    Scala
  • Created over 6 years ago
  • Updated about 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

More Repositories

1

evol

a python grammar for evolutionary algorithms and heuristics
Python
185
star
2

whirl

Fast iterative local development and testing of Apache Airflow workflows
Shell
179
star
3

dbt-excel

[DEPRECATED] A dbt adapter for Excel.
Python
85
star
4

iterative-broadcast-join

The iterative broadcast join example code.
Scala
69
star
5

pytest-dbt-core

Pytest plugin for dbt core
Python
52
star
6

pydantic-avro

This library can convert a pydantic class to a avro schema or generate python code from a avro schema.
Python
47
star
7

airflow-testing-examples

Python
46
star
8

jiraview

Extract data from JIRA through REST and create charts.
Python
35
star
9

rhyme-with-ai

Rhyme with AI
Python
35
star
10

pydantic-spark

Python
22
star
11

llm-archetype-batch-use-case

General solution to archetype LLM batch use case
Python
21
star
12

risk-analysis

Genetic algorithms and the game of Risk
Jupyter Notebook
19
star
13

piven

Prediction Intervals with specific value prediction
Python
16
star
14

build-your-own-search-engine

This repository contains code to build an MVP search engine with google like interface.
Python
16
star
15

dbt-data-ai-summit

Code that was used as an example during the Data+AI Summit 2020
15
star
16

airflow-training-skeleton

Skeleton project for Apache Airflow training participants to work on.
Python
15
star
17

airflow_workspace

Workspace for Airflow training, inlcuding docker and docker compose
Dockerfile
14
star
18

ansible_cluster

Instant Hadoop cluster with Ansible and Cobbler - Just Add Water.
Shell
13
star
19

python-devcontainer-template

Shows you how to use a Devcontainer for your Python project πŸ³β™‘πŸ
Python
12
star
20

airflow-helm

Smarty
11
star
21

doobie-monix-jdbc-example

Example project demonstrating easy, concise and typechecked JDBC access
Scala
10
star
22

ParallelConnection

Python
10
star
23

auto-tagger

Tagging texts with tags automatically
Python
9
star
24

asekuro

A utility tool to automate certain tasks with Jupyter notebooks.
Python
9
star
25

openllm-starter

Get started with open source LLMs on a GPU
Jupyter Notebook
8
star
26

druid-ansible

Ansible scripts to create druid cluster
Python
7
star
27

databricks-cdk

Deployment of databricks resources with cdk
Python
7
star
28

godatadriven-blog

Sources to our blog
HTML
6
star
29

organization-pr-scanner

Python
6
star
30

dropblox

Drop some Blox! The one who drops in the most efficient way wins! πŸ†
Jupyter Notebook
6
star
31

stackexchange-parquet

Spark job for converting the StackExchange Network data into parquet format.
Scala
5
star
32

taster-sessions

4
star
33

Kedro-Azureml-Starter

Python
4
star
34

c4-model-example

ASL
4
star
35

prometheus-kafka-offsets

Scala
4
star
36

datamesh

Material for the DataMesh presentation at GoDataFest 2021
Jupyter Notebook
4
star
37

feature_catalog

A package to define features and create them via a simple API
Python
4
star
38

github-contributions

Gather and analyse Github contributions with dbt-duckdb
TypeScript
4
star
39

private-package-in-gcp-tutorial

In this tutorial, we will register a package in GCP Artifact Registry both manually as well as with CICD. In the end, you will be able to install your own private package with pip just like you're used to.
Python
3
star
40

public_cloudera_on_azure

This is needed because the Azure template cannot read from a private github
Shell
3
star
41

code_breakfast_materialize_metabase

Building a real-time analytics dashboard with Materialize and Metabase
3
star
42

flink-streaming-xke

Example how to use Flink with Kafka
Java
3
star
43

pydantic-examples

3
star
44

godatadriven-vision

Computer vision with python and OpenCV
CSS
3
star
45

mlops-workshop

How to MLOps: Experiment tracking & deployment πŸ“Š
Jupyter Notebook
3
star
46

pr-scraper

Tracks our pull requests in public repositories
Python
2
star
47

monopoly-analysis

bigger simulations = moar profit
Python
2
star
48

code-breakfast-deep-learning

Material for PyData Code Breakfast: Introduction to Deep Learning
Jupyter Notebook
2
star
49

provision-nifi-hdinsight

Scripts to provision NiFi to HDInsight
Shell
2
star
50

os-training-materials

A selection of notebooks coming from the GoDataDriven trainings
HTML
2
star
51

dbt-bi-exposures

Python package that collects dbt exposure metadata from different BI providers such as Power BI, Tableau, Looker, Metabase, etc.
Python
2
star
52

duck-pond

A lightweight data lake using dbt-duckdb
2
star
53

ddsw-2018-dsp-workshop

Jupyter Notebook
2
star
54

azureml_experiment_tracking_tutorial

Python
2
star
55

hive-summary-loader

Scripts that load data into hive and create a summary for it
Python
2
star
56

clubcloud_dbt_soda

Repository for Club Cloud workshop on dbt + SodaSQL
Python
2
star
57

hadoop-ds-workshop

Hadoop Data Science workshop
Python
1
star
58

azure_function_python_remote_build

Example repository to show how a minimal Python application can be built and deployed remotely as Azure Function.
HCL
1
star
59

azure_function_terraform

HCL
1
star
60

data-centric-ai-hackathon

Jupyter Notebook
1
star
61

mac-install

Setting up your new Macbook with an install script.
Shell
1
star
62

airflow-aks-dags

Python
1
star
63

hdp-smokey

Hadoop smoke testing framework
Python
1
star
64

tmnl-spark-graphs-training

Python
1
star
65

code_breakfast_tutorial

HTML
1
star
66

bandit-friday

February 2021 GDD Friday project of Roel, Rogier and Vadim
Jupyter Notebook
1
star
67

balancing-heroes-and-pokemon

Balancing Heroes and Pokemon in Real Time: A Streaming Variant of Trueskill for Online Ranking
Scala
1
star