There are no reviews yet. Be the first to send feedback to the community and the maintainers!
buenavista
A Postgres Proxy Server in Pythonexhibit
A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.duckdbt
The Modern Data Stack in a Python packagede4ml
Supporting materials/code examples for my course in data engineering for machine learning.avro-json
Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.geojson
Scala library for working with GeoJSON records using Esri's Geometry API for Javatarget-duckdb
A Singer.io target for DuckDBdriskill
Either[Hotel in Austin, Prototype of a Scala Distributed Collections API]nba_monte_carlo
The Modern Data Stack in a (Smaller) Boxlineage
An R package for tracking the transformations applied to the vectors in a data frame.supernova
A starter kit for working with supernova schemas.dbt-buenavista
The dbt adapter for a Buena Vista database proxy serverhive-scd
A new kind of slowly changing dimension pattern for Apache Hive.crunch-demo
A demo application for getting started with Apache Crunch.dbt-mysql
MySQL plugin for dbtsaferdd
Tools for working with dirty data in Apache Spark.attribution
MapReduce job for creating multitouch attribution models.avroplay
Me messing around with some Avro stuffs3-demo
Demo dbt-duckdb against localstack w/the new fsspec config options in version 1.4.1hanukkahofdata
My solutions to the 2023 Hanukkah of Datacdh-mapreduce-ext
Classes in the new mapreduce.* API that are not part of CDH3 yet.avro-json-serde
A wrapper that uses the Hive AvroSerDe to deserialize data as JSON for use with Hive Streaminghosprunner
Love Open Source and this site? Check out how you can help us