There are no reviews yet. Be the first to send feedback to the community and the maintainers!
buenavista
A Postgres Proxy Server in Pythonexhibit
A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.duckdbt
The Modern Data Stack in a Python packagede4ml
Supporting materials/code examples for my course in data engineering for machine learning.avro-json
Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.geojson
Scala library for working with GeoJSON records using Esri's Geometry API for Javatarget-duckdb
A Singer.io target for DuckDBdriskill
Either[Hotel in Austin, Prototype of a Scala Distributed Collections API]nba_monte_carlo
The Modern Data Stack in a (Smaller) Boxlineage
An R package for tracking the transformations applied to the vectors in a data frame.supernova
A starter kit for working with supernova schemas.mz-fastapi
A FastAPI utility for building HTTP endpoints powered by Materialize TAIL queriesdbt-buenavista
The dbt adapter for a Buena Vista database proxy serverhive-scd
A new kind of slowly changing dimension pattern for Apache Hive.crunch-demo
A demo application for getting started with Apache Crunch.saferdd
Tools for working with dirty data in Apache Spark.attribution
MapReduce job for creating multitouch attribution models.avroplay
Me messing around with some Avro stuffs3-demo
Demo dbt-duckdb against localstack w/the new fsspec config options in version 1.4.1hanukkahofdata
My solutions to the 2023 Hanukkah of Datacdh-mapreduce-ext
Classes in the new mapreduce.* API that are not part of CDH3 yet.avro-json-serde
A wrapper that uses the Hive AvroSerDe to deserialize data as JSON for use with Hive Streaminghosprunner
Love Open Source and this site? Check out how you can help us