Treeverse (@treeverse)

Top repositories

1

lakeFS

lakeFS - Data version control for your data lake | Git for data
Go
4,294
star
2

lakeFS-samples

lakefs-samples repository
Jupyter Notebook
69
star
3

lakeview

lakeview is a visibility tool for S3-based data lakes
Python
30
star
4

airflow-provider-lakeFS

lakeFS Airflow operator
Python
26
star
5

blogs

Supporting code for lakeFS blogs
Jupyter Notebook
23
star
6

boto-s3-router

Boto S3 Router provides a Boto3-like client that routes requests between S3 clients according to the bucket and the key in the request.
Python
18
star
7

charts

Helm charts
Smarty
18
star
8

lakeFS-hooks

A simple lakeFS webhook for pre-commit and pre-merge validation of data objects
Python
12
star
9

terminus

Track and enforce quota on S3
Go
7
star
10

dais-challenge

Data + AI Summit 2022 lakeFS challenge
Shell
6
star
11

lakeFS-axolotl-the-developer-mascot

Axolotl, lakeFS Developers Mascot
5
star
12

hadoop-router-fs

RouterFileSystem is a Hadoop FileSystem implementation that transforms URIs at runtime according to provided configurations. It then routes file system operations to another Hadoop file system that executes them against the underlying object store.
Java
4
star
13

blog-presto-local

Presto environment, part of a blog post
Shell
4
star
14

lakefs-spark-extensions

Spark SQL extensions for lakeFS
Scala
2
star
15

lakefs-iceberg-catalog

Java
2
star
16

docs-lakeFS

lakeFS documentation - docs.lakefs.io
HTML
2
star
17

spark-client

A lakeFS client for Apache Spark
Scala
1
star
18

onboarding

Repository to hold onboarding resources
1
star
19

lakefs-iceberg

A custom Iceberg catalog implementation for lakeFS
Java
1
star