• Stars
    star
    290
  • Rank 142,981 (Top 3 %)
  • Language
    Go
  • License
    Apache License 2.0
  • Created almost 3 years ago
  • Updated almost 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A version control system to manage large files.

Apache License Go tests status

ArtiVC

ArtiVC (Artifacts Version Control) is a handy command-line tool for data versioning on cloud storage. With only one command, it helps you neatly snapshot your data and Switch data between versions. Even better, it seamlessly integrates your existing cloud environment. ArtiVC supports three major cloud providers (AWS S3, Google Cloud Storage, Azure Blob Storage) and the remote filesystem using SSH.

asciicast

Try it out from the Getting Started guide

Features

  • Data Versioning: Version your data like versioning code. ArtiVC supports commit history, commit message, and version tag. You can diff two commits, and pull data from the specific version.
  • Use your own storage: We are used to putting large files in NFS or S3. To use ArtiVC, you can keep putting your files on the same storage without changes.
  • No additional server is required: ArtiVC is a CLI tool. No server or gateway is required to install and operate.
  • Multiple backends support: ArtiVC natively supports local filesystem, remote filesystem (by SSH), AWS S3, Google Cloud Storage, and Azure Blob Storage as backend. And 40+ backends are supported through Rclone integration. Learn more
  • Painless Configuration: No one like to configure. So we leverage the original configuration as much as possible. Use .ssh/config for ssh access, and use aws configure, gcloud auth application-default login, az login for the cloud platforms.
  • Efficient storage and transfer: The file structure of the repository is stored and transferred efficiently by design. It prevents storing duplicated content and minimum the number of files to upload when pushing a new version. Learn more

Documentation

For more detail, please read the ArtiVC documentation

More Repositories

1

piperider

Code review for data in dbt
Python
479
star
2

primehub

open-source MLOps platform
Shell
390
star
3

colab-xterm

Open a terminal in colab, including the free tier.
Python
366
star
4

crane

Crane is a easy-to-use and beautiful desktop application helps you build manage your container images.
TypeScript
283
star
5

awesome-public-dbt-projects

A curated list of awesome public DBT projects
82
star
6

k8s-iperf

Run network performance test in kubernetes cluster
Shell
31
star
7

primehub-aws-cdk

Life is short, don't waste time on setting k8s environment. One-click CDK to set up AWS EKS with PrimeHub.
TypeScript
28
star
8

taxi_rides_ny_duckdb

PipeRider dbt workshop for DataTalksClub DE Zoomcamp
16
star
9

dimon

Python
15
star
10

primehub-python-sdk

PrimeHub Python SDK
Python
13
star
11

primehub-console

PrimeHub Console UI
TypeScript
12
star
12

primehub-controller

🎮 PrimeHub Controller
Go
11
star
13

piperider-compare-action

Shell
9
star
14

git-repo-analytics

Python
7
star
15

model-deployment-examples

Shell
5
star
16

primehub-site

A static site of PrimeHub.
JavaScript
5
star
17

showcase

showcase
Jupyter Notebook
5
star
18

PrimeLM

The large language model service and project dialogue system platform, built for enterprises, is provided by InfuseAI.
5
star
19

dbt-nthu-kktv

4
star
20

kube-notebooks

Jupyter Notebooks ❤️ Kubernetes
Jupyter Notebook
4
star
21

awesome-primehub-apps

Collection of awesome PrimeHub Apps
Python
3
star
22

primehub-job

Python
2
star
23

primehub-seldon-servers

Python
2
star
24

primehub-install

Shell
2
star
25

auto-img-cls

Jupyter Notebook
2
star
26

piperider-action

JavaScript
2
star
27

piperider-blog

PipeRider blog built in Jekyll
HTML
2
star
28

WaysOfML

JavaScript
2
star
29

primehub-remote-deploy

The example of primehub-python-sdk, deploy the PrimeHub deployment to remote cluster
Python
2
star
30

PipeRider-Documentation

1
star
31

Homebrew-ArtiVC

homebrew formula for artiv
Ruby
1
star
32

TaoKanOperator

A Kubernetes operator for transferring PVC data to the remote cluster
Go
1
star
33

dbt-infuse-finance

Python
1
star
34

dbt-project-pull-request-visualizer

A tool to visualize the GitHub Pull Request of a dbt project
Python
1
star
35

one.primehub.io

JavaScript
1
star
36

primehub-dataset-upload

Python
1
star
37

primehub-examples

When the PrimeHub Notebook start, we will get the primehub-example as our example folder. This repository will put the example into the folder.
Makefile
1
star
38

piperider-getting-started

1
star