• Stars
    star
    111
  • Rank 289,288 (Top 7 %)
  • Language
    Scala
  • License
    Apache License 2.0
  • Created about 6 years ago
  • Updated 7 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.

schemer

Build Status Maven Docker Pulls

Schema registry with support for CSV, TSV, AVRO, JSON and Parquet. Has ability to infer schema from a given data source.

Schemer UI [WIP]

Schemer UI is the wizard based frontend for Schemer. It provides a wizard based schema creation and versioning workflow apart from browsing and search capabilities. It is a work in progress. More screens

Schemer Core

schemer-core is the core library that implements most of the logic needed to understand the supported schema types along with the schema inference. To use schemer-core directly, just add it to your dependencies:

libraryDependencies += "com.indix" %% "schemer" % "v0.2.3"

Schemer Registry

schemer-registry is a schema registry for storing the metadata about schema and schema versions. It provides a GraphQL API for adding, viewing and inferring schemas.

Schemer Registry is available as a docker image at DockeHub

Running Locally

Local docker based PostgreSQL can be run as follows:

docker run -e POSTGRES_USER=schemer -e POSTGRES_PASSWORD=schemer -e PGDATA=/var/lib/postgresql/data/pgdata -e POSTGRES_DB=schemer -v $(pwd)/schemer_db:/var/lib/postgresql/data/pgdata -p 5432:5432 postgres:9.5.0

Remove schmer_db folder to clear all data and start from scratch.

The registry service can be run using sbt:

sbt "project registry" ~reStart

More Repositories

1

whatthelang

Lightning Fast Language Prediction 🚀
Python
157
star
2

aws-maintenance-lambda

A lambda function to send alerts (to Slack, HipChat) on AWS maintenance events.
JavaScript
133
star
3

web-auto-extractor

Automatically extracts structured information from webpages
JavaScript
106
star
4

matsya

Place ASGs on the right Spot Market
Scala
39
star
5

gocd-s3-artifacts

Set of GoCD plugins to publish and fetch artifacts from Amazon S3
Java
36
star
6

formland

A simple, super-flexible, extensible config based form generator for React.
TypeScript
33
star
7

sparkplug

Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
Scala
28
star
8

mlflow-gocd

GoCD plugins to work with MLFlow as model repository in a CD flow
Java
27
star
9

css-optimum-selector

Helps to extract shortest optimal css-selector and multi-selector.
CSS
26
star
10

gocd-mesos

Autoscale GOCD agents on top of a mesos cluster
Scala
16
star
11

javascript-easy-object

Now easily access or modify an object in javascript with javascript-easy-object.
JavaScript
13
star
12

vasuki

Scale GoCD Agents on demand with Docker
Go
13
star
13

kafkajs-lz4

🗜 LZ4 compression codec for KafkaJS
TypeScript
12
star
14

rocks

RocksDB Ops CLI
Go
11
star
15

indix-radar

Indix Tech Radar
JavaScript
10
star
16

vamana

Autoscaling toolkit based on custom Application Metrics
Scala
9
star
17

terraform-aws-maintenance-lambda

Terraform module to deploy aws-maintenance-lambda - A lambda function to send alerts (to Slack, HipChat) on AWS maintenance events.
HCL
7
star
18

indix.github.io

Indix Open Source website
HTML
6
star
19

utils

Scala utils for anything and everything
Scala
5
star
20

indix-api-java

Indix API Java client
Java
4
star
21

indix-api-nodejs

Indix API NodeJS Client
JavaScript
4
star
22

bubblewrap

Asynchronous crawler utils
HTML
3
star
23

ml2npy

Export spark ml SparseVectors as numpy csr matrix
Scala
3
star
24

crawler4j

crawler4j fork from Google code
Java
2
star
25

marathon-logger

Push marathon app logs to local syslog daemon
Go
2
star
26

the-vision

Reusable react components
HTML
2
star
27

auto-tag-s3-bucket

Automatically tag S3 buckets with tags from a Google Spreadsheet
Python
2
star
28

indix-api-ruby

Ruby client for indix API
Ruby
2
star
29

Mobile-Product-Search

This app is a representation of how Indix API can be used to leverage information on stores, brands, products which forms the skeleton of Retail Industry.
Objective-C
1
star
30

ansible-ruby

Ansible role to install rvm and ruby
1
star
31

abelwatch

Alerting tool on top of Abel
Go
1
star
32

hadoop-as-publisher

Hadoop Autoscaling Metric Publisher
Python
1
star
33

mod_evasive

git mirror of mod_evasive apache module from http://www.zdziarski.com/blog/wp-content/uploads/2010/02/mod_evasive_1.10.1.tar.gz
C
1
star
34

rocksdb-io

hadoop formats, cascading tap and scalding sources for RocksDB
Scala
1
star
35

ansible-telegraf

Ansible role to install telegraf
1
star
36

abel

Business stats collection/aggregation
Scala
1
star
37

ansible-monit

Ansible role for monit
1
star