• This repository has been archived on 23/Apr/2023
  • Stars: 30
  • Rank 814,160 (Top 17 %)
  • Language
  • Created over 4 years ago
  • Updated over 4 years ago

Repository Details

Machine Learning infrastructure/architecture/operation for productionization

More Repositories

1

tabula-py

Simple wrapper of tabula-java: extract tables from PDFs into pandas DataFrames
Python
2,061
star
2

julia-100-exercises

Julia version of 100 NumPy exercises
Jupyter Notebook
128
star
3

Mykytea-python

Python wrapper for KyTea
C++
36
star
4

notebooks

Jupyter Notebook
31
star
5

MeCab.jl

Julia binding of Japanese morphological analyzer MeCab
Julia
21
star
6

cloudera-parcel

customized cloudera-parcel
Python
13
star
7

sparkavro

Load Avro data into Spark with sparklyr
R
12
star
8

ibis-demo

Demo notebook of Ibis for "Spark + Python + Data Science Festival"
Jupyter Notebook
12
star
9

homebrew-cloudera

Homebrew formulae for Cloudera tools
Ruby
10
star
10

sparklyr-distribute

Example code of spark_apply with sparklyr for CDH
R
8
star
11

NLTK-pyspark

Example repository for NLTK execution on PySpark cluster with Cloudera Data Science Workbench
Python
8
star
12

spacyr-sparklyr

Example code of spacyr with sparklyr
R
8
star
13

tdworkflow

Unofficial Treasure Workflow Client
Python
7
star
14

cdsw-simple-serving-python

Python
7
star
15

Mykytea-ruby

Ruby wrapper for KyTea
C++
7
star
16

amazon-movie-review

Recommendation for Amazon movie review data
Python
6
star
17

pollynomial

AWS Polly wrapper for Ruby: Text to speech gem
Ruby
6
star
18

solar-power-prediction

Jupyter Notebook
5
star
19

hocon-validator

HOCON validator
Python
5
star
20

cJuman-installer

An installer for cJuman, a wrapper of JUMAN.
C
5
star
21

cdsw-serve-docker

REST API server example with Docker for Cloudera Data Science Workbench
5
star
22

docker-sphinx-recommonmark

Sphinx documentation toolchain, including latex and recommonmark in an Ubuntu docker container.
Dockerfile
5
star
23

cloudera-sparklyr

Build script and Demo for Cloudera Director with Sparklyr
HTML
4
star
24

sparklytd

sparklyr plugin for td-spark to connect to TD from R
R
4
star
25

digdaglog2sql

Extract SQLs from digdag log
Python
4
star
26

mecab-on-pyspark

Example code for distributing Python packages on Spark cluster
Python
3
star
27

implyr-example

Example repository of implyr
R
3
star
28

JPKyteaTokenizer

Japanese tokenizer with KyTea for nltk
Python
3
star
29

pficommon_json_test

pficommon::text::json test
C++
3
star
30

molehill

Hivemall SQLs and digdag workflows generator
Python
3
star
31

morph-websocket

Real-time morphological analysis web app.
Ruby
2
star
32

cookiecutter-digdag

A template that generates digdag workflows for SQL and Python
Python
2
star
33

audience_generator

Create dummy data for Audience Studio on Treasure Data
Python
2
star
34

homebrew-jumanpp

A Homebrew formula for juman++ http://nlp.ist.i.kyoto-u.ac.jp/index.php?JUMAN++
Ruby
2
star
35

kytea_sinatra

Test application for KyTea with Sinatra
Ruby
2
star
36

JuliaTokyoTutorial

Julia Tokyo Tutorial
2
star
37

ml_intern2015

Cookpad summer intern 2015 exercise
Python
1
star
38

chezou-hugo

HTML
1
star
39

japan_weather

Python
1
star
40

mizuyarilink_octopress

CSS
1
star
41

prelims-cli

Python
1
star
42

ConfidenceWeighted.jl

confidence weighted classifier
Julia
1
star