• Stars
    star
    11
  • Rank 1,694,829 (Top 34 %)
  • Language
    Scala
  • License
    Apache License 2.0
  • Created over 8 years ago
  • Updated over 8 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

spark package to natively read S3 files instead of through hadoop, improving speed

More Repositories

1

PyFunctional

Python library for creating data pipelines with chain functional programming
Python
2,380
star
2

pygraph

CLI interface to python graphviz
Python
44
star
3

spot-price-reporter

Fetch and plot AWS spot pricing history
Python
23
star
4

docker-workshop

Docker workshop going over how to create a set of microservices
Python
12
star
5

wikidata-rust

Fast rust-based parser for wikidata dumps
Rust
8
star
6

slurm-tools

Python
6
star
7

GrappaRDD

Resilient Distributed Datasets from Apache Spark with distributed shared memory using Grappa
C++
4
star
8

kuro

Experiment manager for ML experiments
Python
3
star
9

entilzha.github.io

Personal webpage sourcecode
JavaScript
3
star
10

pandoc-viewer

Make viewing docs in vim and pandoc easier
Go
3
star
11

plda-spark

Code implementing parallel latent dirichlet allocation in spark. Work in progress.
Scala
2
star
12

nrcs-api

Source code for using NRCS Snotel API at http://www.wcc.nrcs.usda.gov/web_service/awdb_web_service_landing.htm
Java
2
star
13

music-theory

Python
1
star
14

acl-service-handbook

Community sourced handbook for how to propose/create/run *ACL workshops
1
star
15

EMAlgorithmCoinExample

Implementation of the EM Algorithm for the coin example found here: http://ai.stanford.edu/~chuongdo/papers/em_tutorial.pdf
Scala
1
star
16

dotfiles-old

Configuration files (dotfiles) repo
Vim Script
1
star
17

publications

Repository listing my published papers.
TeX
1
star
18

fever

Partial re-implementation of fever models based on google language repo
Python
1
star
19

acl-miniconf-old

JavaScript
1
star
20

leaderboard

Source code for CU Boulder leaderboard web app
Python
1
star
21

overseer

Python
1
star
22

talks

List of talks
Jupyter Notebook
1
star
23

nips-lda-spark

Testing LDA implementation
Scala
1
star
24

sg-snowprofile

Creates snow profiles from txt of snowpit data. Created by SnowGeek
Python
1
star
25

reinforcement-learning

Learning about RL
Python
1
star