• Stars
    star
    2
  • Language
    Java
  • License
    GNU General Publi...
  • Created almost 9 years ago
  • Updated about 8 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Apply statistical text analysis to messages, Wikipages, documents, or any other collection.

More Repositories

1

OpenTSx

Let's simplify time series analysis on data streams for low latency decision support!
Java
8
star
2

Hadoop.TS

A collection of tools for time series processing in Hadoop.
Java
8
star
3

HDGS

HDGS is an abstraction layer for advanced data structures stored in Hadoop HDFS or HBase.
HTML
5
star
4

ks-inspector

Understand how your KSQL queries and KStreams-Applications depend on shared topics in a Kafka cluster.
Java
5
star
5

giraphl

giraphl is a library related to Apache Giraph. It is a place to collect things around graph, which fits not into a fork of the Giraph project.
Shell
5
star
6

etosha

etosha
JavaScript
4
star
7

hadoop-admin-and-developer-scripts

A collection of scripts, reciepes and notes to support admin and developement tasks on a fresh installed hadoop cluster or gateway node.
JavaScript
3
star
8

MorphMiner

MorphMiner is in its core a Morphline development tool. But it can also be seen as a data collection tool for scientists.It allows data ingestion into Hadoop clusters.
Java
3
star
9

FluctuationAnalysis

A set of time series analysis algorithms, e.g. DFA and MFDFA
Java
3
star
10

crunch.TS

the fast way of building time series processing pipelines for Hadoop using Crunch
Java
3
star
11

ghc

gephi-hadoop-connector
Java
2
star
12

WikiExplorer

A tool to collect statistical data about mediawikipages using the Mediawiki API.
Java
2
star
13

kstreams-perf-test

Performance testing methodology and tools to benchmark KStreams applications.
Java
2
star
14

graphx-layouts

Lets layout large graphs using GraphX and Spark
Scala
2
star
15

TSCache

A webinterface for time series and bucket storage in HBase.
Java
1
star
16

cdsw-engine-custom-01

Add Maven to a custom engine.
Shell
1
star
17

cdsw-dl4j-mvdp-on-cdh

Demo of a minimal viable data product on CDH using Deeplearning4J.
1
star
18

da_graph_frames_lab02

Some exercise using graph frames
Scala
1
star
19

wikixmldumpstatistics

A toolbox for analysing mediawikixml dumpfiles for text statistics
Java
1
star
20

Snaffer2

Some tools to collect network traffic statistics.
Python
1
star
21

Humulus

The CumulusRDF HBase connector prototype is built here.
Java
1
star
22

AvroOperations

Supplementary material for Avro tutorials
Java
1
star
23

IoT-Workshop-2018

1
star