• Stars
    star
    161
  • Rank 233,470 (Top 5 %)
  • Language
    Java
  • Created over 9 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Clustering benchmarks

Datasets

This project contains collection of labeled clustering problems that can be found in the literature. Most of datasets were artificially created.

The benchmark includes:

Artificial data

2d-10c 2d-20c-no0 2d-3c-no123 2d-4c-no4 2d-4c-no9 2d-4c 2sp2glob 3-spiral 3MC D31 DS577 DS850 R15 aggregation atom banana birch-rg1 birch-rg2 birch-rg3 chainlink cluto-t4.8k cluto-t5.8k cluto-t7.10k cluto-t8.8k complex8 complex9 compound cure-t0-2000n-2D cure-t1-2000n-2D cure-t2-4k curves1 curves2 dartboard1 dartboard2 dense-disk-3000 dense-disk-5000 diamond9 disk-1000n disk-3000n disk-4000n disk-4500n disk-4600n disk-5000n disk-6000n donut1 donut2 donut3 donutcurves ds2c2sc13 ds3c3sc6 ds4c2sc8 elliptical_10_2 elly-2d10c13s engytime flame fourty golfball hepta insect jain long1 long2 long3 longsquare lsun mopsi-finland mopsi-joensuu pathbased rings s-set1 s-set2 s-set3 s-set4 sizes1 sizes2 sizes3 sizes4 sizes5 smile1 smile2 smile3 spherical_4_3 spherical_5_2 spherical_6_2 spiral spiralsquare square1 square2 square3 square4 square5 st900 target tetra triangle1 triangle2 twenty twodiamonds wingnut xclara zelnik1 zelnik2 zelnik3 zelnik4 zelnik5 zelnik6

Experiments

This project contains set of clustering methods benchmarks on various dataset. The project is dependent on Clueminer project.

in order to run benchmark compile dependencies into a single JAR file:

mvn assembly:assembly

Consensus experiment

allows running repeated runs of the same algorithm:

./run consensus --dataset "triangle1" --repeat 10

by default k-means algorithm is used.

For available datasets see resources folder.

More Repositories

1

puppet-mesos

Puppet module for managing Mesos nodes
Ruby
70
star
2

DaVinciResolve-API-Docs

Ruby
56
star
3

es-dedupe

Tool for removing duplicate documents from Elasticsearch
Python
54
star
4

DaVinciResolve-metadata

EXIF metadata synchronization tool
Lua
38
star
5

puppet-accounts

Simple hierachical management of Linux user accounts, groups and SSH keys
Ruby
34
star
6

mesos-deb-packaging

Mesos package for Debian, Ubuntu
Shell
31
star
7

puppet-zookeeper

Puppet module for managing Apache ZooKeeper
Ruby
22
star
8

kafka-manager-docker

kafka-manager in Docker container
Makefile
19
star
9

housing

School task at AI
CLIPS
18
star
10

clueminer

interactive clustering platform
Java
13
star
11

storm-mesos

Storm framework for Mesos with Debian packaging
Java
6
star
12

docker-nodejs-helloworld

node.js hello world example based on Debian
Makefile
6
star
13

crashctl

Simple tool for crashing server diagnosis
Shell
5
star
14

java-chart-benchmark

Benchmark of Java plotting libraries
Java
5
star
15

mesos-torque

Python
5
star
16

bicing

school project
Java
4
star
17

puppet-beegfs

Manage BeeGFS parallel system installations.
HTML
4
star
18

gnuplot-sty

A LaTeX package for including gnuplot into documents
4
star
19

CLIPS-FAQ

Translation of a document from Catalan to English
4
star
20

cl-compiler

A school project at UPC FIB
C++
4
star
21

curve-fit-demo

Splines and curve interpolation demo in Java
Java
3
star
22

handl-data-generators

Clustering data generators
C
3
star
23

x01avt

Řešené předměty k předmětu Algebra pro výpočetní techniku (FEL ČVUT)
3
star
24

dcos-cerebro

DC/OS package for Cerebro
Shell
3
star
25

uwsgi-deb-packaging

custom package with python3 support
Shell
2
star
26

puppet-pgprobackup

Automated pg_probackup
Ruby
2
star
27

dcos-traefik

Traefik DC/OS package
Shell
2
star
28

puppet-fluentbit

Puppet
2
star
29

puppet-hindsight

Manages Hindsight log processing engine
Ruby
2
star
30

puppet-clickhouse

Pupppet module to manage Clickhouse installation
Ruby
2
star
31

fast-community

A graph clustering algorithm
C++
2
star
32

docker-icinga2-dashing

Makefile
2
star
33

nscp-deb-packaging

Shell
1
star
34

gluster-monitor

Debian package for gluster-monitor
Python
1
star
35

puppet-pubkey

Generate ssh key pair and exports public key
Ruby
1
star
36

puppet-torque

Puppet module for managing Torque
Ruby
1
star
37

data-mining-cure-algorithm

Automatically exported from code.google.com/p/data-mining-cure-algorithm
Java
1
star
38

mi-rub

Ruby
1
star
39

nginx-passenger-deb-packaging

Debian/Ubuntu package for customized nginx binary with passenger support
Shell
1
star
40

puppet-storm

Puppet module for managing Storm
Puppet
1
star
41

deric-r

Puppet module for r-project installation
Ruby
1
star
42

marathon-deb-packaging

Shell
1
star
43

clustevalPackages

Java
1
star
44

treeviz

Java
1
star
45

spark-word-count

Simple Spark demo
Java
1
star
46

spark-deb-packaging

customization of spark deb package
Shell
1
star
47

clueminer-cli

Java
1
star
48

kafka-packaging

Ruby
1
star