Data Systems Group (@DataSystemsGroupUT)

Top repositories

1

AutoML_Survey

204
star
2

dataeng

Repository fo Data Engineering Course
Jupyter Notebook
50
star
3

SmartML

SmartML: Supervised Machine Learning Automation in R
R
23
star
4

SPARKSQLRDFBenchmarking

A systematic Benchmarking on the performance of Spark-SQL for processing Vast RDF datasets
Scala
14
star
5

Adaptive-Watermarks

An approach to apply concept drifts and ADWIN on the streams event time to reason about the progress of watermarks
Java
9
star
6

Flink-Stream-SQL-Examples

Java
6
star
7

DLBench

A repository for benchmarking deep learning frameworks on different data sets
Jupyter Notebook
5
star
8

ismartml

Python
5
star
9

DC-classification

DC classification with unlabeled data
Python
5
star
10

PAPyA

Prescriptive Performance Analysis in Python Actions
HTML
5
star
11

Benchmarking-Big-Streams-Systems

An extension of Yahoo!'s benchmarking of big streaming systems
Java
5
star
12

ConformanceCheckingUsingTries

A Trie-based approach to efficiently compute alignment approximations
Java
4
star
13

Interpretability-comparison

Jupyter Notebook
4
star
14

ICEP

D2IA is a Flink library that uses Flink CEP to declaratively define event intervals and reason about their relationships using Allen's interval algebra.
Java
4
star
15

Minaret

A tool chain to intelligently search for scholars matching research disciplines
CSS
4
star
16

Distributed-SmartML

Scala
3
star
17

BigFeat

Automated feature engineering project
Python
3
star
18

auto_feature_engineering

Automated Feature Engineering Project
Jupyter Notebook
2
star
19

differential-reasoner

Rust
2
star
20

HyperParameterTunability

Jupyter Notebook
2
star
21

Process-Mining-Pipelines

Jupyter Notebook
1
star
22

csmartml

TypeScript
1
star
23

RDFBenchRankLib

Jupyter Notebook
1
star
24

automl_exploration_vs_exploitation

Jupyter Notebook
1
star
25

DLLD

DLLD: Deep Learning Framework for Lesion Detection on Medical Image
Python
1
star
26

AutoMLDesignDecisions

A Micro Analysis for the Design Decisions of the AutoML Process
Jupyter Notebook
1
star
27

ACDTE

Code for the Automatic Concept based Decision Tree Explanations
Jupyter Notebook
1
star
28

StreamingConformanceChecker

Streaming Conformance Checker on top of Beamline
Java
1
star
29

DISGD

A distributed shared-nothing variant of the incremental stochastic gradient descent algorithm
Java
1
star
30

Process-Discovery-over-unordered-streams

A Flink library to implement both a buffer-based and a speculative out-of-order event arrival handlers for online process discovery
Java
1
star