• Stars
    star
    5
  • Rank 2,780,788 (Top 57 %)
  • Language
    JavaScript
  • License
    MIT License
  • Created almost 5 years ago
  • Updated over 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

The Data Civilizer end-to-end data preparation system - 2.0

More Repositories

1

Arabesque

Scalable Graph Mining
Java
61
star
2

LLMeBench

Benchmarking Large Language Models
Python
61
star
3

dialectID

Automatic Dialect Detection Repository
Jupyter Notebook
38
star
4

ArabicASRChallenge2016

This repository
Java
31
star
5

DeepBlocker

Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space Exploration"
Python
27
star
6

FarasaSegmenter

Java
21
star
7

sleep_awake_benchmark

This code is part of the paper ''A Large Scale Benchmark to Validate Sleep-Wake Scoring Algorithms'' currently under review.
Jupyter Notebook
17
star
8

dialectal_arabic_resources

Shell
16
star
9

RDFframes

A python API for exposing and processing RDF data from sparql endpoints for data mining and machine learning models in convenient formats like Pandas dataframes.
Python
15
star
10

tasrif

Tasrif is a python library for processing of wearable data from fitness trackers and wearable health devices
Python
15
star
11

dialectal_arabic_tools

Roff
14
star
12

PDNS-Net

Passive DNS Dataset of Domain Resolutions
Jupyter Notebook
13
star
13

Arabic_speech_code_switching

The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguistic and the acoustic cues. This dataset is a potential benchmark for DCS in spontaneous speech.
13
star
14

dialectal_arabic_segmenter

Arabic Dialects Segmenter Using Keras/BiLSTM/ChainCRF
Python
10
star
15

ArabicSpellChecker

Java
10
star
16

QARiB

Python
9
star
17

QLFactChecking

Python
8
star
18

e-wer

Word Error Rate Estimation
Python
8
star
19

data_civilizer_system

JavaScript
8
star
20

dialectal_arabic_pos_tagger

Python
7
star
21

coclean

CoClean: Collaborative Data Cleaning
Jupyter Notebook
7
star
22

deepemotion

Arabic emotion recognition using deep neural networks
Python
7
star
23

CMDL

Cross-Modal Data Discovery over Structured and Unstructured Data Lakes
Jupyter Notebook
6
star
24

multiRefWER

Python
5
star
25

Kunafa

A python script using perf and PMU to monitor memory bandwidth, cache, and other performance metrics
Python
5
star
26

EmbLookup

Repository containing implementation for the ICDE 2022 paper "Accelerating Entity Lookups in Knowledge Graphs Through Embeddings"
Python
4
star
27

QADI

QCRI Arabic Dialect Identification
4
star
28

EvaluationMetrics

Code to evaluate cqa runs in the ECML 2016 challenge on reranking questions in community question answering
Java
4
star
29

gym-gharrafa

OpenAI Gym environment used in the KDD2019 Paper "Time Critic Policy Gradient Methods for Traffic Signal Control in Complex and Congested Scenarios"
Jupyter Notebook
4
star
30

qcri-svm-segmenter

Segmenter of dialectal Arabic developed by QCRI, ALT team and published in EACL 2017 and CONLL 2017
PLSQL
3
star
31

DiplomaticPulse

Python
3
star
32

WikiQAar

Cross-language English-Arabic corpus derived from WikiQA
3
star
33

RetClean

AI for Data Preparation
Python
3
star
34

alt_public

ALT research group publications
TeX
3
star
35

alt-hackathon-docs

QCRI Speech Recognition and Machine Translation API's for Hackathons
JavaScript
3
star
36

PropagandaTechniquesAnalysisBERT

Python
3
star
37

dial-diac

A System for Diacritizing Four Varieties of Arabic
CSS
2
star
38

cmuqhack2017

repository for cmu-q hackathon 2017
Python
2
star
39

gpsmap

Real time map creation and update using gps data
Python
2
star
40

tacotron2

Jupyter Notebook
2
star
41

QLN-LiveNewsDemo

QCRI Live News Demonstration
PHP
2
star
42

apihub

serve and publish API
Python
2
star
43

COVID19-MAL-Blacklist

COVID19 Themed Domain Blacklist
2
star
44

dbcopier

Ruby tool to copy a database of any type to a database of any other type
Ruby
2
star
45

dockerized_moses_server

Dockerfile
2
star
46

ArabicSpeechTextProcessing

Processing Dialectal Arabic Speech Transcription
Jupyter Notebook
2
star
47

QCAI-TransportaionGroup-TrImpute

Python
2
star
48

deepdialect

Deep Arabic Dialect Detection
Python
1
star
49

PHD_Datasets

PHD Data collect by Matheus Araujo
1
star
50

qcri-demo-icassp-2018-client

The client side of QCRI demo at ICASSP 2018
CSS
1
star
51

RDF-generator

Converting structured and semi-structured data into RDF graphs.
Python
1
star
52

QCAI-TransportaionGroup-GTI

GTI: Graph Trajectory Imputation
1
star
53

flight2vec

Boeing flight2vec project
HTML
1
star
54

Zagel

Chatting app based on the crosscloud platform
CSS
1
star
55

Arabesque-Skeleton

Skeleton template for new projects on top of Arabesque
Java
1
star
56

TrafQ

TrafQ is a collection of Open AI environments and baseline algorithms for Traffic Light control optimization with Reinforcement Learning. Environments simulate real road networks and traffic.
Jupyter Notebook
1
star
57

ADAC-traffic

Offline reinforcement learning for traffic signal control
1
star
58

compromised

Detecting Compromised and Attack domains
Jupyter Notebook
1
star