• Stars
    star
    61
  • Rank 497,051 (Top 10 %)
  • Language
    Java
  • License
    Apache License 2.0
  • Created about 9 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Scalable Graph Mining

More Repositories

1

LLMeBench

Benchmarking Large Language Models
Python
79
star
2

dialectID

Automatic Dialect Detection Repository
Jupyter Notebook
39
star
3

ArabicASRChallenge2016

This repository
Java
30
star
4

DeepBlocker

Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space Exploration"
Python
30
star
5

FarasaSegmenter

Java
22
star
6

sleep_awake_benchmark

This code is part of the paper ''A Large Scale Benchmark to Validate Sleep-Wake Scoring Algorithms'' currently under review.
Jupyter Notebook
20
star
7

dialectal_arabic_resources

Shell
16
star
8

PDNS-Net

Passive DNS Dataset of Domain Resolutions
Jupyter Notebook
16
star
9

RDFframes

A python API for exposing and processing RDF data from sparql endpoints for data mining and machine learning models in convenient formats like Pandas dataframes.
Python
15
star
10

dialectal_arabic_tools

Roff
14
star
11

tasrif

Tasrif is a python library for processing of wearable data from fitness trackers and wearable health devices
Python
14
star
12

Arabic_speech_code_switching

The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguistic and the acoustic cues. This dataset is a potential benchmark for DCS in spontaneous speech.
14
star
13

ArabicSpellChecker

Java
11
star
14

dialectal_arabic_segmenter

Arabic Dialects Segmenter Using Keras/BiLSTM/ChainCRF
Python
10
star
15

e-wer

Word Error Rate Estimation
Python
10
star
16

QARiB

Python
9
star
17

QLFactChecking

Python
8
star
18

data_civilizer_system

JavaScript
8
star
19

dialectal_arabic_pos_tagger

Python
7
star
20

coclean

CoClean: Collaborative Data Cleaning
Jupyter Notebook
7
star
21

deepemotion

Arabic emotion recognition using deep neural networks
Python
7
star
22

CMDL

Cross-Modal Data Discovery over Structured and Unstructured Data Lakes
Jupyter Notebook
6
star
23

multiRefWER

Python
5
star
24

datacivilizer2

The Data Civilizer end-to-end data preparation system - 2.0
JavaScript
5
star
25

Kunafa

A python script using perf and PMU to monitor memory bandwidth, cache, and other performance metrics
Python
5
star
26

QADI

QCRI Arabic Dialect Identification
5
star
27

RetClean

AI for Data Preparation
Python
5
star
28

EmbLookup

Repository containing implementation for the ICDE 2022 paper "Accelerating Entity Lookups in Knowledge Graphs Through Embeddings"
Python
4
star
29

EvaluationMetrics

Code to evaluate cqa runs in the ECML 2016 challenge on reranking questions in community question answering
Java
4
star
30

gym-gharrafa

OpenAI Gym environment used in the KDD2019 Paper "Time Critic Policy Gradient Methods for Traffic Signal Control in Complex and Congested Scenarios"
Jupyter Notebook
4
star
31

QCAI-TransportaionGroup-TrImpute

Python
4
star
32

qcri-svm-segmenter

Segmenter of dialectal Arabic developed by QCRI, ALT team and published in EACL 2017 and CONLL 2017
PLSQL
3
star
33

DiplomaticPulse

Python
3
star
34

WikiQAar

Cross-language English-Arabic corpus derived from WikiQA
3
star
35

alt_public

ALT research group publications
TeX
3
star
36

alt-hackathon-docs

QCRI Speech Recognition and Machine Translation API's for Hackathons
JavaScript
3
star
37

PropagandaTechniquesAnalysisBERT

Python
3
star
38

Text2TTP

A Tool for Semantic Ranking for Automated Adversarial Technique Annotation in Security Text
Jupyter Notebook
3
star
39

dial-diac

A System for Diacritizing Four Varieties of Arabic
CSS
2
star
40

cmuqhack2017

repository for cmu-q hackathon 2017
Python
2
star
41

gpsmap

Real time map creation and update using gps data
Python
2
star
42

tacotron2

Jupyter Notebook
2
star
43

QLN-LiveNewsDemo

QCRI Live News Demonstration
PHP
2
star
44

dbcopier

Ruby tool to copy a database of any type to a database of any other type
Ruby
2
star
45

COVID19-MAL-Blacklist

COVID19 Themed Domain Blacklist
2
star
46

dockerized_moses_server

Dockerfile
2
star
47

ArabicSpeechTextProcessing

Processing Dialectal Arabic Speech Transcription
Jupyter Notebook
2
star
48

apihub

serve and publish API
Python
2
star
49

QCAI-TransportaionGroup-GTI

GTI: Graph Trajectory Imputation
2
star
50

multilingual-latent-concepts

Code associated with the ACL24 paper titled, "Exploring Alignment in Shared Cross-Lingual Spaces"
Jupyter Notebook
2
star
51

compromised

Detecting Compromised and Attack domains
Jupyter Notebook
2
star
52

deepdialect

Deep Arabic Dialect Detection
Python
1
star
53

PHD_Datasets

PHD Data collect by Matheus Araujo
1
star
54

qcri-demo-icassp-2018-client

The client side of QCRI demo at ICASSP 2018
CSS
1
star
55

RDF-generator

Converting structured and semi-structured data into RDF graphs.
Python
1
star
56

flight2vec

Boeing flight2vec project
HTML
1
star
57

Zagel

Chatting app based on the crosscloud platform
CSS
1
star
58

Arabesque-Skeleton

Skeleton template for new projects on top of Arabesque
Java
1
star
59

TrafQ

TrafQ is a collection of Open AI environments and baseline algorithms for Traffic Light control optimization with Reinforcement Learning. Environments simulate real road networks and traffic.
Jupyter Notebook
1
star
60

ADAC-traffic

Offline reinforcement learning for traffic signal control
1
star