QCRI (@qcri)

Top repositories

1

Arabesque

Scalable Graph Mining
Java
61
star
2

LLMeBench

Benchmarking Large Language Models
Python
61
star
3

dialectID

Automatic Dialect Detection Repository
Jupyter Notebook
38
star
4

ArabicASRChallenge2016

This repository
Java
31
star
5

DeepBlocker

Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space Exploration"
Python
27
star
6

FarasaSegmenter

Java
21
star
7

sleep_awake_benchmark

This code is part of the paper ''A Large Scale Benchmark to Validate Sleep-Wake Scoring Algorithms'' currently under review.
Jupyter Notebook
17
star
8

dialectal_arabic_resources

Shell
16
star
9

RDFframes

A python API for exposing and processing RDF data from sparql endpoints for data mining and machine learning models in convenient formats like Pandas dataframes.
Python
15
star
10

tasrif

Tasrif is a python library for processing of wearable data from fitness trackers and wearable health devices
Python
15
star
11

dialectal_arabic_tools

Roff
14
star
12

PDNS-Net

Passive DNS Dataset of Domain Resolutions
Jupyter Notebook
13
star
13

Arabic_speech_code_switching

The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguistic and the acoustic cues. This dataset is a potential benchmark for DCS in spontaneous speech.
13
star
14

dialectal_arabic_segmenter

Arabic Dialects Segmenter Using Keras/BiLSTM/ChainCRF
Python
10
star
15

ArabicSpellChecker

Java
10
star
16

QARiB

Python
9
star
17

QLFactChecking

Python
8
star
18

e-wer

Word Error Rate Estimation
Python
8
star
19

data_civilizer_system

JavaScript
8
star
20

dialectal_arabic_pos_tagger

Python
7
star
21

coclean

CoClean: Collaborative Data Cleaning
Jupyter Notebook
7
star
22

deepemotion

Arabic emotion recognition using deep neural networks
Python
7
star
23

CMDL

Cross-Modal Data Discovery over Structured and Unstructured Data Lakes
Jupyter Notebook
6
star
24

multiRefWER

Python
5
star
25

datacivilizer2

The Data Civilizer end-to-end data preparation system - 2.0
JavaScript
5
star
26

Kunafa

A python script using perf and PMU to monitor memory bandwidth, cache, and other performance metrics
Python
5
star
27

EmbLookup

Repository containing implementation for the ICDE 2022 paper "Accelerating Entity Lookups in Knowledge Graphs Through Embeddings"
Python
4
star
28

QADI

QCRI Arabic Dialect Identification
4
star
29

EvaluationMetrics

Code to evaluate cqa runs in the ECML 2016 challenge on reranking questions in community question answering
Java
4
star
30

gym-gharrafa

OpenAI Gym environment used in the KDD2019 Paper "Time Critic Policy Gradient Methods for Traffic Signal Control in Complex and Congested Scenarios"
Jupyter Notebook
4
star
31

qcri-svm-segmenter

Segmenter of dialectal Arabic developed by QCRI, ALT team and published in EACL 2017 and CONLL 2017
PLSQL
3
star
32

DiplomaticPulse

Python
3
star
33

WikiQAar

Cross-language English-Arabic corpus derived from WikiQA
3
star
34

RetClean

AI for Data Preparation
Python
3
star
35

alt_public

ALT research group publications
TeX
3
star
36

alt-hackathon-docs

QCRI Speech Recognition and Machine Translation API's for Hackathons
JavaScript
3
star
37

PropagandaTechniquesAnalysisBERT

Python
3
star
38

dial-diac

A System for Diacritizing Four Varieties of Arabic
CSS
2
star
39

cmuqhack2017

repository for cmu-q hackathon 2017
Python
2
star
40

gpsmap

Real time map creation and update using gps data
Python
2
star
41

tacotron2

Jupyter Notebook
2
star
42

QLN-LiveNewsDemo

QCRI Live News Demonstration
PHP
2
star
43

apihub

serve and publish API
Python
2
star
44

COVID19-MAL-Blacklist

COVID19 Themed Domain Blacklist
2
star
45

dbcopier

Ruby tool to copy a database of any type to a database of any other type
Ruby
2
star
46

dockerized_moses_server

Dockerfile
2
star
47

ArabicSpeechTextProcessing

Processing Dialectal Arabic Speech Transcription
Jupyter Notebook
2
star
48

QCAI-TransportaionGroup-TrImpute

Python
2
star
49

deepdialect

Deep Arabic Dialect Detection
Python
1
star
50

PHD_Datasets

PHD Data collect by Matheus Araujo
1
star
51

qcri-demo-icassp-2018-client

The client side of QCRI demo at ICASSP 2018
CSS
1
star
52

RDF-generator

Converting structured and semi-structured data into RDF graphs.
Python
1
star
53

QCAI-TransportaionGroup-GTI

GTI: Graph Trajectory Imputation
1
star
54

flight2vec

Boeing flight2vec project
HTML
1
star
55

Zagel

Chatting app based on the crosscloud platform
CSS
1
star
56

Arabesque-Skeleton

Skeleton template for new projects on top of Arabesque
Java
1
star
57

TrafQ

TrafQ is a collection of Open AI environments and baseline algorithms for Traffic Light control optimization with Reinforcement Learning. Environments simulate real road networks and traffic.
Jupyter Notebook
1
star
58

ADAC-traffic

Offline reinforcement learning for traffic signal control
1
star
59

compromised

Detecting Compromised and Attack domains
Jupyter Notebook
1
star