• Stars
    star
    30
  • Rank 839,658 (Top 17 %)
  • Language
    Python
  • License
    BSD 3-Clause "New...
  • Created over 3 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space Exploration"

More Repositories

1

LLMeBench

Benchmarking Large Language Models
Python
79
star
2

Arabesque

Scalable Graph Mining
Java
61
star
3

dialectID

Automatic Dialect Detection Repository
Jupyter Notebook
39
star
4

ArabicASRChallenge2016

This repository
Java
30
star
5

FarasaSegmenter

Java
22
star
6

sleep_awake_benchmark

This code is part of the paper ''A Large Scale Benchmark to Validate Sleep-Wake Scoring Algorithms'' currently under review.
Jupyter Notebook
20
star
7

dialectal_arabic_resources

Shell
16
star
8

PDNS-Net

Passive DNS Dataset of Domain Resolutions
Jupyter Notebook
16
star
9

RDFframes

A python API for exposing and processing RDF data from sparql endpoints for data mining and machine learning models in convenient formats like Pandas dataframes.
Python
15
star
10

dialectal_arabic_tools

Roff
14
star
11

tasrif

Tasrif is a python library for processing of wearable data from fitness trackers and wearable health devices
Python
14
star
12

Arabic_speech_code_switching

The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguistic and the acoustic cues. This dataset is a potential benchmark for DCS in spontaneous speech.
14
star
13

ArabicSpellChecker

Java
11
star
14

dialectal_arabic_segmenter

Arabic Dialects Segmenter Using Keras/BiLSTM/ChainCRF
Python
10
star
15

e-wer

Word Error Rate Estimation
Python
10
star
16

QARiB

Python
9
star
17

QLFactChecking

Python
8
star
18

data_civilizer_system

JavaScript
8
star
19

dialectal_arabic_pos_tagger

Python
7
star
20

coclean

CoClean: Collaborative Data Cleaning
Jupyter Notebook
7
star
21

deepemotion

Arabic emotion recognition using deep neural networks
Python
7
star
22

CMDL

Cross-Modal Data Discovery over Structured and Unstructured Data Lakes
Jupyter Notebook
6
star
23

multiRefWER

Python
5
star
24

datacivilizer2

The Data Civilizer end-to-end data preparation system - 2.0
JavaScript
5
star
25

Kunafa

A python script using perf and PMU to monitor memory bandwidth, cache, and other performance metrics
Python
5
star
26

QADI

QCRI Arabic Dialect Identification
5
star
27

RetClean

AI for Data Preparation
Python
5
star
28

EmbLookup

Repository containing implementation for the ICDE 2022 paper "Accelerating Entity Lookups in Knowledge Graphs Through Embeddings"
Python
4
star
29

EvaluationMetrics

Code to evaluate cqa runs in the ECML 2016 challenge on reranking questions in community question answering
Java
4
star
30

gym-gharrafa

OpenAI Gym environment used in the KDD2019 Paper "Time Critic Policy Gradient Methods for Traffic Signal Control in Complex and Congested Scenarios"
Jupyter Notebook
4
star
31

QCAI-TransportaionGroup-TrImpute

Python
4
star
32

qcri-svm-segmenter

Segmenter of dialectal Arabic developed by QCRI, ALT team and published in EACL 2017 and CONLL 2017
PLSQL
3
star
33

DiplomaticPulse

Python
3
star
34

WikiQAar

Cross-language English-Arabic corpus derived from WikiQA
3
star
35

alt_public

ALT research group publications
TeX
3
star
36

alt-hackathon-docs

QCRI Speech Recognition and Machine Translation API's for Hackathons
JavaScript
3
star
37

PropagandaTechniquesAnalysisBERT

Python
3
star
38

Text2TTP

A Tool for Semantic Ranking for Automated Adversarial Technique Annotation in Security Text
Jupyter Notebook
3
star
39

dial-diac

A System for Diacritizing Four Varieties of Arabic
CSS
2
star
40

cmuqhack2017

repository for cmu-q hackathon 2017
Python
2
star
41

gpsmap

Real time map creation and update using gps data
Python
2
star
42

tacotron2

Jupyter Notebook
2
star
43

QLN-LiveNewsDemo

QCRI Live News Demonstration
PHP
2
star
44

dbcopier

Ruby tool to copy a database of any type to a database of any other type
Ruby
2
star
45

COVID19-MAL-Blacklist

COVID19 Themed Domain Blacklist
2
star
46

dockerized_moses_server

Dockerfile
2
star
47

ArabicSpeechTextProcessing

Processing Dialectal Arabic Speech Transcription
Jupyter Notebook
2
star
48

apihub

serve and publish API
Python
2
star
49

QCAI-TransportaionGroup-GTI

GTI: Graph Trajectory Imputation
2
star
50

multilingual-latent-concepts

Code associated with the ACL24 paper titled, "Exploring Alignment in Shared Cross-Lingual Spaces"
Jupyter Notebook
2
star
51

compromised

Detecting Compromised and Attack domains
Jupyter Notebook
2
star
52

deepdialect

Deep Arabic Dialect Detection
Python
1
star
53

PHD_Datasets

PHD Data collect by Matheus Araujo
1
star
54

qcri-demo-icassp-2018-client

The client side of QCRI demo at ICASSP 2018
CSS
1
star
55

RDF-generator

Converting structured and semi-structured data into RDF graphs.
Python
1
star
56

flight2vec

Boeing flight2vec project
HTML
1
star
57

Zagel

Chatting app based on the crosscloud platform
CSS
1
star
58

Arabesque-Skeleton

Skeleton template for new projects on top of Arabesque
Java
1
star
59

TrafQ

TrafQ is a collection of Open AI environments and baseline algorithms for Traffic Light control optimization with Reinforcement Learning. Environments simulate real road networks and traffic.
Jupyter Notebook
1
star
60

ADAC-traffic

Offline reinforcement learning for traffic signal control
1
star