Florian Boudin (@boudinfl)

Top repositories

1

pke

Python Keyphrase Extraction module
Python
1,551
star
2

ake-datasets

Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Shell
138
star
3

takahe

takahe is a multi-sentence compression module
Python
54
star
4

sume

Sume is an implementation of the concept-based ILP model for summarization.
Python
36
star
5

centrality_measures_ijcnlp13

Centrality Measures for Graph-Based Keyphrase Extraction
Python
13
star
6

taln-archives

TALN Archives is a digital archive of French research articles in Natural Language Processing
TeX
12
star
7

kea

A tokenizer for French
JavaScript
11
star
8

ir-using-kg

Keyphrase Generation for Scientific Document Retrieval
Python
11
star
9

acm-cr

ACM-CR: A Manually Annotated Test Collection for Citation Recommendation
TeX
8
star
10

hulth-2003-pre

Preprocessed Inspec keyphrase extraction benchmark dataset
Shell
8
star
11

duc-2001-pre

Preprocessed DUC 2001 keyphrase extraction benchmark dataset
7
star
12

semeval-2010-pre

Preprocessed SemEval-2010 benchmark dataset for keyphrase extraction
7
star
13

marujo-2012-pre

Preprocessed Marujo keyphrase extraction benchmark dataset
Shell
5
star
14

redefining-absent-keyphrases

Code and dataset for the paper "Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness"
Python
5
star
15

krapivin-2009-pre

Preprocessed Krapivin keyphrase extraction benchmark dataset
Python
4
star
16

lina-msc

LINA-msc is a dataset for evaluating Multi-sentence Compression in French.
3
star
17

kepy

kepy is a keyphrase extraction module in Python
Python
2
star
18

cross-language_IR

Un cours de deux heures sur la recherche d'information cross-lingue
TeX
2
star
19

wikinews-2013-pre

Preprocessed Wikinews Keyphrase benchmark dataset
Python
1
star
20

boudinfl.github.io

website
HTML
1
star
21

CLIREC

CLinical Information Retrieval Evaluation Collection
Jupyter Notebook
1
star
22

pke-benchmarking

Jupyter Notebook
1
star