  • Stars: 1
  • Language: Java
  • License: MIT License
  • Created about 2 years ago
  • Updated about 2 years ago

Repository Details

This repository contains the code and the data for our SPIRE'22 paper on unintended train-test leakage with neural retrieval models.

More Repositories

1

small-text

Active Learning for Text Classification in Python
Python
531
star
2

summary-explorer

Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.
CSS
43
star
3

ECIR-2015-and-SEMEVAL-2015

The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.
Java
37
star
4

summary-workbench

Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.
Python
31
star
5

ecir21-an-empirical-comparison-of-web-page-segmentation-algorithms

JavaScript
26
star
6

wasp

Java
25
star
7

archive-query-log

📜 The Archive Query Log.
Jupyter Notebook
22
star
8

acl22-identifying-the-human-values-behind-arguments

Machine Learning scripts for the identification of human values behind arguments.
Python
22
star
9

ir_axioms

↕️ Intuitive axiomatic retrieval experimentation.
Python
22
star
10

ACL-22

15
star
11

ACL-18

Java
15
star
12

webis-tldr-17-corpus

Code for constructing the TLDR corpus from the Reddit dataset
Python
15
star
13

cikm20-web-page-segmentation-revisited-evaluation-framework-and-dataset

Code for "Web Page Segmentation Revisited: Evaluation Framework and Dataset", accepted as a resources paper at CIKM 2020
HTML
13
star
14

mturk-manager

An alternative front end for Amazon Mechanical Turk
Vue
12
star
15

scidata22-stereo-scientific-text-reuse

Go
10
star
16

msmarco-llm-distillation

Python
10
star
17

lightning-ir

Python
10
star
18

DADT

Implementation of Disjoint Author-Document Topic Model
Python
9
star
19

set-encoder

Jupyter Notebook
9
star
20

ijcai24-manipulating-embeddings-stable-diffusion

Code for the paper "Manipulating Embeddings of Stable Diffusion Prompts".
Python
8
star
21

webis-de.github.io

The Webis Group Website.
HTML
8
star
22

corpus-viewer

Python
8
star
23

coling22-benchmark-for-causal-question-answering

Jupyter Notebook
8
star
24

unmasking

General-purpose Unmasking Framework
Python
8
star
25

ML4CD-21

Code repository for "BERTian Poetics: Constrained Composition with Masked LMs"
Jupyter Notebook
6
star
26

SIGIR-17

Java
6
star
27

scriptor

Plug-and-play reproducible web analysis.
JavaScript
6
star
28

ECIR-24

6
star
29

argmining-21-keypoint-analysis-sharedtask-code

The code for our submission to the key point analysis shared task (2021)
Jupyter Notebook
5
star
30

lecture.js

Lecture.js converts a script and slides to a spoken video presentation using advanced text-to-speech services
JavaScript
5
star
31

ARGMINING-17

The repository for the paper "Unit Segmentation of Argumentative Texts" (ArgMining 2017)
Python
5
star
32

waka

Construct and author knowledge graphs from text.
Python
5
star
33

ecir24-seo-spam-in-search-engines

Jupyter Notebook
5
star
34

wat

Web Annotation Tool
Java
4
star
35

acl22-revisiting-uncertainty-based-query-strategies-for-active-learning-with-transformers

Revisiting Uncertainty-based Query Strategies for Active Learning with Transformers
Python
4
star
36

acl22-clickbait-spoiling

Jupyter Notebook
4
star
37

eacl21-belief-based-claim-generation

Jupyter Notebook
4
star
38

NLPCSS-20

The repository of the NLPCSS 2020 paper
3
star
39

acl20-target-inference-in-conclusion-generation

Python
3
star
40

ACL-20

Central repository of all ACL'20 publications by the Webis group.
3
star
41

downloads

The downloads directory for the webis.de web page. History will be deleted irregularly.
Python
3
star
42

ecir22-anchor-text

Code and Data for the paper on anchor text for MS Marco.
Jupyter Notebook
3
star
43

webis-web-archiver

Source code and scripts for the Webis Web Archiver
Java
3
star
44

ACL-19

ACL 2019 Code and Data
Python
3
star
45

mastodon-search

πŸ•ΈοΈ A Corpus for Simulating Search on Mastodon.
Jupyter Notebook
3
star
46

webis-de-archive

Splash page for archive.webis.de
CSS
3
star
47

emnlp21-same-sentiment

EMNLP 2021 - Casting the Same Sentiment Classification Problem
Jupyter Notebook
3
star
48

SIGIR-19

Repository for the SIGIR'19 paper "Argument Search: Assessing Argument Relevance."
Jupyter Notebook
3
star
49

natural-language-processing-exercises

Python
3
star
50

EMNLP-23

3
star
51

IJCAI-21

Code for the paper "Bias Silhouette Analysis: Towards Assessing the Quality of Bias Metrics for Word Embedding Models".
Python
2
star
52

COLING-20

HTML
2
star
53

ICWSM-17

Java
2
star
54

acl21-counter-argument-generation-by-attacking-weak-premises

Jupyter Notebook
2
star
55

slidehub

Generic code for slidehub pages
HTML
2
star
56

aitools4-aq-web-page-content-extraction

Java
2
star
57

ECIR-23

Roff
2
star
58

acl21-ArgKG-argument-generation

2
star
59

ArgMining-20

2
star
60

aitools4-aq-geolocation

Java
2
star
61

pytorch-window-matmul

A custom CUDA kernel for windowed matrix multiplication
Python
2
star
62

COLING-22

2
star
63

QPP-23

Jupyter Notebook
2
star
64

argmining19-same-side-classification

The Benchmarking Workshop
Jupyter Notebook
2
star
65

ecir22-query-obfuscation-game

HTML
2
star
66

ACL-23

2
star
67

password-generation-rules

Java
2
star
68

argmining20-social-bias-argumentation

Code for the paper "Argument from Old Man’s View: Assessing Social Bias in Argumentation".
Python
2
star
69

sigir20-sampling-bias-due-to-near-duplicates-in-learning-to-rank

Sampling Bias Due to Near-Duplicates in Learning to Rank
Kotlin
2
star
70

ECIR-19

Python
2
star
71

authorship-threetrain

Implementation of the tri-training algorithm for authorship attribution described in a paper by Qian et al. 2014
Python
2
star
72

acl20-crawling-mailing-lists

Python
2
star
73

ecir24-sparse-cross-encoder

Code and models for the ECIR'24 paper 'Investigating the Effects of Sparse Attention on Cross-Encoders'
Jupyter Notebook
2
star
74

ecir24-simulating-follow-up-questions

Python
2
star
75

ICTIR-22

Repository for the paper "Sparse Pairwise Re-ranking with Pre-trained Transformers" published at ICTIR 2022.
Jupyter Notebook
2
star
76

emnlp21-same-stance

EMNLP 2021 - On Classifying whether Two Texts are on the Same Side of an Argument
Jupyter Notebook
2
star
77

acl22-moral-debater-a-study-on-the-computational-generation-of-morally-framed-arguments

Jupyter Notebook
2
star
78

SIGIR-18

The repository for the data in the SIGIR paper "A User Study on Snippet Generation: Text Reuse vs. Paraphrases"
Python
2
star
79

acl21-informative-conclusion-generation

Jupyter Notebook
1
star
80

in2writing22-language-models-as-context-sensitive-word-search-engines

Python
1
star
81

tpdl22-visual-web-archive-quality-assessment

Java
1
star
82

webis-de-assets

Generic Webis Website Assets
SCSS
1
star
83

ACL-21

1
star
84

semeval19-hyperpartisan-news-detection-article-cleaner

Code for cleaning the HTML of articles
Java
1
star
85

cikm20-ndcg-negative-relevance-judgements

Code for the CIKM20 Short Paper: "The Impact of Negative Relevance Judgments on NDCG"
Jupyter Notebook
1
star
86

argmining21-frame-identification

1
star
87

targer-api

πŸ—£οΈ Simple, type-safe access to the TARGER neural argument tagging APIs.
Python
1
star
88

acl19-heuristic-authorship-obfuscation

C++
1
star
89

EACL-23

EACL-23 Code and Data
1
star
90

eacl23-conclusion-based-counter-argument-generation

Jupyter Notebook
1
star
91

EMNLP-22

1
star
92

WWW-20

The repository of the Webconf paper "Abstractive Snippet Generation"
Python
1
star
93

AMOC-21

This repository documents our entry in the 2021 Amoc Hackathon
Jupyter Notebook
1
star
94

SAMESIDE-19

SameSideClassification Source Code (Fork ASV)
Jupyter Notebook
1
star
95

argmining20-rhetorical-devices

Java
1
star
96

koppel14

Tries to implement the algorithms in the paper 'Determining if two Documents are written by the same author' by Koppel and Winter from 2014
Jupyter Notebook
1
star
97

bea24-essay-feedback-generation

Python
1
star
98

naacl24-school-student-essay-corpus

Jupyter Notebook
1
star
99

RENEUIR-24

The participation of the FSU team at ReNeuIR 2024.
Jupyter Notebook
1
star
100

WWW-24

The repository of the WWW'2024 paper "Detecting Generated Native Ads in Conversational Search"
1
star