• Stars
    32
  • Rank 801,539 (Top 16 %)
  • Language
    Python
  • License
    MIT License
  • Created almost 5 years ago
  • Updated over 1 year ago

Repository Details

Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles"

More Repositories

1

awesome-document-similarity

A curated list of resources on document similarity measures (papers, tutorials, code, ...)
232
star
2

pytorch-bert-document-classification

Enriching BERT with Knowledge Graph Embedding for Document Classification (PyTorch)
Jupyter Notebook
158
star
3

scincl

Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)
Python
63
star
4

aspect-document-similarity

Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020
Jupyter Notebook
62
star
5

llm-datasets

A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.
Python
51
star
6

legal-document-similarity

Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal Literature Recommendations"
Jupyter Notebook
31
star
7

clp-transfer

Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
Python
29
star
8

aspect-document-embeddings

Code, dataset & models for the paper "Specialized Document Embeddings for Aspect-based Similarity of Research Papers" (#JCDL2022)
Jupyter Notebook
11
star
9

german-language-models

A collection of German GPT language models
10
star
10

awesome-contrastive-learning-for-nlp

A collection of papers about contrastive learning for natural language processing.
7
star
11

wikipedia-article-recommendations

Survey data and Python code for the ICADL 2021 paper "A Qualitative Evaluation of User Preference for Link-based vs. Text-based Recommendations of Wikipedia Articles"
Jupyter Notebook
5
star
12

getting-started

Dockerfile
3
star
13

covid-vaccination-appointment

Python
3
star
14

Leaflet.Sim

Leaflet.Sim is a framework for location-based simulations on Leaflet maps. It visualises events over time and moving markers, which can change their style.
JavaScript
2
star
15

emnlp2022-papers

Python
2
star
16

finetune-evaluation-harness

Python
2
star
17

CmdLineSlideShow

Command-line script that uses ImageMagick and FFmpeg to generate rich slide shows from a set of images, with transition effects and audio.
Shell
2
star
18

Wikipedia2Lucene

Import a Wikipedia XML Dump from HDFS to Lucene index or Elasticsearch and retrieve similar Wikipedia articles based on Lucene's MoreLikeThis query.
Java
1
star
19

kibana-reallybettermap

Multiple locations for Kibana's bettermap panel
JavaScript
1
star
20

news-visualization

News visualization with Elasticsearch and Kibana, including NER, sentiment analysis, and geolocations.
Java
1
star
21

data-sourcing

Python
1
star
22

turkish-lm-bias

Investigating Gender Bias in Turkish Language Models
Jupyter Notebook
1
star