• Stars
    star
    101
  • Rank 338,166 (Top 7 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created about 6 years ago
  • Updated over 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Exploring the simple sentence similarity measurements using word embeddings

License Downloads

Simple Sentence Similarity

We provide a collection of simple unsupervised semantic textual similarity methods to calculate semantic similarity between two sentences.

References

If you find this code useful in your research, please consider citing:

@inproceedings{ranasinghe-etal-2019-enhancing,
    title = "Enhancing Unsupervised Sentence Similarity Methods with Deep Contextualised Word Representations",
    author = "Ranasinghe, Tharindu  and
      Orasan, Constantin  and
      Mitkov, Ruslan",
    booktitle = "Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019)",
    month = sep,
    year = "2019",
    address = "Varna, Bulgaria",
    publisher = "INCOMA Ltd.",
    url = "https://www.aclweb.org/anthology/R19-1115",
    doi = "10.26615/978-954-452-056-4_115",
    pages = "994--1003",
    abstract = "Calculating Semantic Textual Similarity (STS) plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction. All modern state of the art STS methods rely on word embeddings one way or another. The recently introduced contextualised word embeddings have proved more effective than standard word embeddings in many natural language processing tasks. This paper evaluates the impact of several contextualised word embeddings on unsupervised STS methods and compares it with the existing supervised/unsupervised STS methods for different datasets in different languages and different domains",
}
}

More Repositories

1

TransQuest

Transformer based translation quality estimation
Python
105
star
2

Siamese-Recurrent-Architectures

Usage of Siamese Recurrent Neural network architectures for semantic textual similarity
Jupyter Notebook
22
star
3

MUDES

Multilingual Detection of Offensive Spans
Python
7
star
4

DeepOffense

Python
5
star
5

HASOC-2019

Hate Speech and Offensive Content Identification in Indo-European Languages
Jupyter Notebook
5
star
6

MOLD

Marathi Offensive Language Dataset
Python
2
star
7

Thesis

Deep Learning based Semantic Textual Similarity Metric for Applications in Translation Technology
TeX
2
star
8

Germeval-Task-2

This repo is the work done for Germeval Task 2, 2019 — Shared Task on the Identification of Offensive Language by RGCL
Jupyter Notebook
2
star
9

HateSpans

Python
2
star
10

MUDES-UI

System Demonstration for MUDES
Python
2
star
11

Irony-Detection

This repo is the work done for IDAT 2019 Shared Task — Shared Task on detecting irony in Arabic tweets by RGCL
Python
2
star
12

Offenseval_2020

SemEval-2020 Task 12: OffensEval 2020: Identifying and Categorizing Offensive Language in Social Media
Jupyter Notebook
2
star
13

STS-Transformers

Transformer based Semantic Textual Similarity
Python
2
star
14

SemEval-2019-Task-12-Toponym-Resolution-in-Scientific-Papers

Jupyter Notebook
2
star
15

Authorship-Detection

Python
1
star
16

Biomedical-Semantic-Similarity-Estimation

Jupyter Notebook
1
star
17

DistilOffense

Small, fast and cheap offensive language identification models
Python
1
star
18

FT5

Python
1
star
19

Toponym-Resolution

Jupyter Notebook
1
star
20

MultiTransQuest

Python
1
star
21

Intelligent-Translation-Memories

Jupyter Notebook
1
star
22

Aggression-Identification

Code for RANLP 2019 paper: "Emoji Powered Capsule Network to Detect Type and Target of Offensive Posts in Social Media"
Jupyter Notebook
1
star
23

NeTTT-2024

Jupyter Notebook
1
star