Web IR / NLP Group @ NUS (@WING-NUS)

Top repositories

1

scisumm-corpus

Scientific Document Summarization Corpus and Annotations from the WING NUS group.
212
star
2

sequicity

Source code for the ACL 2018 paper entitled "Sequicity: Simplifying Task-oriented Dialogue Systems with Single Sequence-to-Sequence Architectures" by Wenqiang Lei et al.
Python
154
star
3

JD2Skills-BERT-XMLC

Code and Dataset for the Bhola et al. (2020) Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classification Framework
Python
52
star
4

SWING

The Summarizer from the Web IR / NLP Group (WING), hence SWING, is a modular, state-of-the-art automatic extractive text summarization system. It is used as the basis for summarization research at the National University of Singapore. It performs as one of the leading automatic summarization systems in the international TAC competition, getting high marks for the ROUGE evaluation measure
Ruby
39
star
5

cs6101

The Web IR / NLP Group (WING)'s public reading group at the National University of Singapore.
JavaScript
37
star
6

slsql

Code for the EMNLP 2020 paper "Re-examining the Role of Schema Linking in Text-to-SQL".
Python
26
star
7

SciAssist

Python
18
star
8

SSID

Student Submission Integrity Diagnosis
Java
18
star
9

Kairos

Kairos, combines a focused crawler and an information extraction engine, to convert a list of conference websites into a index filled with fields of metadata that correspond to individual papers. Using event date metadata extracted from the conference website, Kairos proactively harvests metadata about the individual papers soon after they are made public. We use a Maximum Entropy classifier to classify uniform resource locators (URLs) as scientific conference websites and use Conditional Random Fields (CRF) to extract individual paper metadata from such websites. The crawler is built on top of the popular open-source crawler Nutch.
Java
18
star
10

Prastava

100% Pure Ruby Recommendation System (CF/CBF/Hybrid)
Ruby
12
star
11

ELCo

The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024
Python
11
star
12

JavaRAP

JavaRAP is an implementation of the classic Resolution of Anaphora Procedure (RAP) given by Lappin and Leass (1994) . It resolves third person pronouns, lexical anaphors, and identifies pleonastic pronouns. The original purpose of the implementation is to provide anaphora resolution result to our TREC 2003 Q&A system.
9
star
13

ResearchTrends

Source code for the COLING 2018 paper entitled "Identifying Emergent Research Trends by Key Authors and Phrases" by Shenhao Jiang et al.
Python
8
star
14

RelatedWorkSummarizationDataset

Dataset for the paper: Cong Duy Vu Hoang and Min-Yen Kan (2010) Towards Automated Related Work Summarization. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China. pp. 427-435.
HTML
6
star
15

RAZ

Robust Argumentative Zoning - This is the home page for the argumentative zoning for raw text project, a collaboration between NUS and the University of Cambridge. This work follows Teufel's thesis to zone (label) sentences with six different rhetorical functions for scholarly discoure. The download comes with Teufel's original analysis and markup of 80 cmp-lg articles.
Perl
6
star
16

ChairVisE

To be edited.
Vue
5
star
17

ir-seminar

The Web IR / NLP Group (WING)'s IR Seminar at the National University of Singapore.
HTML
5
star
18

discoling

Source code for the AAAI 2018 paper entitled "Linguistic Properties Matter for Implicit Discourse Relation Recognition: Combining Semantic Interaction, Topic Continuity and Attribution" by Wenqiang Lei et al.
Python
5
star
19

texWordCount

A perl script to help count words in LaTeX. LPGL.
TeX
5
star
20

SciSWING

Scientific Document Summarizer from the Web IR / NLP Group (WING), NUS
Ruby
4
star
21

chatongpt

Chat on GPT public event - 18 April 2023
CSS
4
star
22

PyTorchCRF

A work-in-progress repository to develop a stand-alone lightweight CRF Layer in Pytorch
Python
4
star
23

nlp-seminar

The Web IR / NLP Group (WING)'s NLP Seminar at the National University of Singapore.
JavaScript
4
star
24

RSScrawler-1

Python
2
star
25

WING-LDA

LDA group project
Python
2
star
26

FCKeyphrase

SciVerse Application for document keyphrase extraction
C++
2
star
27

Word-News-Android

Word News Android client
JavaScript
2
star
28

WordNews

WordNews Chinese/English Language Learning system with Chrome Extension and Android backends.
Python
2
star
29

search-engine-wrapper

This package provides a Java wrapper framework for unifying programmatic access to search engines. A convenience class is also included for downloading the files at the URLs in the search engine results. This package contains an API as well as a command-line application.
Java
2
star
30

cubit

Google Scholar Analytics Package (Server backend and embeddable Javascript)
JavaScript
1
star
31

SSNLP-2019

Static Jekyll Website for the Singapore Symposium on Natural Language Processing.
CSS
1
star
32

ACL-Anthology-Codebase

Script and code for running the older version of the ACL Anthology
Perl
1
star
33

PDTB-scorer

Java
1
star
34

domadapter

Python
1
star
35

Elsevier-KP

Keyphrase Extraction (Base version for Elsevier; Elsevier-KP). Link below not working yet.
JavaScript
1
star
36

NeuralQuestionGeneration

WING-NUS (Pan Liangming's) Re-implementation of Serban et al's. 2016 ACL work "Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus"
1
star
37

DICOMER

DIscourse COherence Model for Evaluating Readability. DICOMER is a package for evaluating the coherence of text using discourse matrix representation, augmented with discourse hierarchy structure. Part of the deliverables from Lin et al.'s 2012 ACL paper entitled "Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation".
Ruby
1
star
38

TESLA-S

TESLA-S: Evaluating Summary Content. Adaptation of the popular TESLA evaluation metric for summarization content evaluation. Part of the deliverables from Lin et al.'s 2012 ACL paper entitled "Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation".
Java
1
star
39

ETD-Parsing

Repository for shared project for electronic theses and dissertation parsing collaboration between Virginia Tech, Old Dominion University and National University of Singapore.
Python
1
star