Web IR / NLP Group @ NUS (@WING-NUS)

Top repositories

1

scisumm-corpus

Scientific Document Summarization Corpus and Annotations from the WING NUS group.
210
star
2

sequicity

Source code for the ACL 2018 paper entitled "Sequicity: Simplifying Task-oriented Dialogue Systems with Single Sequence-to-Sequence Architectures" by Wenqiang Lei et al.
Python
155
star
3

JD2Skills-BERT-XMLC

Code and dataset for Bhola et al. (2020), "Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classification Framework".
Python
49
star
4

SWING

The Summarizer from the Web IR / NLP Group (WING), hence SWING, is a modular, state-of-the-art automatic extractive text summarization system. It is used as the basis for summarization research at the National University of Singapore. It ranks among the leading automatic summarization systems in the international TAC competition, scoring highly on the ROUGE evaluation measure.
Ruby
39
star
5

cs6101

The Web IR / NLP Group (WING)'s public reading group at the National University of Singapore.
JavaScript
36
star
6

slsql

Code for the EMNLP 2020 paper "Re-examining the Role of Schema Linking in Text-to-SQL".
Python
25
star
7

SSID

Student Submission Integrity Diagnosis
Java
18
star
8

Kairos

Kairos combines a focused crawler and an information extraction engine to convert a list of conference websites into an index filled with metadata fields that correspond to individual papers. Using event date metadata extracted from the conference website, Kairos proactively harvests metadata about the individual papers soon after they are made public. We use a Maximum Entropy classifier to classify uniform resource locators (URLs) as scientific conference websites and use Conditional Random Fields (CRF) to extract individual paper metadata from such websites. The crawler is built on top of the popular open-source crawler Nutch.
Java
18
star
9

SciAssist

Python
17
star
10

Prastava

100% Pure Ruby Recommendation System (CF/CBF/Hybrid)
Ruby
12
star
11

JavaRAP

JavaRAP is an implementation of the classic Resolution of Anaphora Procedure (RAP) given by Lappin and Leass (1994). It resolves third-person pronouns and lexical anaphors, and identifies pleonastic pronouns. The original purpose of the implementation was to provide anaphora resolution results to our TREC 2003 Q&A system.
9
star
12

ResearchTrends

Source code for the COLING 2018 paper entitled "Identifying Emergent Research Trends by Key Authors and Phrases" by Shenhao Jiang et al.
Python
8
star
13

RelatedWorkSummarizationDataset

Dataset for the paper: Cong Duy Vu Hoang and Min-Yen Kan (2010) Towards Automated Related Work Summarization. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China. pp. 427-435.
HTML
6
star
14

RAZ

Robust Argumentative Zoning - This is the home page for the argumentative zoning for raw text project, a collaboration between NUS and the University of Cambridge. This work follows Teufel's thesis to zone (label) sentences with six different rhetorical functions for scholarly discourse. The download comes with Teufel's original analysis and markup of 80 cmp-lg articles.
Perl
6
star
15

ChairVisE

To be edited.
Vue
5
star
16

ir-seminar

The Web IR / NLP Group (WING)'s IR Seminar at the National University of Singapore.
HTML
5
star
17

discoling

Source code for the AAAI 2018 paper entitled "Linguistic Properties Matter for Implicit Discourse Relation Recognition: Combining Semantic Interaction, Topic Continuity and Attribution" by Wenqiang Lei et al.
Python
5
star
18

texWordCount

A Perl script to help count words in LaTeX. LGPL.
TeX
5
star
19

SciSWING

Scientific Document Summarizer from the Web IR / NLP Group (WING), NUS
Ruby
4
star
20

chatongpt

Chat on GPT public event - 18 April 2023
CSS
4
star
21

nlp-seminar

The Web IR / NLP Group (WING)'s NLP Seminar at the National University of Singapore.
JavaScript
4
star
22

PyTorchCRF

A work-in-progress repository to develop a stand-alone lightweight CRF layer in PyTorch.
Python
4
star
23

RSScrawler-1

Python
2
star
24

WING-LDA

LDA group project
Python
2
star
25

FCKeyphrase

SciVerse Application for document keyphrase extraction
C++
2
star
26

Word-News-Android

Word News Android client
JavaScript
2
star
27

WordNews

WordNews Chinese/English Language Learning system with Chrome Extension and Android backends.
Python
2
star
28

search-engine-wrapper

This package provides a Java wrapper framework for unifying programmatic access to search engines. A convenience class is also included for downloading the files at the URLs in the search engine results. This package contains an API as well as a command-line application.
Java
2
star
29

SSNLP-2019

Static Jekyll Website for the Singapore Symposium on Natural Language Processing.
CSS
1
star
30

cubit

Google Scholar Analytics Package (server backend and embeddable JavaScript)
JavaScript
1
star
31

ACL-Anthology-Codebase

Script and code for running the older version of the ACL Anthology
Perl
1
star
32

PDTB-scorer

Java
1
star
33

domadapter

Python
1
star
34

Elsevier-KP

Keyphrase Extraction (Base version for Elsevier; Elsevier-KP). Link below not working yet.
JavaScript
1
star
35

NeuralQuestionGeneration

WING-NUS (Pan Liangming's) re-implementation of Serban et al.'s 2016 ACL work "Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus"
1
star
36

DICOMER

DIscourse COherence Model for Evaluating Readability. DICOMER is a package for evaluating the coherence of text using discourse matrix representation, augmented with discourse hierarchy structure. Part of the deliverables from Lin et al.'s 2012 ACL paper entitled "Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation".
Ruby
1
star
37

TESLA-S

TESLA-S: Evaluating Summary Content. Adaptation of the popular TESLA evaluation metric for summarization content evaluation. Part of the deliverables from Lin et al.'s 2012 ACL paper entitled "Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation".
Java
1
star
38

ETD-Parsing

Repository for a shared project on electronic thesis and dissertation parsing, a collaboration between Virginia Tech, Old Dominion University, and the National University of Singapore.
Python
1
star
39

ELCo

1
star