Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

OCaml

Crystal

CSS

Nix

Shell

Zig

R

CoffeeScript

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Elixir

Elm

Scala

Rust

Shell

Ruby

Perl

C

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇧🇼 Botswana

🇨🇿 Czechia

🇳🇵 Nepal

🇨🇴 Colombia

🇫🇮 Finland

🇨🇰 Cook Islands

🇷🇸 Serbia

🇸🇩 Sudan

All Countries Compare Countries

nlpyang/structured

Stars
130
Rank 277,575 (Top 6 %)
Language
Python
Created over 7 years ago
Updated almost 7 years ago

nlpyang

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

code for Learning Structured Text Representations

Learning Structured Text Representations

Code for the paper:

Learning Structured Text Representations
Yang Liu and Mirella Lapata, Accepted by TACL

Dependencies

This code is implemented with Tensorflow and the data preprocessing is with Gensim

Document Classification

Data

The pre-processed YELP 2013 data can be downloaded at https://drive.google.com/open?id=0BxGUKratNjbaZjFIR1MtbkdzZVU

Preprocessing

To preprocess the data, run

python prepare_data.py path-to-train path-to-dev path-to-test

This will generate a pickle file, the format for the input data can be found in the sample folder

Training

python cli.py --data_file path_to_pkl --rnn_cell lstm --batch_size 16 --dim_str 50 --dim_sem 75 --dim_output 5 --keep_prob 0.7 --opt Adagrad
--lr 0.05 --norm 1e-4 --gpu -1 --sent_attention max --doc_attention max --log_period 5000

This will train the Tree-Matrix structured attention model in the paper on the training-set and present results on the devset/testset

License

MIT

BertSum

Code for paper Fine-tune BERT for Extractive Summarization

Python

1,464

PreSumm

code for EMNLP 2019 paper Text Summarization with Pretrained Encoders

Python

1,280

geval

Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"

Python

240

hiersumm

Code for paper Hierarchical Transformers for Multi-Document Summarization in ACL2019

Python

229

SUMO

Code for paper Single Document Summarization as Tree Induction

Python

NoisySumm

Codes for NAACL 2021 paper 'Noisy Self-Knowledge Distillation for Text Summarization'

Python

nlpyang/structured

nlpyang

Reviews

Repository Details

Learning Structured Text Representations

Dependencies

Document Classification

Data

Preprocessing

Training

License

More Repositories