OlavHN/attention-over-attention

Stars
178
Rank 214,989 (Top 5 %)
Language
Python
Created about 8 years ago
Updated over 2 years ago


Implementation of Attention-over-Attention Neural Networks for Reading Comprehension (https://arxiv.org/abs/1607.04423) in TensorFlow

Attention over Attention

An implementation of the paper Attention-over-Attention Neural Networks for Reading Comprehension in TensorFlow.

Some context on my blog

In a cloze-style reading comprehension task, a word is removed from an article summary; the model then reads the article and tries to infer the missing word. This example works on the CNN news dataset.
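To make the task format concrete, here is a minimal sketch of how a single cloze example could be built from a summary sentence. The function name `make_cloze` and the `@placeholder` token are assumptions made for illustration; they are not taken from this repository's reader.py.

```python
def make_cloze(summary, answer, placeholder="@placeholder"):
    """Replace the first occurrence of `answer` in `summary` with a placeholder.

    Hypothetical helper for illustration only; tokenization is plain
    whitespace splitting, which is cruder than a real preprocessing pipeline.
    """
    tokens = summary.split()
    if answer not in tokens:
        raise ValueError("answer word does not occur in the summary")
    idx = tokens.index(answer)
    query_tokens = tokens[:idx] + [placeholder] + tokens[idx + 1:]
    # The model sees the query plus the full article and must recover `answer`.
    return " ".join(query_tokens), answer


query, answer = make_cloze("the senator visited paris on monday", "paris")
print(query)
print(answer)
```

The query keeps the rest of the summary intact, so the only missing information is the removed word, which has to be inferred from the article.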

With the same hyperparameters as reported in the paper, this implementation reached an accuracy of 74.3% on both the validation and test sets, compared with 73.1% and 74.4%, respectively, reported by the authors.

To train a new model: python model.py --training=True --name=my_model

To test accuracy: python model.py --training=False --name=my_model --epochs=1 --dropout_keep_prob=1

Note that the .tfrecords and model files are stored with Git LFS.

Raw data for use with reader.py to produce .tfrecords files was downloaded from http://cs.nyu.edu/~kcho/DMQA/

Interesting parts

Masked softmax implementation
Example of batched sparse tensors with correct mask handling
Example of pointer style attention
Test/validation split part of the tf-graph
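Two of the items above, the masked softmax and pointer-style attention, can be illustrated in a few lines. This is a plain-Python sketch for a single sequence, not the repository's TensorFlow code: the function names `masked_softmax` and `pointer_sum` are hypothetical, and the second function shows one common reading of pointer-style attention, in which attention weights are summed over every document position holding the same token.

```python
import math


def masked_softmax(scores, mask):
    """Softmax over `scores` that assigns zero probability where mask is 0.

    Padding positions are excluded from both the max and the normalizer,
    so they cannot influence the resulting distribution.
    """
    if len(scores) != len(mask):
        raise ValueError("scores and mask must have the same length")
    valid = [s for s, m in zip(scores, mask) if m]
    if not valid:
        # Fully masked row: return all zeros instead of dividing by zero.
        return [0.0] * len(scores)
    peak = max(valid)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) if m else 0.0 for s, m in zip(scores, mask)]
    total = sum(exps)
    return [e / total for e in exps]


def pointer_sum(doc_token_ids, attention):
    """Aggregate attention mass per distinct token id in the document."""
    totals = {}
    for token_id, weight in zip(doc_token_ids, attention):
        totals[token_id] = totals.get(token_id, 0.0) + weight
    return totals


# The last position is padding; its large score must not leak into the result.
attention = masked_softmax([2.0, 1.0, 0.5, 9.0], [1, 1, 1, 0])
candidates = pointer_sum([7, 3, 7, 0], attention)
prediction = max(candidates, key=candidates.get)
print(attention)
print(prediction)
```

Because the answer is chosen by pointing at document positions, the model can only predict words that actually appear in the article, which matches the cloze setup described earlier.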
