Sparse and structured attention mechanisms
Efficient implementations of structured, sparsity-inducing attention mechanisms: fusedmax, oscarmax, and sparsemax.
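For orientation, the paper linked below casts all three as regularized argmax operators over the probability simplex. The summary below uses the paper's notation, not necessarily this package's exact parameterization (in particular, the `alpha` argument in the usage example presumably plays the role of the structural weight lambda for fusedmax):

\Pi_\Omega(x) = \operatorname*{argmax}_{p \in \Delta^d} \; p^\top x \;-\; \gamma\,\Omega(p), \qquad \gamma > 0,

with
sparsemax: \Omega(p) = \tfrac{1}{2}\|p\|_2^2
fusedmax:  \Omega(p) = \tfrac{1}{2}\|p\|_2^2 + \lambda \sum_{i=1}^{d-1} |p_{i+1} - p_i|
oscarmax:  \Omega(p) = \tfrac{1}{2}\|p\|_2^2 + \lambda \sum_{i<j} \max(|p_i|, |p_j|)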
Note: if you are just looking for sparsemax, I recommend the implementation in the entmax package.
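As a quick, hedged illustration of that alternative (assuming entmax's functional interface, entmax.sparsemax; this is a separate package, not part of torchsparseattn):

# A minimal sketch using the entmax package, not torchsparseattn.
import torch
from entmax import sparsemax

logits = torch.tensor([[1.0, 2.1, 1.9]])
probs = sparsemax(logits, dim=-1)  # sparse probabilities along the last dimension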
Currently available for PyTorch >= 0.4.1. (For older PyTorch versions, use a previous release of this package.) Requires Python >= 2.7, Cython, NumPy, and SciPy.
Usage example:
In [1]: import torch
In [2]: import torchsparseattn
In [3]: a = torch.tensor([1, 2.1, 1.9], dtype=torch.double)
In [4]: lengths = torch.tensor([3])
In [5]: fusedmax = torchsparseattn.Fusedmax(alpha=.1)
In [6]: fusedmax(a, lengths)
Out[6]: tensor([0.0000, 0.5000, 0.5000], dtype=torch.float64)
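Oscarmax and sparsemax are meant to be used analogously. The sketch below assumes they follow the same interface as Fusedmax above (a constructor taking a regularization weight where applicable, then a call on the values and lengths); the Oscarmax `beta` keyword in particular is an assumption, not a documented signature.

import torch
import torchsparseattn

a = torch.tensor([1, 2.1, 1.9], dtype=torch.double)
lengths = torch.tensor([3])

# Assumed: Oscarmax takes an OSCAR penalty weight; Sparsemax has no structural penalty.
oscarmax = torchsparseattn.Oscarmax(beta=0.1)
print(oscarmax(a, lengths))

sparsemax = torchsparseattn.Sparsemax()
print(sparsemax(a, lengths))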
For details, check out our paper:
Vlad Niculae and Mathieu Blondel. A Regularized Framework for Sparse and Structured Neural Attention. In: Proceedings of NIPS, 2017. https://arxiv.org/abs/1705.07704
See also:
André F. T. Martins and Ramón Fernandez Astudillo. From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification. In: Proceedings of ICML, 2016. https://arxiv.org/abs/1602.02068
X. Zeng and M. Figueiredo. The ordered weighted L1 norm: Atomic formulation, dual norm, and projections. eprint, http://arxiv.org/abs/1409.4271