Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

Kotlin

C#

Zig

Erlang

Ruby

C

Objective-C

Nix

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Lua

Rust

Clojure

Scala

Python

C++

Ada

Jupyter Notebook

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇸🇳 Senegal

🇰🇾 Cayman Islands

🇨🇱 Chile

🇲🇼 Malawi

🇦🇮 Anguilla

🇵🇾 Paraguay

🇰🇿 Kazakhstan

🇸🇮 Slovenia

All Countries Compare Countries

AlexGidiotis/Document-Classifier-LSTM

Stars
169
Rank 224,453 (Top 5 %)
Language
Python
License
MIT License
Created over 7 years ago
Updated about 1 year ago

AlexGidiotis/Document-Classifier-LSTM

AlexGidiotis

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

A bidirectional LSTM with attention for multiclass/multilabel text classification.

Document-Classifier-LSTM

Recurrent Neural Networks for multilclass, multilabel classification of texts. The models that learn to tag samll texts with 169 different tags from arxiv.

In classifier.py is implemented a standard BLSTM network with attention.

In hatt_classifier.py you can find the implementation of Hierarchical Attention Networks for Document Classification.

The neural networks were built using Keras and Tensorflow.

The best performing model is the attention BLSTM that achieves a micro f-score of 0.67 on the test set.

The Hierarchical Attention Network achieves only 0.65 micro f-score.

I am using 500k paper abstracts from arxiv. In order to download your own data refer to the arxiv OAI api.

Pretrained word embeddings can be used. The embeddings can either be GloVe or Word2Vec. You can download the GoogleNews-vectors-negative300.bin or the GloVe embeddings.

Usage:

In order to train your own model you must prepare your data set using the data_prep.py script. The preprocessing converts to lower case, tokenizes and removes very short words. The preprocessed files and label files should be saved in a /data folder.
You can now run classifier.py or hatt_classifier.py to build and train the models.
The trained models are exported to json and the weights to h5 for later use.
You can use utils.visualize_attention to visualize the attention weights.

Requirements

Python
NLTK
NumPy
Pandas
SciPy
OpenCV
scikit-learn
Tensorflow
Keras

Run pip install -r requirements.txt to install the requirements.

Multimodal-Gesture-Recognition-with-LSTMs-and-CTC

An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.

Advanced-ML-techniques

This repo contains implementation of advanced ML techniques. Includes model ensembles, cost-sensitive learning and dealing with class imbalance.

DANCER-summ

Code for the paper "A Divide-and-Conquer Approach to the Summarization of Long Documents"

keras-CF-NADE

keras implementation of Neural Autoregressive Distribution Estimator for Collaborative Filtering

PMC-StructuredAbstracts-Dataset

Code to obtain the PMC-SA. A dataset for the summarization of scientific articles.

gpt-light

The easiest repo for building GPT applications.

predict_query_product_relevance

we try to predict a relevance score between a search term and a product

ml-algorithms-minimal

Pure python implementations of popular ML algorithms.

Jupyter Notebook

LSTM-Language-Models

Experiments with LSTM networks for language modeling.

Automatic_Gesture_Recognition

In this project we address the task of multimodal recognition of gestures. We are using skeletal, speech, rgb and depth streams to recognise sequences of gestures.

Bayesian-Summarization

Keras-tutorial-on-CNNs

Tutorial on how to implement a simple CNN for image classification and face detection.