jazzsaxmafia/show_and_tell.tensorflow

Stars
290
Rank 142,981 (Top 3 %)
Language
Jupyter Notebook
License
BSD 2-Clause "Sim...
Created about 9 years ago
Updated about 8 years ago

jazzsaxmafia/show_and_tell.tensorflow

jazzsaxmafia

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Neural Caption Generator

Tensorflow implementation of "Show and Tell" http://arxiv.org/abs/1411.4555
Borrowed some code and ideas from Andrej Karpathy's NeuralTalk.
You need flickr30k data (images and annotations)

Code

make_flickr_dataset.py : Extracting feats of flickr30k images, and save them in './data/feats.npy'
model.py : TensorFlow Version

Usage

Flickr30k Dataset Download
Extract VGG Featues of Flicker30k images (make_flickr_dataset.py)
Train: run train() in model.py
Test: run test() or test_tf() in model.py
parameters: VGG FC7 feature of test image, trained model path
Once you download Tensorflow VGG Net (one of the links below), you don't need Caffe when testing.

Downloading data/trained model

Extraced FC7 data: download
This is used in train() function in model.py. You can skip feature extraction part by using this.
Pretrained model download
This is used in test() and test_tf() in model.py. If you do not have time for training, or if you just want to check out captioning, download and test the model.
Tensorflow VGG net download
This file is used in test_tf() in model.py
Along with the files above, you might want to download flickr30k annotation data from link

License

BSD license

show_attend_and_tell.tensorflow

Jupyter Notebook

Weakly_detector

Tensorflow implementation of "Learning Deep Features for Discriminative Localization"

Jupyter Notebook

video_to_sequence

Implementation of "Sequence to Sequence – Video to Text"

Inpainting

Implementation of "Context Encoders: Feature Learning by Inpainting"

dcgan_tensorflow

Tensorflow implementation of "UNSUPERVISED REPRESENTATION LEARNING WITH DEEP CONVOLUTIONAL GENERATIVE ADVERSARIAL NETWORKS"

m_CNN

Implementation of "Mutimodal Convolution Neural Networks for Matching Image and Sentence"

show_attend_and_tell

kaggle_AI_science_challenge

awesome-recruit-en

awesome-recruit

cnn_cats_dogs

Caffe와 Oxford pet dataset을 이용해 개 / 고양이를 분류한은 딥 러닝 예제입니다.

video_recognition

neuraltalk_theano

nummanip

news_comment_generation

image_web

portal_searcher

awesome-recruit-en.v2

alexnet

MajorGradeCompute

Learning_tools

jazzsax_web