• Stars
    star
    176
  • Rank 216,987 (Top 5 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created over 7 years ago
  • Updated almost 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Neural image captioning (NIC) implementation with Keras 2.

Neural Image Captioning (NIC)

Neural image captioning implementation with Keras based on Show and Tell.

alt tag

Instructions

To train from zero using the iapr2012 dataset:

  • Download IAPR2012 dataset from here
  • Move the downloaded file to the datasets/IAPR_2012/ directory
  • Untar the file:

tar xvf iaprtc12.tgz

Extract/download image features

To extract:

  • Edit the file train.py by changing the flag extract_image_features to True.

To download:

  • Download the image features:

  • Download the extracted image features from here

  • Move them do datasets/IAPR_2012/preprocessed_data/ directory

  • Start training by running the script

python3 train.py

Notes

  • Extracting the image features might take 1-2 hours in a GTX860M.
  • Training 50 epochs should give you reasonable results.
  • I will provide pre-trained models in COCO soon (hopefully)

More Repositories

1

face_classification

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.
Python
5,579
star
2

paz

Hierarchical perception library in Python for pose estimation, object detection, instance segmentation, keypoint estimation, face recognition, etc.
Python
613
star
3

STN.keras

Implementation of spatial transformer networks (STNs) in keras 2 with tensorflow as backend.
Jupyter Notebook
278
star
4

SSD-keras

SSD: Single Shot MultiBox Detector in keras.
Python
84
star
5

RBF-Network

Minimal implementation of a radial basis function network.
Jupyter Notebook
73
star
6

beauvoir

Domain randomization library in Python for object detection and classification using Blender.
Python
15
star
7

luvina

High-level Natural Language Processing (NLP) for Python.
Python
13
star
8

bayesian-inverse-graphics

Bayesian Inverse Graphics for Few-Shot Concept Learning
Python
8
star
9

tfp-workshop

Tensorflow probability workshop
Dockerfile
4
star
10

PyGPToolbox

PyGPToolbox - A python geometry processing toolbox
Python
3
star
11

altamira-data

Altamira's external data such as model weights and image-pose dictionaries.
3
star
12

self-organizing-map

Small numpy implementation of self-organizing maps (SOMs).
Jupyter Notebook
2
star
13

ROS_face_classification

Python
1
star
14

pathnet.keras

Implementation of "PathNet: Evolution Channels Gradient Descent in Super Neural Networks" in keras 2.
Python
1
star
15

cnn_benchmarks

CNN benchmarks for Keras models
Python
1
star
16

object_perception

Object perception tasks: detection, tracking and classification.
Python
1
star
17

neras

keras with numpy
Python
1
star
18

image_affine_transformer

A class to apply affine transformations to a batch of images using bilinear interpolation.
Jupyter Notebook
1
star
19

IDE_vim

Integrated development environment (IDE) in vim.
Vim Script
1
star