• Stars
    star
    112
  • Rank 310,502 (Top 7 %)
  • Language
    Jupyter Notebook
  • License
    Other
  • Created over 8 years ago
  • Updated about 8 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Code release for Hu et al. Natural Language Object Retrieval, in CVPR, 2016

Natural Language Object Retrieval

This repository contains the code for the following paper:

  • R. Hu, H. Xu, M. Rohrbach, J. Feng, K. Saenko, T. Darrell, Natural Language Object Retrieval, in Computer Vision and Pattern Recognition (CVPR), 2016 (PDF)
@article{hu2016natural,
  title={Natural Language Object Retrieval},
  author={Hu, Ronghang and Xu, Huazhe and Rohrbach, Marcus and Feng, Jiashi and Saenko, Kate and Darrell, Trevor},
  journal={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2016}
}

Project Page: http://ronghanghu.com/text_obj_retrieval

Installation

  1. Download this repository or clone with Git, and then cd into the root directory of the repository.
  2. Run ./external/download_caffe.sh to download the SCRC Caffe version for this experiment. It will be downloaded and unzipped into external/caffe-natural-language-object-retrieval. This version is modified from the Caffe LRCN implementation.
  3. Build the SCRC Caffe version in external/caffe-natural-language-object-retrieval, following the Caffe installation instruction. Remember to also build pycaffe.

SCRC demo

  1. Download the pretrained models with ./models/download_trained_models.sh.
  2. Run the SCRC demo in ./demo/retrieval_demo.ipynb with Jupyter Notebook (IPython Notebook).

Image

Train and evaluate SCRC model on ReferIt Dataset

  1. Download the ReferIt dataset: ./datasets/download_referit_dataset.sh.
  2. Download pre-extracted EdgeBox proposals: ./data/download_edgebox_proposals.sh.
  3. You may need to add the SRCR root directory to Python's module path: export PYTHONPATH=.:$PYTHONPATH.
  4. Preprocess the ReferIt dataset to generate metadata needed for training and evaluation: python ./exp-referit/preprocess_dataset.py.
  5. Cache the scene-level contextual features to disk: python ./exp-referit/cache_referit_context_features.py.
  6. Build training image lists and HDF5 batches: python ./exp-referit/cache_referit_training_batches.py.
  7. Initialize the model parameters and train with SGD: python ./exp-referit/initialize_weights_scrc_full.py && ./exp-referit/train_scrc_full_on_referit.sh.
  8. Evaluate the trained model: python ./exp-referit/test_scrc_on_referit.py.

Optionally, you may also train a SCRC version without contextual feature, using python ./exp-referit/initialize_weights_scrc_no_context.py && ./exp-referit/train_scrc_no_context_on_referit.sh.

Train and evaluate SCRC model on Kitchen Dataset

  1. Download the Kitchen dataset: ./datasets/download_kitchen_dataset.sh.
  2. You may need to add the SRCR root directory to Python's module path: export PYTHONPATH=.:$PYTHONPATH.
  3. Build training image lists and HDF5 batches: python exp-kitchen/cache_kitchen_training_batches.py.
  4. Train with SGD: ./exp-kitchen/train_scrc_kitchen.sh.
  5. Evaluate the trained model: python exp-kitchen/test_scrc_on_kitchen.py.

More Repositories

1

seg_every_thing

Code release for Hu et al., Learning to Segment Every Thing. in CVPR, 2018.
Python
423
star
2

n2nmn

Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017
SourcePawn
270
star
3

tensorflow_compact_bilinear_pooling

Compact Bilinear Pooling in TensorFlow
Python
141
star
4

speaker_follower

Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.
C++
124
star
5

lcgn

Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019
Python
90
star
6

text_objseg

Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016
Jupyter Notebook
86
star
7

snmn

Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018
Python
71
star
8

cmn

Code release for Hu et al. Modeling Relationships in Referential Expressions with Compositional Modular Networks. in CVPR, 2017
Python
67
star
9

gqa_single_hop_baseline

A simple but well-performing "single-hop" visual attention model for the GQA dataset
Python
19
star
10

vit_10b_fsdp_example

See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md
Python
18
star
11

moco_v3_tpu

Python
16
star
12

vqa-maskrcnn-benchmark-m4c

Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_feature.py
Python
12
star
13

visualnet_label

An Online Tool for Rigid Object Landmark Labeling
JavaScript
4
star
14

SanguoshaEX

Sanguosha EX: An Open Source PC Game Based on Popular Desktop Game "Sanguosha"
C++
3
star
15

ptxla_scaling_examples

A list of examples for model scaling in PyTorch/XLA
2
star
16

mhex_graph

Modified Hierarchy-Exclusion Graph (MHEX Graph)
MATLAB
1
star
17

detectron2_vitdet

Python
1
star