[ICCV 2021] Instance-level Image Retrieval using Reranking Transformers

Fuwen Tan, Jiangbo Yuan, Vicente Ordonez, ICCV 2021.

Abstract

Instance-level image retrieval is the task of searching a large database for images that match an object in a query image. To address this task, systems usually rely on a retrieval step that uses global image descriptors, followed by a step that performs domain-specific refinement or reranking by leveraging operations such as geometric verification based on local features. In this work, we propose Reranking Transformers (RRTs) as a general model that incorporates both local and global features to rerank the matching images in a supervised fashion, and thus replaces the relatively expensive process of geometric verification. RRTs are lightweight and can be easily parallelized, so that reranking a set of top matching results can be performed in a single forward pass. We perform extensive experiments on the Revisited Oxford and Paris datasets and the Google Landmarks v2 dataset, showing that RRTs outperform previous reranking approaches while using far fewer local descriptors. Moreover, we demonstrate that, unlike existing approaches, RRTs can be optimized jointly with the feature extractor, which can lead to feature representations tailored to downstream tasks and to further accuracy improvements.
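
As a rough illustration of the two-stage pipeline described above (retrieval with global descriptors, then reranking of only the top matches), here is a minimal sketch in plain Python. It is not the code from this repository: `retrieve_then_rerank`, `cosine`, and `pair_score` are hypothetical names, the descriptors are toy vectors, and `pair_score` is only a stand-in for whatever learned model scores a query/candidate pair.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve_then_rerank(
    query_global: Vector,
    db_globals: List[Vector],
    pair_score: Callable[[int], float],
    top_k: int,
) -> List[int]:
    """Return database indices: the top_k shortlist reordered by pair_score,
    followed by the remaining images in global-similarity order."""
    # Stage 1: rank the whole database by global-descriptor similarity.
    order = sorted(
        range(len(db_globals)),
        key=lambda i: cosine(query_global, db_globals[i]),
        reverse=True,
    )
    shortlist, rest = order[:top_k], order[top_k:]
    # Stage 2: rescore only the shortlist with the (hypothetical) pairwise scorer.
    reranked = sorted(shortlist, key=pair_score, reverse=True)
    return reranked + rest


# Toy data: three database images with 2-D global descriptors.
db = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
query = [1.0, 0.0]
# Made-up pairwise scores standing in for a learned reranker's output.
toy_scores = {0: 0.2, 1: 0.9, 2: 0.99}

ranking = retrieve_then_rerank(query, db, lambda i: toy_scores[i], top_k=2)
print(ranking)  # [1, 0, 2]
```

Only the shortlist is rescored, so image 2 stays last despite its high toy score. This is the usual trade-off of this design: the second stage is cheap because it sees few candidates, but it cannot recover matches the first stage missed.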

Software required

The code has only been tested on 64-bit Linux:

  conda create -n rrt python=3.6
  conda activate rrt
  pip install -r requirements.txt

Organization

To use the code for experiments on Google Landmarks v2, Revisited Oxford/Paris, please refer to the folder RRT_GLD.

To use the code for experiments on Stanford Online Products, please refer to the folder RRT_SOP.

To use the code for evaluating SuperGlue on Revisited Oxford/Paris and Stanford Online Products, please refer to the repo SuperGlue.

Citing

If you find our paper/code useful, please consider citing:

@inproceedings{fwtan-instance-2021,
    author = {Fuwen Tan and Jiangbo Yuan and Vicente Ordonez},
    title = {Instance-level Image Retrieval using Reranking Transformers},
    year = {2021},
    booktitle = {International Conference on Computer Vision (ICCV)}
}
