gRefCOCO - Dataset for [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
🏠[Project page] 📄[GRES Arxiv] 📄[GREC Arxiv]
This repository contains information and tools for the gRefCOCO dataset, proposed by the CVPR2023 Highlight paper:
GRES: Generalized Referring Expression Segmentation
Chang Liu, Henghui Ding, Xudong Jiang
CVPR 2023 Highlight, Acceptance Rate 2.5%
gRefCOCO Dataset Download
⬇️ Get the gRefCOCO dataset from:
- ☁️ OneDrive
Usage
- Like RefCOCO, gRefCOCO also should be used together with images from the
train2014
of MS COCO. - An example of dataloader grefer.py is provided.
- We will update this repository with full API package and documentation soon. Please follow the usage in the baseline code for now.
Task 1 - GREC: Generalized Referring Expression Comprehension
-
The GREC evaluation metric code is here.
-
We provide code based on MDETR, its training and inference are as follows:
Training (Finetuning)
- Process grefcoco to coco format.
python scripts/fine-tuning/grefexp_coco_format.py --data_path xxx --out_path mdetr_annotations/ --coco_path xxx
- Training and download
pretrained_resnet101_checkpoint.pth
from MDETR
python -m torch.distributed.launch --nproc_per_node=2 --use_env main.py --dataset_config configs/grefcoco.json --batch_size 4 --load pretrained_resnet101_checkpoint.pth --ema --text_encoder_lr 1e-5 --lr 5e-5 --output-dir grefcoco
Inference
- Obtain
checkpoint.pth
after training or download trained model here ☁️ Google Drive - For test results, pass --test and --test_type test or testA or testB according to the dataset.
python -m torch.distributed.launch --nproc_per_node=2 --use_env main.py --dataset_config configs/grefcoco.json --batch_size 4 --resume grefcoco/checkpoint.pth --ema --eval
Task 2 - GRES: Generalized Referring Expression Segmentation
Please refer to ReLA for more details.
Acknowledgement
Our project is built upon refer and cocoapi. Many thanks to the authors for their great works!
BibTeX
Please consider to cite GRES/GREC if it helps your research.
@inproceedings{GRES,
title={{GRES}: Generalized Referring Expression Segmentation},
author={Liu, Chang and Ding, Henghui and Jiang, Xudong},
booktitle={CVPR},
year={2023}
}
@article{GREC,
title={{GREC}: Generalized Referring Expression Comprehension},
author={He, Shuting and Ding, Henghui and Liu, Chang and Jiang, Xudong},
journal={arXiv preprint arXiv:2308.16182},
year={2023}
}
We also recommend other highly related works:
@article{VLT,
title={{VLT}: Vision-language transformer and query generation for referring segmentation},
author={Ding, Henghui and Liu, Chang and Wang, Suchen and Jiang, Xudong},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
year={2023},
volume={45},
number={6},
publisher={IEEE}
}
@inproceedings{MeViS,
title={{MeViS}: A Large-scale Benchmark for Video Segmentation with Motion Expressions},
author={Ding, Henghui and Liu, Chang and He, Shuting and Jiang, Xudong and Loy, Chen Change},
booktitle={ICCV},
year={2023}
}