• Stars
    star
    110
  • Rank 316,770 (Top 7 %)
  • Language
    Python
  • License
    BSD 2-Clause "Sim...
  • Created over 1 year ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning

SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning

Muzhi Zhu1, ย  Hengtao Li1, ย  Hao Chen1, ย  Chengxiang Fan1, ย  Weian Mao2,1, ย  Chenchen Jing1, ย  Yifan Liu2, ย  Chunhua Shen1

1Zhejiang University, ย  2The University of Adelaide, ย 

News

  • [2023/07/14] Our work SegPrompt is accepted by Int. Conf. Computer Vision (ICCV) 2023! ๐ŸŽ‰๐ŸŽ‰๐ŸŽ‰
  • [2023/08/30] We release our new benchmark LVIS-OW.

Installation

Please follow the instructions in Mask2Former

Other requirements

pip install torchshow
pip install torch-scatter -f https://data.pyg.org/whl/torch-1.10.1+cu113.html
pip install lvis
pip install setuptools==59.5.0
pip install seaborn

LVIS-OW benchmark

Here we provide our proposed new benchmark LVIS-OW.

Dataset preparation

First prepare COCO and LVIS dataset, place them under $DETECTRON2_DATASETS following Detectron2

The dataset structure is as follows:

datasets/
  coco/
    annotations/
      instances_{train,val}2017.json
    {train,val}2017/
  lvis/
    lvis_v1_{train,val}.json

We reorganize the dataset and divide the categories into Known-Seen-Unseen to better evaluate the open-world model. The json files can be downloaded from here.

Or you can directly use the command to generate from the json file of COCO and LVIS.

bash tools/prepare_lvisow.sh 

After you successfully get lvis_v1_train_ow.json and lvis_v1_val_resplit_r.json, you can refer to here to register the training set and test set. Then you can use our benchmark for training and testing.

Evaluation on LVIS-OW

python tools/eval_lvis_ow.py --dt-json-file output/m2f_binary_lvis_ow/lvis_r/inference/lvis_instances_results.json

Acknowledgement

We thank the following repos for their great works:

Cite our Paper

If you found this project useful for your paper, please kindly cite our paper.

More Repositories

1

AdelaiDet

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Python
3,375
star
2

AdelaiDepth

This repo contains the projects: 'Virtual Normal', 'DiverseDepth', and '3D Scene Shape'. They aim to solve the monocular depth estimation, 3D scene reconstruction from single image problems.
Python
1,063
star
3

Matcher

[ICLR'24] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
Python
434
star
4

MovieDreamer

249
star
5

StyleDrop-PyTorch

This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.
Python
201
star
6

Poseur

[ECCV 2022] The official repo for the paper "Poseur: Direct Human Pose Regression with Transformers".
Python
179
star
7

AutoStory

144
star
8

GenPercept

GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
Python
122
star
9

DyCo3D

Python
121
star
10

FrozenRecon

[ICCV2023] ๐ŸงŠFrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models
Python
117
star
11

FreeCustom

[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Python
101
star
12

OIR

[ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"
Python
81
star
13

RGM

69
star
14

GeoBench

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.
Python
49
star
15

GenDeF

38
star
16

DiverGen

DiverGen (CVPR 2024) & BSGAL (ICML 2024)
Python
35
star
17

LoRAPrune

Python
30
star
18

FreeCompose

Jupyter Notebook
23
star
19

SINE

[NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples
Python
21
star
20

VFN

[ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".
Python
21
star
21

partially-labelled

Learning to segment multi-organ and tumorsfrom multiple partially labeled datasets
19
star
22

FADiff

[ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffolding
Python
19
star
23

Depth3D

Python
10
star
24

VLModel

Repo of HawkLlama.
Python
10
star
25

STORY

4
star
26

OIR-Diffusion

JavaScript
1
star