

[ICCV-2023] Universal Video Segmentation for VSS, VPS, and VIS

Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation

ICCV, 2023
Xiangtai Li · Haobo Yuan · Wenwei Zhang · Guangliang Cheng
Jiangmiao Pang · Chen Change Loy*

arXiv PDF Project Page


Universal Video Segmentation Model For VSS, VPS, and VIS


News !!

  • Update Swin Large Config For Tube-Link VPS on VIP-Seg.

  • Code released!!

[Paper] [CODE]

Features

$\color{#2F6EBA}{Universal\ Video\ Segmentation\ Model}$

  • A new framework that unifies online and near-online video segmentation methods.
  • A new unified solution for three video segmentation tasks: VSS, VIS, and VPS.
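
One way to picture how a single framework can cover both online and near-online settings is to treat the video as a sequence of short tubes (clips). The following is a minimal sketch under that assumption, not code from this repository; `split_into_tubes` and `tube_size` are hypothetical names introduced only for illustration.

```python
from typing import List


def split_into_tubes(num_frames: int, tube_size: int) -> List[List[int]]:
    """Group frame indices 0..num_frames-1 into consecutive, non-overlapping tubes.

    Hypothetical helper for illustration only; the last tube may be shorter
    when num_frames is not a multiple of tube_size.
    """
    if num_frames < 0 or tube_size < 1:
        raise ValueError("num_frames must be >= 0 and tube_size must be >= 1")
    return [
        list(range(start, min(start + tube_size, num_frames)))
        for start in range(0, num_frames, tube_size)
    ]


# tube_size=1 degenerates to frame-by-frame (online) processing.
online = split_into_tubes(num_frames=5, tube_size=1)

# tube_size>1 processes a short clip at a time (near-online).
near_online = split_into_tubes(num_frames=5, tube_size=2)

print(online)
print(near_online)
```

Under this view, the tube length is the only setting that separates the two regimes, which is why one model can serve both.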

$\color{#2F6EBA}{Explore\ the\ Cross-Tube\ Relation}$

  • The first video segmentation method that explores the cross-tube relation.
  • The proposed tube-wise matching performs better than frame-wise matching.
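
To make the idea of tube-wise matching concrete, the sketch below links instances across two adjacent tubes by comparing one embedding per instance per tube, rather than one per frame. It is an illustration under stated assumptions and may differ from what the repository does: the embeddings are plain lists of floats, both tubes are assumed to hold the same number of instances, similarity is cosine, and the best assignment is found by brute force (practical only for a handful of instances). `cosine` and `match_tubes` are hypothetical names.

```python
import math
from itertools import permutations
from typing import List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


def match_tubes(
    prev_embeds: Sequence[Sequence[float]],
    curr_embeds: Sequence[Sequence[float]],
) -> List[int]:
    """Return `assign` where prev instance i is linked to curr instance assign[i].

    Hypothetical helper: exhaustively searches all one-to-one assignments and
    keeps the one with the highest total similarity.
    """
    n = len(prev_embeds)
    if n != len(curr_embeds):
        raise ValueError("this sketch assumes equal instance counts per tube")
    sim = [[cosine(p, c) for c in curr_embeds] for p in prev_embeds]
    best = max(
        permutations(range(n)),
        key=lambda perm: sum(sim[i][perm[i]] for i in range(n)),
    )
    return list(best)


# Two instances per tube; the order of instances is swapped in the second tube.
prev_tube = [[1.0, 0.0], [0.0, 1.0]]
curr_tube = [[0.1, 0.9], [0.9, 0.2]]
assignment = match_tubes(prev_tube, curr_tube)
print(assignment)
```

Because each embedding summarises an instance over a whole tube, a single noisy frame has less influence on the link than it would in per-frame matching.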

$\color{#2F6EBA}{Strong\ Performance}$

  • Achieves strong performance on five VIS, VSS, and VPS datasets with one unified architecture.
  • Achieves even better performance than task-specific architectures.

Dataset

See Dataset.md

Install

See Install.md

Training, Evaluation, and Models

See Train.md

Visualization Results

[VIS] YouTube-VIS 2019

Demo

vis_demo_1

vis_demo_2

[VPS] VIP-Seg

Demo

vps_demo_1

vps_demo_2

[VSS] VSPW

Demo

vss_demo

[VPS] KITTI-STEP

Demo

vps_demo_3

Citation

If you find Tube-Link and its codebase useful for your research, please consider citing Tube-Link:

@inproceedings{li2023tube,
  title={Tube-link: A flexible cross tube baseline for universal video segmentation},
  author={Li, Xiangtai and Yuan, Haobo and Zhang, Wenwei and Cheng, Guangliang and Pang, Jiangmiao and Loy, Chen Change},
  booktitle={ICCV},
  year={2023}
}

@inproceedings{li2022videoknet,
  title={Video k-net: A simple, strong, and unified baseline for video segmentation},
  author={Li, Xiangtai and Zhang, Wenwei and Pang, Jiangmiao and Chen, Kai and Cheng, Guangliang and Tong, Yunhai and Loy, Chen Change},
  booktitle={CVPR},
  year={2022}
}

License

MIT

More Repositories

1

OMG-Seg

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
Python
1,272
star
2

Awesome-Segmentation-With-Transformer

[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey
684
star
3

OctaveConv_pytorch

Pytorch implementation of newly added convolution
Python
582
star
4

DecoupleSegNets

[ECCV-2020]: Improving Semantic Segmentation via Decoupled Body and Edge Supervision
Python
370
star
5

SFSegNets

[ECCV-2020-oral]-Semantic Flow for Fast and Accurate Scene Parsing
Python
368
star
6

GALD-DGCNet

Source code and model GALD net (BMVC-2019) and Dual-Seg Net (BMVC-2019)
Python
343
star
7

Fast_Seg

This repo provides ⚡ fast ⚡ semantic segmentation models on CityScapes/Camvid DataSet by Pytorch
Python
208
star
8

CAE

This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"
Python
192
star
9

Video-K-Net

[CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
Python
149
star
10

PFSegNets

PointFlow (CVPR-2021)
Python
121
star
11

dfn_seg

Implementation of Paper Learning a Discriminative Feature Network for Semantic Segmentation (CVPR2018)(face++)
Python
70
star
12

BSSeg

BoundarySqueeze: Image Segmentation as Boundary Squeezing
Python
55
star
13

Panoptic-PartFormer

[ECCV-2022] The First Unified End-to-End System for Panoptic Part Segmentation
Python
53
star
14

fuse_seg_pytorch

Pytorch Implementation of Paper: Enhancing Feature Fusion for Semantic Segmentation (face++)
Python
43
star
15

AI_challenger_Chinese_Caption

Repository for image caption for Chinese
Jupyter Notebook
25
star
16

TemporalPyramidRouting

Temporal Pyramid Routing For Video Instance Segmentation-T-PAMI-2022
Python
25
star
17

QueryPanSeg

Query Learning of Both Thing and Stuff for Panoptic Segmentation-ICIP-2022
15
star
18

deepLearning.ai.solution

This repository contains the implementation of deep learning courses by Andrew Ng on Coursera
Jupyter Notebook
13
star
19

netwarp_test

Semantic Video CNNs through Representation Warping. ICCV 2017
Python
5
star
20

CompactSecondOrderNet

3
star
21

BasicAlgorithm-PAT-LeetCode-LintCode-

This repository is used to record the study of algorithm.
C++
1
star
22

cinema_java_software_engineering

This repository contains a simple Cinema System. This is the project of 3rd Software Engineering
Java
1
star
23

Pytorch-Cifar-models

This repository contains some famous CNN models that can run on the cifar-10 dataset
Python
1
star
24

MobileNet2-pytorch

This repository contains a MobileNet implementation in PyTorch
Python
1
star
25

Adaboost-byhand

This repository contains the basic, multi-boosting, and basic bagging implementations
Python
1
star