  • Stars: 266
  • Rank 153,278 (Top 4 %)
  • Language
  • Created almost 3 years ago
  • Updated 2 months ago

Repository Details

[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models

This GitHub repository will be continuously updated for the survey paper:

Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey, Xiao Wang, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, Yaowei Wang, Yonghong Tian, Wen Gao. [Paper]


Framework of this survey

Reviews and Surveys

Please check this file [Surveys.md]

Datasets

Please check this file [Datasets.md]

Publications

Please check this file [paperList.md]

Experimental Analysis

📃 BibTeX:

If you find this survey useful for your research, please cite the following paper:

@article{wang2022MMPTMSurvey,
  title={Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey},
  author={Wang, Xiao and Chen, Guangyao and Qian, Guangwu and Gao, Pengcheng and Wei, Xiao-Yong and Wang, Yaowei and Tian, Yonghong and Gao, Wen},
  url={https://github.com/wangxiao5791509/MultiModal_BigModels_Survey},
  year={2022}
}

If you have any questions about this survey, please email me via: [email protected] or [email protected]

More Repositories

1. Pedestrian-Attribute-Recognition-Paper-List (681 stars)
   [PR-2021-Survey] Paper list on Pedestrian Attribute Recognition (PAR) and related tasks (Pattern Recognition 2021)

2. Single_Object_Tracking_Paper_List (375 stars)
   Paper list for single object tracking (state-of-the-art SOT trackers)

3. Cloth_Change_Person_reID_Paper_List (112 stars)
   Paper collection for cloth-variation-based person re-identification

4. VisEvent_SOT_Benchmark (Python, 109 stars)
   [IEEE TCYB 2023] The first large-scale tracking dataset by fusing RGB and Event cameras.

5. SNN_CV_Applications_Resources (103 stars)
   Paper list for SNN-based computer vision tasks.

6. Person-Search-Paper-List (69 stars)
   The paper list for person search

7. RGB-Thermal-Tracking-Paper-List (40 stars)
   Paper collection of RGB-infrared tracking algorithms.

8. TNL2K_evaluation_toolkit (MATLAB, 39 stars)
   Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark (CVPR 2021)

9. RGB-DVS-SOT-Baselines (22 stars)

10. MFG_RGBT_Tracking_PyTorch (Python, 15 stars)
    Official implementation of MFG-RGBT-Tracking with PyTorch, IEEE TMM 2022

11. Tracking-with-Deep-Reinforcement-Learning (14 stars)

12. LGSearch_DDGAN_PyTorch (Python, 11 stars)
    Tracking by Joint Local and Global Search: A Target-aware Attention based Approach (IEEE TNNLS 2021)

13. DeepMTA_PyTorch (Python, 10 stars)
    Official implementation of deep-multi-trajectory-based single object tracking (IEEE T-CSVT 2021).

14. cmSalGAN_PyTorch (MATLAB, 10 stars)
    Official implementation of "cmSalGAN: RGB-D Salient Object Detection with Cross-View Generative Adversarial Networks" (IEEE TMM 2020)

15. Age-Progression-Regression-by-CAAE (Python, 10 stars)
    Repaired code for the paper "Age Progression/Regression by Conditional Adversarial Autoencoder" (CVPR 2017)

16. Dynamic-Memory-Network-Paper-List (9 stars)
    The paper list for dynamic memory networks

17. Spiking-Neural-Network-Paper-List (7 stars)
    Paper list for Spiking Neural Networks (SNN)

18. PET_Principles_and_Applications (3 stars)
    Principles and Applications for Positron Emission Tomography (PET)

19. SiamDW_tracker_revised (Python, 3 stars)

20. EVclouds_Classification_WACV2019 (Python, 3 stars)
    Recognize gestures for DVS sensors using the Event Cloud method

21. RoI_Pooling_PyTorch (Python, 2 stars)
    A modified version of the RoI Pooling function for proposal feature extraction.

22. RAM-LPM-PyTorch (Jupyter Notebook, 1 star)
    Recurrent Attention Model with Log-Polar Mapping (RAM-LPM)

23. MOT-Paper-List (1 star)
    The paper list for Multi-Object-Tracking

24. STADB_ReID (Python, 1 star)
    Official implementation of "STADB: A Self-Thresholding Attention Guided ADB Network for Person Re-identification"

25. MultiModality_Spatial_registration_matlab (MATLAB, 1 star)
    A toolkit for spatial registration of dual-modal data such as RGB and thermal images

26. wangxiao5791509.github.io (HTML, 1 star)