• Stars
    star
    486
  • Rank 90,527 (Top 2 %)
  • Language
    Python
  • License
    MIT License
  • Created about 3 years ago
  • Updated almost 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"

MobileViT

Overview

This is a PyTorch implementation of MobileViT specified in "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer", arXiv 2021.

img

👉 Check out CoAtNet if you are interested in other Convolution + Transformer models.

Usage

import torch
from mobilevit import mobilevit_xxs

img = torch.randn(1, 3, 256, 256)
vit = mobilevit_xxs()
out = vit(img)

Citation

@article{mehta2021mobilevit,
  title={MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer},
  author={Mehta, Sachin and Rastegari, Mohammad},
  journal={arXiv preprint arXiv:2110.02178},
  year={2021}
}

Credits

Code adapted from MobileNetV2 and ViT.