• Stars
    star
    17
  • Rank 1,257,181 (Top 25 %)
  • Language
    Python
  • License
    MIT License
  • Created over 2 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Towards Local Visual Modeling for Image Captioning

More Repositories

1

External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Python
11,380
star
2

FightingCV-Paper-Reading

⭐⭐⭐FightingCV Paper Reading, which helps you understand the most advanced research work in an easier way 🍀 🍀 🍀
Shell
790
star
3

FightingCV-Course

深度学习/计算机视觉/多模态/机器学习/人工智能零基础理论/实战教程汇总分享
Shell
110
star
4

X-Dreamer

A pytorch implementation of “X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation”
Python
69
star
5

RepMLP-pytorch

Pytorch implement ion of RepMLP
Python
32
star
6

xmu-xiaoma666

32
star
7

X-Mesh

A pytorch implementation of “ X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance”
Python
25
star
8

SDATR

Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
Python
20
star
9

ECCV2022-Paper-List

ECCV2022-Paper-List
18
star
10

ImageCaptionMetrics

This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It will reveals whether the difference of two results is significant. In this code, we complete evaluation code for Spice details(*i.e.*,Object, Relation, Attribute, Color, Count, and Size ).
Python
16
star
11

CVAlgorithm

CV面试中的常见算法
Python
8
star
12

MLP-Mixer-pytorch

Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision
Python
6
star
13

MFM

An official implementation for "Knowing What to Learn: A Metric-Oriented Focal Mechanism for Image Captioning"
Python
6
star
14

Leetcode_diary

Leetcode is all you need
Python
5
star
15

Pytorch-Image-Classification

Pytorch-Image-Classification
Python
4
star
16

Awesome-Model-Pytorch

pytorch implementation of deep learning models
2
star
17

Multimodal-Open-O1

Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool works locally and aims to create inference chains akin to those used by OpenAI-o1, but with localized processing power.
Python
2
star
18

DTNet

The official repository for “Image Captioning via Dynamic Path Customization”.
Python
1
star
19

Beat

Python
1
star