• Stars
    star
    7
  • Rank 2,294,772 (Top 46 %)
  • Language
    Python
  • License
    MIT License
  • Created 6 months ago
  • Updated 4 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vision Language Models"

More Repositories

1

awesome-vision-and-language

A curated list of awesome vision and language resources (still under construction... stay tuned!)
427
star
2

Depth_from_Focus

Conventional Depth from Focus(DfF) estimation with slight focus variations in image sequences
Python
56
star
3

Explore-And-Match

Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos"
Python
42
star
4

RecycleNet

Attentional Learning of Trash Classification
Python
38
star
5

ActionMAE

[AAAI 2023] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition"
Python
16
star
6

Temporal-Span-Proposal-Network-VidVRD

What and When to look?: Temporal Span Proposal Network for Video Relation Detection
Python
15
star
7

Local-to-Global-Interaction-Networks-SGG

[TNNLS 2022] Official pytorch implementation of "Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions"
Jupyter Notebook
9
star
8

DMP

Official pytorch implementation of "Diffusion Model Patching via Mixture-of-Prompts"
Python
9
star
9

RITUAL

Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in LVLMs"
Python
8
star
10

explore-and-match

Explore-and-Match: A New Paradigm for Temporal Video Grounding with Natural Language
Python
7
star
11

Structure-from-Motion

Structure from Motion(SfM) with slight different view of images
Python
5
star
12

evo_ai

Evolutionary Algorithms (knapsack problem, traveling salesman problem, 4bit deceptive problem, neural network architecture optimization)
Python
5
star
13

SVOL

[WACV 2024] Official pytorch implementation of "SVOL: Sketch-based Video Object Localization"
Python
1
star
14

Attention-in-CNN

Attention Module in CNN (ResNet + CIFAR100)
Python
1
star
15

Cost-Out-Multitask-Learning

[Electronics] Revisiting Dropout: Escaping Pressure for Training Neural Networks with Multiple Costs
Python
1
star