There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper βUnbiased Scene Graph Generation from Biased Training CVPR 2020βLong-Tailed-Recognition.pytorch
[NeurIPS 2020] This project provides a strong single-stage baseline for Long-Tailed Classification, Detection, and Instance Segmentation (LVIS). It is also a PyTorch implementation of the NeurIPS 2020 paper 'Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect'.VQA2.0-Recent-Approachs-2018.pytorch
A pytroch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures", "Learning to count object", "Bottom-up top-down" for Visual Question Answering 2.0ResNet50-Pytorch-Face-Recognition
Using Pytorch to implement a ResNet50 for Cross-Age Face RecognitionVCTree-Scene-Graph-Generation
Code for the Scene Graph Generation part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contexts"Generalized-Long-Tailed-Benchmarks.pytorch
[ECCV 2022] A generalized long-tailed challenge that incorporates both the conventional class-wise imbalance and the overlooked attribute-wise imbalance within each class. The proposed IFL together with other baselines are also included.GGNN-for-bAbI-dataset.pytorch.1.0
A Complete PyTorch 1.0 Implementation of Gated Graph Sequence Neural Networks (GGNN)ResNet50-Tensorflow-Face-Recognition
Using Tensorflow to implement a ResNet50 for Cross-Age Face RecognitionVCTree-Visual-Question-Answering
Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contexts"Local-Disco-Diffusion-v5.2.jupyterNote
A custom Disco Diffusion v5.2 that runs on local GPUS.LVIS-for-mmdetection
support Large Vocabulary Instance Segmentation (LVIS) dataset for mmdetectionKinetics-Data-Preprocessing
An instruction to 1) download the Kinetics-400/Kinetics-600, 2) resize the videos, and 3) prepare annotations.Qwen-Tokenizer-Pruner
Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this project provides a Tokenizer vocabulary shearing solution for Qwen and Qwen-VL.Describe-and-Guess-GAME-Using-GPT-3
A simple demo of how to use GPT-3 to play Describe-and-Guess in the specified topic and question type.kai-blog
faster-rcnn.pytorch
Minimalist-TinyLLaMA-to-Onnx
Export TinyLLaMA to Onnx and Conduct LLM inference using onnxruntimeQuick-Draw-Multimodal-Recognition
The Course Project of CE7454 (Team 13)Love Open Source and this site? Check out how you can help us