• Stars
    star
    231
  • Rank 173,434 (Top 4 %)
  • Language
  • Created almost 8 years ago
  • Updated almost 8 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Object detectors based on DL (from: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html )

Object Detection

Method VOC2007 VOC2010 VOC2012 ILSVRC 2013 MSCOCO 2015 Speed
OverFeat - - - 24.3% - -
R-CNN (AlexNet) 58.5% 53.7% 53.3% 31.4% - -
R-CNN (VGG16) 66.0% - - - - -
SPP_net(ZF-5) 54.2%(1-model), 60.9%(2-model) - - 31.84%(1-model), 35.11%(6-model) - -
DeepID-Net 64.1% - - 50.3% - -
NoC 73.3% - 68.8% - - -
Fast-RCNN (VGG16) 70.0% 68.8% 68.4% - 19.7%(@[0.5-0.95]), 35.9%(@0.5) -
MR-CNN 78.2% - 73.9% - - -
Faster-RCNN (VGG16) 78.8% - 75.9% - 21.9%(@[0.5-0.95]), 42.7%(@0.5) 198ms
Faster-RCNN (ResNet-101) 85.6% - 83.8% - 37.4%(@[0.5-0.95]), 59.0%(@0.5) -
SSD300 (VGG16) 72.1% - - - - 58 fps
SSD500 (VGG16) 75.1% - - - - 23 fps
ION 79.2% - 76.4% - - -
CRAFT 75.7% - 71.3% 48.5% - -
OHEM 78.9% - 76.3% - 25.5%(@[0.5-0.95]), 45.9%(@0.5) -
R-FCN (ResNet-50) 77.4% - - - - 0.12sec(K40), 0.09sec(TitianX)
R-FCN (ResNet-101) 79.5% - - - - 0.17sec(K40), 0.12sec(TitianX)
R-FCN (ResNet-101),multi sc train 83.6% - 82.0% - 31.5%(@[0.5-0.95]), 53.2%(@0.5) -
PVANet 9.0 81.8% - 82.5% - - 750ms(CPU), 46ms(TitianX)

Leaderboard

Detection Results: VOC2012

Papers

Deep Neural Networks for Object Detection

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

R-CNN

Rich feature hierarchies for accurate object detection and semantic segmentation

MultiBox

Scalable Object Detection using Deep Neural Networks

Scalable, High-Quality Object Detection

SPP-Net

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

DeepID-Net

DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection

Object Detectors Emerge in Deep Scene CNNs

segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection

NoC

Object Detection Networks on Convolutional Feature Maps

Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction

Fast R-CNN

Fast R-CNN

DeepBox

DeepBox: Learning Objectness with Convolutional Networks

MR-CNN

Object detection via a multi-region & semantic segmentation-aware CNN model

Faster R-CNN

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Faster R-CNN in MXNet with distributed implementation and data parallelization

Contextual Priming and Feedback for Faster R-CNN

An Implementation of Faster RCNN with Study for Region Sampling

YOLO

You Only Look Once: Unified, Real-Time Object Detection

darkflow - translate darknet to tensorflow. Load trained weights, retrain/fine-tune them using tensorflow, export constant graph def to C++

Start Training YOLO with Our Own Data

R-CNN minus R

AttentionNet

AttentionNet: Aggregating Weak Directions for Accurate Object Detection

DenseBox

DenseBox: Unifying Landmark Localization with End to End Object Detection

SSD

SSD: Single Shot MultiBox Detector

Inside-Outside Net (ION)

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks

Adaptive Object Detection Using Adjacency and Zoom Prediction

G-CNN

G-CNN: an Iterative Grid Based Object Detector

Factors in Finetuning Deep Model for object detection

Factors in Finetuning Deep Model for Object Detection with Long-tail Distribution

We don’t need no bounding-boxes: Training object class detectors using only human verification

HyperNet

HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection

MultiPathNet

A MultiPath Network for Object Detection

CRAFT

CRAFT Objects from Images

OHEM

Training Region-based Object Detectors with Online Hard Example Mining

Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection

Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers

R-FCN

R-FCN: Object Detection via Region-based Fully Convolutional Networks

Weakly supervised object detection using pseudo-strong labels

Recycle deep features for better object detection

MS-CNN

A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

Multi-stage Object Detection with Group Recursive Learning

Subcategory-aware Convolutional Neural Networks for Object Proposals and Detection

PVANET

PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection

PVANet: Lightweight Deep Neural Networks for Real-time Object Detection

GBD-Net

Gated Bi-directional CNN for Object Detection

Crafting GBD-Net for Object Detection

StuffNet

StuffNet: Using ‘Stuff’ to Improve Object Detection

Generalized Haar Filter based Deep Networks for Real-Time Object Detection in Traffic Scene

Hierarchical Object Detection with Deep Reinforcement Learning

Learning to detect and localize many objects from few examples

Speed/accuracy trade-offs for modern convolutional object detectors

SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving

Feature Pyramid Network (FPN)

Feature Pyramid Networks for Object Detection

Action-Driven Object Detection with Top-Down Visual Attentions

Beyond Skip Connections: Top-Down Modulation for Object Detection

YOLOv2

YOLO9000: Better, Faster, Stronger

DSSD

DSSD : Deconvolutional Single Shot Detector

Wide-Residual-Inception Networks for Real-time Object Detection

Attentional Network for Visual Object Detection

Detection From Video

Learning Object Class Detectors from Weakly Annotated Video

Analysing domain shift factors between videos and images for object detection

Video Object Recognition

Deep Learning for Saliency Prediction in Natural Video

T-CNN

T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos

Object Detection from Video Tubelets with Convolutional Neural Networks

Object Detection in Videos with Tubelets and Multi-context Cues

Context Matters: Refining Object Detection in Video with Recurrent Neural Networks

CNN Based Object Detection in Large Video Images

Datasets

YouTube-Objects dataset v2.2

ILSVRC2015: Object detection from video (VID)

Object Detection in 3D

Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient Convolutional Neural Networks

Object Detection on RGB-D

Learning Rich Features from RGB-D Images for Object Detection and Segmentation

Differential Geometry Boosts Convolutional Neural Networks for Object Detection

Salient Object Detection

This task involves predicting the salient regions of an image given by human eye fixations.

Best Deep Saliency Detection Models (CVPR 2016 & 2015)

http://i.cs.hku.hk/~yzyu/vision.html

Large-scale optimization of hierarchical features for saliency prediction in natural images

Predicting Eye Fixations using Convolutional Neural Networks

Saliency Detection by Multi-Context Deep Learning

DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection

SuperCNN: A Superpixelwise Convolutional Neural Network for Salient Object Detection

Shallow and Deep Convolutional Networks for Saliency Prediction

Recurrent Attentional Networks for Saliency Detection

Two-Stream Convolutional Networks for Dynamic Saliency Prediction

Unconstrained Salient Object Detection

Unconstrained Salient Object Detection via Proposal Subset Optimization

DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection

Salient Object Subitizing

Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection

Saliency Detection via Combining Region-Level and Pixel-Level Predictions with CNNs

Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection

A Deep Multi-Level Network for Saliency Prediction

Visual Saliency Detection Based on Multiscale Deep CNN Features

A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection

Deeply supervised salient object detection with short connections

Weakly Supervised Top-down Salient Object Detection

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

Visual Saliency Prediction Using a Mixture of Deep Neural Networks

A Fast and Compact Salient Score Regression Network Based on Fully Convolutional Network

Saliency Detection in Video

Deep Learning For Video Saliency Detection

Datasets

MSRA10K Salient Object Database

http://mmcheng.net/msra10k/

Specific Object Deteciton

Face Deteciton

Multi-view Face Detection Using Deep Convolutional Neural Networks

From Facial Parts Responses to Face Detection: A Deep Learning Approach

Compact Convolutional Neural Network Cascade for Face Detection

Face Detection with End-to-End Integration of a ConvNet and a 3D Model

CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection

Finding Tiny Faces

Towards a Deep Learning Framework for Unconstrained Face Detection

Supervised Transformer Network for Efficient Face Detection

UnitBox

UnitBox: An Advanced Object Detection Network

Bootstrapping Face Detection with Hard Negative Examples

Grid Loss: Detecting Occluded Faces

A Multi-Scale Cascade Fully Convolutional Network Face Detector

MTCNN

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Neural Networks

Face Detection using Deep Learning: An Improved Faster RCNN Approach

Faceness-Net: Face Detection through Deep Facial Part Responses

Datasets / Benchmarks

FDDB: Face Detection Data Set and Benchmark

WIDER FACE: A Face Detection Benchmark

Facial Point / Landmark Detection

Deep Convolutional Network Cascade for Facial Point Detection

Facial Landmark Detection by Deep Multi-task Learning

A Recurrent Encoder-Decoder Network for Sequential Face Alignment

Detecting facial landmarks in the video based on a hybrid framework

Deep Constrained Local Models for Facial Landmark Detection

Effective face landmark localization via single deep network

People Detection

End-to-end people detection in crowded scenes

Detecting People in Artwork with CNNs

Person Head Detection

Context-aware CNNs for person head detection

Pedestrian Detection

Pedestrian Detection aided by Deep Learning Semantic Tasks

Deep Learning Strong Parts for Pedestrian Detection

Deep convolutional neural networks for pedestrian detection

Scale-aware Fast R-CNN for Pedestrian Detection

New algorithm improves speed and accuracy of pedestrian detection

Pushing the Limits of Deep CNNs for Pedestrian Detection

  • intro: “set a new record on the Caltech pedestrian dataset, lowering the log-average miss rate from 11.7% to 8.9%”
  • arxiv: http://arxiv.org/abs/1603.04525

A Real-Time Deep Learning Pedestrian Detector for Robot Navigation

A Real-Time Pedestrian Detector using Deep Learning for Human-Aware Navigation

Is Faster R-CNN Doing Well for Pedestrian Detection?

Reduced Memory Region Based Deep Convolutional Neural Network Detection

Fused DNN: A deep neural network fusion approach to fast and robust pedestrian detection

Multispectral Deep Neural Networks for Pedestrian Detection

Vehicle Detection

DAVE: A Unified Framework for Fast Vehicle Detection and Annotation

Evolving Boxes for fast Vehicle Detection

Traffic-Sign Detection

Traffic-Sign Detection and Classification in the Wild

Boundary / Edge / Contour Detection

Holistically-Nested Edge Detection

Unsupervised Learning of Edges

Pushing the Boundaries of Boundary Detection using Deep Learning

Convolutional Oriented Boundaries

Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks

Richer Convolutional Features for Edge Detection

Skeleton Detection

Object Skeleton Extraction in Natural Images by Fusing Scale-associated Deep Side Outputs

DeepSkeleton: Learning Multi-task Scale-associated Deep Side Outputs for Object Skeleton Extraction in Natural Images

Fruit Detection

Deep Fruit Detection in Orchards

Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards

Others

Deep Deformation Network for Object Landmark Localization

Fashion Landmark Detection in the Wild

Deep Learning for Fast and Accurate Fashion Item Detection

Visual Relationship Detection with Language Priors

OSMDeepOD - OSM and Deep Learning based Object Detection from Aerial Imagery (formerly known as “OSM-Crosswalk-Detection”)

Selfie Detection by Synergy-Constraint Based Convolutional Neural Network

Associative Embedding:End-to-End Learning for Joint Detection and Grouping

Deep Cuboid Detection: Beyond 2D Bounding Boxes

Automatic Model Based Dataset Generation for Fast and Accurate Crop and Weeds Detection

Deep Learning Logo Detection with Data Expansion by Synthesising Context

Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks

Object Proposal

DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers

Scale-aware Pixel-wise Object Proposal Networks

Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization

Learning to Segment Object Proposals via Recursive Neural Networks

Localization

Beyond Bounding Boxes: Precise Localization of Objects in Images

Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning

Weakly Supervised Object Localization Using Size Estimates

Active Object Localization with Deep Reinforcement Learning

Localizing objects using referring expressions

LocNet: Improving Localization Accuracy for Object Detection

Learning Deep Features for Discriminative Localization

ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization

Tutorials / Talks

Convolutional Feature Maps: Elements of efficient (and accurate) CNN-based object detection

Towards Good Practices for Recognition & Detection

Projects

TensorBox: a simple framework for training neural networks to detect objects in images

Object detection in torch: Implementation of some object detection frameworks in torch

Using DIGITS to train an Object Detection network

FCN-MultiBox Detector

KittiBox: A car detection model implemented in Tensorflow.

Blogs

Convolutional Neural Networks for Object Detection

http://rnd.azoft.com/convolutional-neural-networks-object-detection/

Introducing automatic object detection to visual search (Pinterest)

Deep Learning for Object Detection with DIGITS

Analyzing The Papers Behind Facebook’s Computer Vision Approach

Easily Create High Quality Object Detectors with Deep Learning

How to Train a Deep-Learned Object Detection Model in the Microsoft Cognitive Toolkit

Object Detection in Satellite Imagery, a Low Overhead Approach

You Only Look Twice — Multi-Scale Object Detection in Satellite Imagery With Convolutional Neural Networks

Faster R-CNN Pedestrian and Car Detection

Small U-Net for vehicle detection

More Repositories

1

Multitarget-tracker

Multiple Object Tracker, Based on Hungarian algorithm + Kalman filter.
C++
2,006
star
2

LogPolarFFTTemplateMatcher

OpenCV Fourier-Mellin algorithm implementation.
C++
151
star
3

TILT-Cpp-port

"TILT: Transform Invariant Low-rank Textures" CPP port.
C++
17
star
4

ellipseDetector

OpenCV ellipse detector.
C++
17
star
5

RotatedRectLib

Small library for working with rotated rectangle shaped image regions.
C++
15
star
6

nano_smpl

C
13
star
7

openvino_pedestrian_tracker

Standalone openvino pedestrian tracking demo project,
C++
11
star
8

nano_bfm

Basel morphable face model mesh and texture generator using GPU.
C
10
star
9

SelfCalibration

SelfCalibration using single image.
C++
9
star
10

PRNet_PyTorch_v2

Fixed version of https://github.com/tomguluson92/PRNet_PyTorch
Python
9
star
11

kaldi_vosk_win_cmake

cmake based kaldi + vosk + microphone speech recognition example
C++
7
star
12

Caffemodel-Parser

Tool for parse caffemodel files, and dump them to binary files.
Protocol Buffer
6
star
13

imgui_canvas

2d 3d plotting application temp;ate
C++
6
star
14

image-size-scanner

Scans image files and extracts image sizes, no image decoding, no dependencies.
C++
6
star
15

imgui_opencv_template

template project for imgui and opencv
C
6
star
16

mixtures_of_trees_FaceDetector_openCV

OpenCV port for mixtures of trees face detector. From site: http://www.ics.uci.edu/~xzhu/face/
C++
5
star
17

imgui_image_viewer

imgui image list
C
5
star
18

SpatialSubspaceRotation

Visual heart rate measure using Spatial Subspace Rotation (2SR) method. Inspired by https://github.com/partofthestars/Spatial-Subspace-Rotation
C++
5
star
19

dashedLineCompiler

Dashed line/polyline compiler, it provides ability to plot dashed lines to any plotting framework which can draw solid lines.
C++
5
star
20

AIML_editor

AIML bot code edidtor/debugger
HTML
4
star
21

SMPLpp

C
4
star
22

aiml_cpp

AIML interpreter C++ implementation with utf-8 support.
C++
3
star
23

TrackerGenerator

Tracker generator tool to configure and generate trackers from a set of components.
C
3
star
24

ImGUI_colorer

ImGui editor with syntax highlighing based on colorer project.
C
3
star
25

SuiteSparse

C
3
star
26

SparseAutoencoder

Разреженный автоэнкодер (visual studio)
MATLAB
3
star
27

optic_cad

Just a simple tandem of Easy3d and goptical libraries.
C++
3
star
28

FontToPNG

Generates PNG images for font glyphs,
C
3
star
29

PiecewiseAffineWarper

Fast OpenCv implementation of piecewise affine warper
C++
2
star
30

gabor_segmentation

C++
2
star
31

TrackerGeneratorProto

Prototype model project for TrackerGenerator project
C++
2
star
32

ImGuiColorTextEdit_rus

ImGuiColorTextEdit with russian support
C++
2
star
33

ScrollingBuffer

Simple scrolling buffer I used for multimodal time series applications.
C++
2
star
34

ImIDE

A chunk of code from my project, to fast start ide like projects.
C
2
star
35

tor_tools

tool for working with tor network (tested on visual studio 2019)
C
2
star
36

FaceRotate

Piecewise affine transform example.
2
star
37

Cmake-Dark-Edition

Cmake with dark theme.
C
2
star
38

slam2d_c_port

Минимальная реализация EKF SLAM 2D
C++
2
star
39

glut_cmake_template

Minimal glut CMake project, all dependencies are fetched from internet.
C++
2
star
40

cvGDI

C++
1
star
41

pytorch.nms

python 3.7, vs2019 version
Cuda
1
star
42

NavarroHumanEyeModel

Optical model of human eye.
C++
1
star
43

CaffeGui

Node Editor for CAFFE.
C++
1
star
44

Dots-RL-AI-gym

C++
1
star
45

GitStarsToMarkDown

Git stars list to single markdown file tool
Python
1
star
46

MATLABtoEigenCheatsheet

MATLAB to Eigen cheatsheet
1
star
47

slogi

Разбивка слов на слоги. (Конечный автомат)
C++
1
star
48

ShowcaseViewNext

Espian's Showcase View adopted to new api (java and kotlin versions included).
Java
1
star
49

NormalsComputerClass

Simaple per-face and per-vertex normals computing class for triangle mesh.
C++
1
star
50

HeadControl

C++
1
star
51

adaptagrams_win_cmake_java

Clone of adaptagrams project, tuned for Visual Studio + swig 4.0.1 for java binding. Build using CMake. Tested for java 17.
C++
1
star
52

copperspice_basic_app

CMake
1
star
53

XYPlot

C++
1
star
54

IIR_Filter_Design

Мои эксперименты с БИХ фильтрами. И библиотекой Iir1.
C++
1
star
55

navec_cpp

Word embedding model, based on navec project.
C
1
star
56

numerals

Функции работы с русскими числительными
C++
1
star
57

ConvNN

GPU accelerated simple convolutional network trainer/classifier.
XML
1
star
58

Optix7TemplateProject

Template Cmake based
C++
1
star
59

evevtBasedStdAppStarter

Starter project for event based multithreaded application. Based on eventpp library, no boost needed. Included spdlog.
C++
1
star