HaoMood/bilinear-cnn

Stars: 390
Rank: 110,242 (Top 3 %)
Language: Python
License: GNU General Publi...
Created almost 7 years ago
Updated over 5 years ago


PyTorch implementation of bilinear CNN for fine-grained image recognition

               Bilinear CNN (B-CNN) for Fine-grained recognition


DESCRIPTIONS
    After extracting the deep descriptors of an image, bilinear pooling computes
    the sum of the outer products of those descriptors. Bilinear pooling
    captures all pairwise descriptor interactions, i.e., interactions between
    different parts, in a translation-invariant manner.
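
    The pooling step described above can be sketched in plain Python. This is
    a minimal illustration, not the repository's code: the function name
    bilinear_pool and the toy descriptors are hypothetical, and a real
    implementation would use batched tensor operations in PyTorch.

```python
import random


def bilinear_pool(descriptors):
    """Sum the outer products x * x^T over all descriptors.

    `descriptors` is a list of N vectors of length D (one per spatial
    location); the result is a D x D matrix as a list of lists.
    """
    dim = len(descriptors[0])
    pooled = [[0.0] * dim for _ in range(dim)]
    for x in descriptors:
        for i in range(dim):
            for j in range(dim):
                pooled[i][j] += x[i] * x[j]
    return pooled


# Hypothetical toy input: 4 spatial locations with 3 channels each.
descriptors = [
    [1.0, 0.0, 2.0],
    [0.0, 1.0, 1.0],
    [3.0, 1.0, 0.0],
    [1.0, 1.0, 1.0],
]
pooled = bilinear_pool(descriptors)

# Reordering the locations (e.g. after shifting the image content)
# leaves the pooled matrix unchanged, because the sum ignores order.
shuffled = descriptors[:]
random.Random(0).shuffle(shuffled)
pooled_shuffled = bilinear_pool(shuffled)
```

    Because the result is a sum over locations, it depends only on which
    descriptors occur, not on where they occur. Each entry (i, j) accumulates
    the interaction between channel i and channel j across all locations.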

    B-CNN provides richer representations than linear models and achieves
    better performance than part-based fine-grained models, without requiring
    additional part annotations.
    
    Please note that this repo is relatively old and was written for PyTorch
    0.3.0. If you are using a newer version of PyTorch (say, >=0.4.0), consider
    using https://github.com/HaoMood/blinear-cnn-faster instead.


REFERENCE
    T.-Y. Lin, A. RoyChowdhury, and S. Maji. Bilinear CNN models for
    fine-grained visual recognition. In Proceedings of the IEEE International
    Conference on Computer Vision, pages 1449--1457, 2015.


PREREQUISITES
    Python 3.6 with NumPy
    PyTorch


LAYOUT
    ./data/                 # Datasets
    ./doc/                  # Automatically generated documents
    ./src/                  # Source code


USAGE
    Step 1. Fine-tune the fc layer only. It gives 76.77% test set accuracy.
    $ CUDA_VISIBLE_DEVICES=0,1,2,3 ./src/bilinear_cnn_fc.py --base_lr 1.0 \
          --batch_size 64 --epochs 55 --weight_decay 1e-8 \
          | tee "[fc-] base_lr_1.0-weight_decay_1e-8-epoch_.log"

    Step 2. Fine-tune all layers. It gives 84.17% test set accuracy.
    $ CUDA_VISIBLE_DEVICES=0,1,2,3 ./src/bilinear_cnn_all.py --base_lr 1e-2 \
          --batch_size 64 --epochs 25 --weight_decay 1e-5 \
          --model "model.pth" \
          | tee "[all-] base_lr_1e-2-weight_decay_1e-5-epoch_.log"
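
    The flags used in the two steps above could be parsed along the following
    lines. This is a sketch using argparse, not the actual contents of the
    scripts in ./src/: the function build_parser and the help texts are
    hypothetical, and reading --model as the checkpoint saved by step 1 is an
    assumption.

```python
import argparse


def build_parser():
    """Hypothetical parser covering the flags shown in the usage commands."""
    parser = argparse.ArgumentParser(
        description="Sketch of a B-CNN fine-tuning command line")
    parser.add_argument("--base_lr", type=float, required=True,
                        help="Base learning rate")
    parser.add_argument("--batch_size", type=int, required=True,
                        help="Mini-batch size")
    parser.add_argument("--epochs", type=int, required=True,
                        help="Number of training epochs")
    parser.add_argument("--weight_decay", type=float, required=True,
                        help="Weight decay coefficient")
    # Assumption: only step 2 passes --model, pointing at a saved checkpoint.
    parser.add_argument("--model", type=str, default=None,
                        help="Path to a previously saved model")
    return parser


# Step 1 flags: fine-tune the fc layer only.
step1 = build_parser().parse_args(
    ["--base_lr", "1.0", "--batch_size", "64",
     "--epochs", "55", "--weight_decay", "1e-8"])

# Step 2 flags: fine-tune all layers, starting from a saved model.
step2 = build_parser().parse_args(
    ["--base_lr", "1e-2", "--batch_size", "64",
     "--epochs", "25", "--weight_decay", "1e-5",
     "--model", "model.pth"])
```

    Note that step 2 uses a learning rate 100 times smaller than step 1,
    together with a larger weight decay and fewer epochs.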


AUTHOR
    Hao Zhang: [email protected]


LICENSE
    CC BY-SA 3.0

blinear-cnn-faster

A Faster PyTorch implementation of bilinear CNN for fine-grained image recognition

File

Reference Files

homepage

Hao Zhang's Homepage

ddt

Deep Descriptor Transforming for image retrieval

crow

Python implementation of CroW for unsupervised image retrieval

cpp-primer

Solutions to C++ Primer, 5th edition

psychology

Psychology notes

fine-grained-baseline

Baseline results of fine-grained visual recognition

pwa

Python implementation of PWA for image retrieval.

Introduction-to-Programming-with-MATLAB

Assignments of Introduction to Programming with MATLAB Course

vgg-16-bn

Train CIFAR-10 using VGG-16-BN, which gives 93.39% test set accuracy.

vbird-linux

Chapter 13: Learning shell scripts

ufldl

Assignments of UFLDL

emu

Assignments of Embedded System course

6.189

MIT 6.189 Assignments and Projects

6.s096-Effective-Programming-in-C-and-C-

Solution to assignments of MIT 6.s096 course

cs231n

Assignments of cs231n Course

Jupyter Notebook

pytorch-utilities

Data loader and other utility files for PyTorch

6.042-mathematics-for-computer-science

Code for MIT 6.042 Course