
A Faster PyTorch implementation of bilinear CNN for fine-grained image recognition
     Mean field approximation of Bilinear CNN for Fine-grained recognition


DESCRIPTIONS
    After obtaining the deep descriptors of an image, bilinear pooling computes
    the sum of the outer products of those deep descriptors. Bilinear pooling
    captures all pairwise descriptor interactions, i.e., interactions between
    different parts, in a translation-invariant manner.
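
    For reference, a minimal PyTorch sketch of this pooling step. The signed
    square root and l2 normalization follow the common bilinear CNN recipe;
    the function and variable names are illustrative and not taken from this
    repository.

        import torch
        import torch.nn.functional as F

        def bilinear_pool(x):
            """Bilinear pooling of conv features x with shape (N, C, H, W)."""
            n, c, h, w = x.size()
            x = x.view(n, c, h * w)                  # flatten spatial dims
            phi = torch.bmm(x, x.transpose(1, 2))    # (N, C, C): sum of outer products
            phi = phi.view(n, c * c)
            phi = torch.sign(phi) * torch.sqrt(torch.abs(phi) + 1e-5)  # signed sqrt
            return F.normalize(phi)                  # per-sample l2 normalization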

    This project aims at accelerating training in the first step. We extract
    VGG-16 relu5-3 features from the ImageNet pre-trained model in advance and
    save them to disk. In the first step, we then train the model directly on
    the extracted relu5-3 features, which avoids forwarding the convolutional
    layers multiple times.
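
    A minimal sketch of what this extraction stage could look like; the data
    layout, transform, and output file name are assumptions for illustration
    (see ./src/get_conv.py for the actual implementation).

        import torch
        import torchvision
        from torchvision import transforms

        # VGG-16 pre-trained on ImageNet, truncated right after relu5-3
        # (i.e., dropping the final max-pooling layer of the conv stack).
        vgg = torchvision.models.vgg16(pretrained=True).features[:-1].cuda().eval()

        # Hypothetical loader over CUB training images resized to 448x448.
        transform = transforms.Compose([
            transforms.Resize((448, 448)),
            transforms.ToTensor(),
            transforms.Normalize(mean=[0.485, 0.456, 0.406],
                                 std=[0.229, 0.224, 0.225]),
        ])
        loader = torch.utils.data.DataLoader(
            torchvision.datasets.ImageFolder('data/cub200/train', transform=transform),
            batch_size=32)

        features, labels = [], []
        with torch.no_grad():
            for images, targets in loader:
                features.append(vgg(images.cuda()).cpu())  # (N, 512, 28, 28) per batch
                labels.append(targets)
        torch.save({'features': torch.cat(features), 'labels': torch.cat(labels)},
                   'relu5-3_train.pth')                    # hypothetical file name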


PREREQUISITES
    Python 3.6 with NumPy
    PyTorch


LAYOUT
    ./data/                 # Datasets
    ./doc/                  # Automatically generated documents
    ./src/                  # Source code


USAGE
    Step 1. Fine-tune the fc layer only.
    # Get relu5-3 features from the VGG-16 ImageNet pre-trained model.
    # It gives 75.47% accuracy on CUB.
    $ CUDA_VISIBLE_DEVICES=0 ./src/get_conv.py
    $ CUDA_VISIBLE_DEVICES=0,1,2,3 ./src/train.py --base_lr 1e0 \
          --batch_size 64 --epochs 80 --weight_decay 1e-5 \
          | tee "[fc-] base_lr_1e0-weight_decay_1e-5_.log"

    Step 2. Fine-tune all layers.
    # It gives 84.41% accuracy on CUB.
    $ CUDA_VISIBLE_DEVICES=0,1,2,3 ./src/train.py --base_lr 1e-2 \
          --batch_size 64 --epochs 80 --weight_decay 1e-5 \
          --pretrained "bcnn_fc_epoch_.pth" \
          | tee "[all-] base_lr_1e-2-weight_decay_1e-5.log"


AUTHOR
    Hao Zhang: [email protected]


LICENSE
    CC BY-SA 3.0