

MixUp

This is an implementation of, and an improvement on, *mixup: Beyond Empirical Risk Minimization* (https://arxiv.org/abs/1710.09412).
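For reference, the core mixup operation from the paper can be sketched in NumPy (hypothetical helper names; the repository's actual implementation lives in `symbols/mixup.py`):

```python
import numpy as np

def mixup_batch(x, y_onehot, alpha=0.2, rng=np.random):
    """Blend each sample with a random partner from the same batch.

    x:        (batch, ...) input array
    y_onehot: (batch, num_classes) one-hot labels
    alpha:    Beta-distribution parameter, as in the paper
    """
    lam = rng.beta(alpha, alpha)        # mixing coefficient in [0, 1]
    perm = rng.permutation(len(x))      # random partner index for each sample
    x_mixed = lam * x + (1.0 - lam) * x[perm]
    y_mixed = lam * y_onehot + (1.0 - lam) * y_onehot[perm]
    return x_mixed, y_mixed
```

With `alpha = 0.2` (the value used in the experiments below), the Beta(0.2, 0.2) distribution concentrates near 0 and 1, so most mixed samples stay close to one of the two originals.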

The improvements:

  1. Added an explicit backward pass.
  2. Added a `mix_rate` parameter.
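A sketch of one reading of these two additions (an assumption on my part, not the repository's exact code: `mix_rate` taken as the fraction of the batch that actually gets mixed, and a backward that routes gradients to both samples of each mixed pair):

```python
import numpy as np

def mixup_forward(x, lam, perm, mix_rate):
    """Mix only the first mix_rate fraction of the batch; leave the rest unchanged."""
    n_mix = int(mix_rate * len(x))
    out = x.copy()
    out[:n_mix] = lam * x[:n_mix] + (1.0 - lam) * x[perm[:n_mix]]
    return out

def mixup_backward(grad_out, lam, perm, mix_rate):
    """Backward of the forward above: each input receives lam times its own
    output gradient, plus (1 - lam) times the gradient of every mixed sample
    it was blended into as a partner."""
    n_mix = int(mix_rate * len(grad_out))
    grad_in = grad_out.copy()
    grad_in[:n_mix] *= lam
    # Scatter-add the partner contributions (a partner index may repeat).
    np.add.at(grad_in, perm[:n_mix], (1.0 - lam) * grad_out[:n_mix])
    return grad_in
```

The scatter-add is what makes the backward exact: an input that served as a partner for a mixed output accumulates `(1 - lam)` of that output's gradient on top of its own path.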

Two application scenarios:

*(figure)*

The detailed design of the MixUp layer:

*(figure)*

The results:

The resnet50 symbol is written with MXNet (https://github.com/apache/incubator-mxnet/tree/master/example/image-classification/symbols); there are many versions there, and I have not done any optimization on it. All results below are based on this baseline.

| CIFAR-10 model | alpha | mix_rate | test accuracy | initial learning rate | batch size |
|---|---|---|---|---|---|
| (ERM) resnet50, 90 epochs | – | – | 0.87900390625 | 0.05 | 256 |
| (ERM) resnet50, 200 epochs | – | – | 0.89365234375 | 0.05 | 256 |
| (ERM) resnet50, 300 epochs | – | – | 0.8931640625 | 0.05 | 256 |
| (mixup) resnet50, 90 epochs | 0.2 | 0.7 | 0.8609375 | 0.7 | 256 |
| (mixup) resnet50, 200 epochs | 0.2 | 0.7 | 0.91611328125 | 0.7 | 256 |
| (mixup) resnet50, 300 epochs | 0.2 | 0.7 | 0.9224609375 | 0.7 | 256 |
| mixup in feature maps (resnet50 head conv), 90 epochs | 0.2 | 0.7 | 0.8544921875 | 0.7 | 256 |
| mixup in feature maps (resnet50 head conv), 200 epochs | 0.2 | 0.7 | 0.91796875 | 0.7 | 256 |
| mixup in feature maps (resnet50 head conv), 300 epochs | 0.2 | 0.7 | 0.91845703125 | 0.7 | 256 |

MixUp: *(figure)*

Mixup in feature maps (resnet50 head conv): *(figure)*

ERM: *(figure)*

Usage

Install mxnet 0.12. The MixUp operator is in `symbols/mixup.py`; you can use it in your code like:

```python
data, label = mx.sym.Custom(data=data, label=label, alpha=0.2,
                            num_classes=num_classes, batch_size=batch_size,
                            mix_rate=0.7, op_type='MixUp')
```

`label` is a vector of class indices, like `[4,8,...9]`.
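Since `label` arrives as class indices while mixup needs soft targets, the operator presumably one-hot-encodes the labels before blending them. A minimal sketch of that conversion (hypothetical helper, not the repository's code):

```python
import numpy as np

def mix_labels(label, num_classes, lam, perm):
    """Convert index labels to one-hot vectors and blend them with coefficient lam."""
    onehot = np.eye(num_classes)[label]               # (batch, num_classes)
    return lam * onehot + (1.0 - lam) * onehot[perm]  # soft targets
```

Each output row is a probability distribution with mass `lam` on the original class and `1 - lam` on the partner's class.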

Download the dataset:

http://data.mxnet.io/data/cifar10/cifar10_val.rec

http://data.mxnet.io/data/cifar10/cifar10_train.rec

Train & test:

```shell
./train.sh
./test.sh
```

Reference

Zhang H, Cisse M, Dauphin Y N, et al. mixup: Beyond Empirical Risk Minimization[J]. arXiv preprint arXiv:1710.09412, 2017.