ResNetsに対する新たな正則化手法ShakeDrop...[1] Gao Huang, Yu Sun, Zhuang Liu, Daniel...
Transcript of ResNetsに対する新たな正則化手法ShakeDrop...[1] Gao Huang, Yu Sun, Zhuang Liu, Daniel...
72
76
80
84
ResNeXt 164 ResNeXt 29
78 80 82 84
(1,1)
(0,0)
(1,0)
(1,[0,1])
(1,[-1,1])
(0,1)
(0,[0,1])
(0,[-1,1])
([0,1],1)
([0,1],0)
([0,1],[0,1])
([0,1],[-1,1])
([-1,1],1)
([-1,1],0)
([-1,1],[0,1])
([-1,1],[-1,1])
Experiments (image classification)
72
76
80
84
ResNet 110 ResNet 164 WideResNet
28
ResNeXt 29 PyramidNet
110
5256606468
ResNet 110 ResNet 164 WideResNet
28
ResNeXt 29 PyramidNet
110
[1] Gao Huang, Yu Sun, Zhuang Liu, Daniel Sedra and Kilian Weinberger, “Deep Networks with Stochastic Depth,” NIPS2016
[2] Xavier Gastaldi, “Shake-Shake Regularization of 3-branch Residual Networks,” ICLR2017 workshop
Accuracy (%)
Accuracy (%)
We propose a new powerful regularization method, ShakeDrop, for improvements of ResNets architectures
and achieved state-of-the-art results on image classification datasets, CIFAR-10/100 (as of Mar. 2018)
We confirmed ShakeDrop stabilizes learning strongly disturbed by multiplying even a negative factor by
regarding StochasticDepth [1] mechanism as a probabilistic switch of two network architectures
ShakeDrop(proposed method)
• For any-Branch ResNets
• High accuracy
1-branch Shake(intermediate method)
• For any-Branch ResNets
• Low accuracy
StochasticDepth [1]• For any-Branch ResNets
• Low accuracy
60
64
68
ResNeXt 164 ResNeXt 29
CIFAR-100
Accuracy (%)
Accuracy (%)
(𝜶, 𝜷)
Best Parameters:
𝛂 ∈ −𝟏, 𝟏 , 𝜷 ∈ 𝟎, 𝟏
① Parameter Search ② Comparison with Baseline Methods
Architectures
Shake-Shake [2]• For 3-Branch ResNets
• High accuracy
Accuracy (%)
Tiny ImageNet 2-Branch ResNets 3-Branch ResNets
Contributions
Regularization methods 2-Branch ResNets 3-Branch ResNets
ResNet WideResnet PyramidNet ResNeXt
StochasticDepth [1]
Shake-Shake [2] - - -
ShakeDrop (ours)
Stabilize
Comparison of Regularization methods
PyramidNet
PyramidDrop
2-Branch ResNets 3-Branch ResNets
Vanilla
StochasticDepth [1]
ShakeDrop (ours)
Vanilla
StochasticDepth-A [1]
StochasticDepth-B [1]
Shake-Shake [2]
ShakeDrop-A (ours)
ShakeDrop-B (ours)
Best Parameters:
𝛂 ∈ −𝟏, 𝟏 , 𝜷 ∈ 𝟎, 𝟏
大阪府立大学大学院工学研究科山田良博, 岩村雅一, 黄瀬浩一
ResNetsに対する新たな正則化手法ShakeDropの提案