ResNetsに対する新たな正則化手法ShakeDrop...[1] Gao Huang, Yu Sun, Zhuang Liu, Daniel...

1
72 76 80 84 ResNeXt 164 ResNeXt 29 78 80 82 84 (1,1) (0,0) (1,0) (1,[0,1]) (1,[-1,1]) (0,1) (0,[0,1]) (0,[-1,1]) ([0,1],1) ([0,1],0) ([0,1],[0,1]) ([0,1],[-1,1]) ([-1,1],1) ([-1,1],0) ([-1,1],[0,1]) ([-1,1],[-1,1]) Experiments (image classification) 72 76 80 84 ResNet 110 ResNet 164 WideResNet 28 ResNeXt 29 PyramidNet 110 52 56 60 64 68 ResNet 110 ResNet 164 WideResNet 28 ResNeXt 29 PyramidNet 110 [1] Gao Huang, Yu Sun, Zhuang Liu, Daniel Sedra and Kilian Weinberger, “Deep Networks with Stochastic Depth,” NIPS2016 [2] Xavier Gastaldi, “Shake-Shake Regularization of 3-branch Residual Networks,” ICLR2017 workshop Accuracy (%) Accuracy (%) We propose a new powerful regularization method, ShakeDrop, for improvements of ResNets architectures and achieved state-of-the-art results on image classification datasets, CIFAR-10/100 (as of Mar. 2018) We confirmed ShakeDrop stabilizes learning strongly disturbed by multiplying even a negative factor by regarding StochasticDepth [1] mechanism as a probabilistic switch of two network architectures ShakeDrop (proposed method) For any-Branch ResNets High accuracy 1-branch Shake (intermediate method) For any-Branch ResNets Low accuracy StochasticDepth [1] For any-Branch ResNets Low accuracy 60 64 68 ResNeXt 164 ResNeXt 29 CIFAR-100 Accuracy (%) Accuracy (%) (, ) Best Parameters: −, , ∈ , Parameter Search Comparison with Baseline Methods Architectures Shake-Shake [2] For 3-Branch ResNets High accuracy Accuracy (%) Tiny ImageNet 2-Branch ResNets 3-Branch ResNets Contributions Regularization methods 2-Branch ResNets 3-Branch ResNets ResNet WideResnet PyramidNet ResNeXt StochasticDepth [1] Shake-Shake [2] - - - ShakeDrop (ours) Stabilize Comparison of Regularization methods PyramidNet PyramidDrop 2-Branch ResNets 3-Branch ResNets Vanilla StochasticDepth [1] ShakeDrop (ours) Vanilla StochasticDepth-A [1] StochasticDepth-B [1] Shake-Shake [2] ShakeDrop-A (ours) ShakeDrop-B (ours) Best Parameters: −, , ∈ , 大阪府立大学 大学院工学研究科 山田 良博, 岩村 雅一, 黄瀬 浩一 ResNetsに対する新たな正則化手法ShakeDropの提案

Transcript of ResNetsに対する新たな正則化手法ShakeDrop...[1] Gao Huang, Yu Sun, Zhuang Liu, Daniel...

Page 1: ResNetsに対する新たな正則化手法ShakeDrop...[1] Gao Huang, Yu Sun, Zhuang Liu, Daniel Sedra and Kilian Weinberger, “Deep Networks with Stochastic Depth,” NIPS2016 [2]

72

76

80

84

ResNeXt 164 ResNeXt 29

78 80 82 84

(1,1)

(0,0)

(1,0)

(1,[0,1])

(1,[-1,1])

(0,1)

(0,[0,1])

(0,[-1,1])

([0,1],1)

([0,1],0)

([0,1],[0,1])

([0,1],[-1,1])

([-1,1],1)

([-1,1],0)

([-1,1],[0,1])

([-1,1],[-1,1])

Experiments (image classification)

72

76

80

84

ResNet 110 ResNet 164 WideResNet

28

ResNeXt 29 PyramidNet

110

5256606468

ResNet 110 ResNet 164 WideResNet

28

ResNeXt 29 PyramidNet

110

[1] Gao Huang, Yu Sun, Zhuang Liu, Daniel Sedra and Kilian Weinberger, “Deep Networks with Stochastic Depth,” NIPS2016

[2] Xavier Gastaldi, “Shake-Shake Regularization of 3-branch Residual Networks,” ICLR2017 workshop

Accuracy (%)

Accuracy (%)

We propose a new powerful regularization method, ShakeDrop, for improvements of ResNets architectures

and achieved state-of-the-art results on image classification datasets, CIFAR-10/100 (as of Mar. 2018)

We confirmed ShakeDrop stabilizes learning strongly disturbed by multiplying even a negative factor by

regarding StochasticDepth [1] mechanism as a probabilistic switch of two network architectures

ShakeDrop(proposed method)

• For any-Branch ResNets

• High accuracy

1-branch Shake(intermediate method)

• For any-Branch ResNets

• Low accuracy

StochasticDepth [1]• For any-Branch ResNets

• Low accuracy

60

64

68

ResNeXt 164 ResNeXt 29

CIFAR-100

Accuracy (%)

Accuracy (%)

(𝜶, 𝜷)

Best Parameters:

𝛂 ∈ −𝟏, 𝟏 , 𝜷 ∈ 𝟎, 𝟏

① Parameter Search ② Comparison with Baseline Methods

Architectures

Shake-Shake [2]• For 3-Branch ResNets

• High accuracy

Accuracy (%)

Tiny ImageNet 2-Branch ResNets 3-Branch ResNets

Contributions

Regularization methods 2-Branch ResNets 3-Branch ResNets

ResNet WideResnet PyramidNet ResNeXt

StochasticDepth [1]

Shake-Shake [2] - - -

ShakeDrop (ours)

Stabilize

Comparison of Regularization methods

PyramidNet

PyramidDrop

2-Branch ResNets 3-Branch ResNets

Vanilla

StochasticDepth [1]

ShakeDrop (ours)

Vanilla

StochasticDepth-A [1]

StochasticDepth-B [1]

Shake-Shake [2]

ShakeDrop-A (ours)

ShakeDrop-B (ours)

Best Parameters:

𝛂 ∈ −𝟏, 𝟏 , 𝜷 ∈ 𝟎, 𝟏

大阪府立大学大学院工学研究科山田良博, 岩村雅一, 黄瀬浩一

ResNetsに対する新たな正則化手法ShakeDropの提案