SVRG-AALR: Stochastic Variance-Reduced Gradient Method with Adaptive Alternating Learning Rate for Training Deep Neural Networks
Fuzhou University SVRG-AALR: Stochastic Variance-Reduced Gradient Method with Adaptive Alternating Learning Rate for Training Deep Neural Networks SVRG-AALR Electronics 2025 7 Shiyun Zou , Hua Qin , Guolin Yang Pengfei Wang 广 西 2025-9-1
1.摘要
1. SVRG DNN DNN SVRG DNN SVRG DNN 使 DNN SVRG DNN LeNet VGG11 ResNet34 DenseNet121 DNN Lion 沿
2.引言
2. 广 DNN DNN 使 DNN DNN SGD Adam DNN DNN DNN
2.引言
2. SAG SVRG SAG SVRG SVRG SAG 使 SVRG LR SVM 使 SVRG SVRG DNN SGD SVRG DNN SVRG
2.引言
2. AALR 使 SVRG DNN SVRG- AALR i SVRG DNN Barzilai-Borwein 使 DNN ii SVRG DNN iii LeNet VGG11 ResNet34 DenseNet121 SVRG-AALR SVRG-AALR DNN AdamW AdaBound Lion
3.相关工作
3. 1. SVRG 1 w f(w) f(w) 2 μ 6 it wt T wT
3.相关工作
3. 1. SVRG 1 7 it it J={1,2,...,N} b Jb={j1,j2,...,jb} Jb J (1) υt (1) b b υt —— υt (1) 1 8
3.相关工作
3. 1. SVRG 1 8 α α α SVRG 使 DNN Barzilai-Borwein SVRG Barzilai-Borwein DNN
3.相关工作
3. 2. DNN SVRG-AALR SVRG μ AALR α_k N M epoch T φ , α , α γ∈(0,1] b DNN w_out
3.相关工作
3. 2 i α α 使 (0,1] α =0.01 α_k 7 ii 2 SVRG DNN epoch iii 3 SVRG μ iv 5-7 AALR α_k α_k α_k g_{k-1,T} w_{k-1,T} T α_k α_k=α_k/T α_k T
3.相关工作
3. viii 14 g_{k,T} 14 υ_t 线 g_{k,t} g_{k,t} υ_t 线 υ_t γ (0,1] 4/T T≥4 g_{k,T} ix 16-17 w_{k,T} 使 μ α_k
4. 实验结果
4. 4.1 DNN CIFAR10 CIFAR100 CINIC10 CIFAR10 6 32 × 32 10 5 5000 1 1000 CIFAR100 60 100 5 500 1 100 CIFAR10 CIFAR100 DNN CINIC10 CIFAR10 ImageNet[35,36] 27 32 × 32 10 9 9000 4.2 DNN SVRG-AALR LeNet VGG11 ResNet34 DenseNet121 DNN
4. 实验结果
4. LeNet DNN 使 CIFAR10 VGG11 DNN ResNet DNN —— ResNet ResNet34 VGG ResNet DenseNet121
4. 实验结果
4. 4.3 DNN SVRG-AALR DNN SGD Adam AdamW 0.001 SGD 0.9 4.4 Acc —— Prec Recall F1 F1 DNN q′ q q q TP_q TN_q FP_q FN_q
4. 实验结果
4. 4.5 LeNet a CIFAR10 线 b CIFAR10 线
4. 实验结果
4. i a SVRG-AALR 40 epoch 0.8 Lion AdamW AdaBelief 100 epoch ii b SVRG-AALR 线 40 Lion AdamW AdaBelief 100 SVRG-AALR SVRG-AALR LeNet SVRG-AALR 线 齿 SVRG-AALR
5.结论
5. SVRG 使 DNN DNN SVRG SVRG —— SVRG DNN Barzilai-Borwein i SVRG-AALR DNN DNN ii SVRG-AALR AdamW AdaBound Lion iii SVRG- AALR
谢谢!
Fuzhou University