当前位置:网站首页>Learning rate optimization strategy
Learning rate optimization strategy
2022-07-24 05:57:00 【Didi'cv】
Warmup Strategy
Deep learning , The initial weight of the model is randomly generated in the training start-up stage , choice Warmup The strategy can make the model use less learning at the beginning of training
Practice at a regular rate , After a set number of iterations , After the model tends to be stable , Then change to the preset learning rate , Achieve the effect of preheating the learning rate , It can prevent the model from shaking , Accelerate the convergence speed of the network , Improve the effect .
Used in experiments Warmup In strategy Gradual Warmup, That is, in the preheating stage of learning rate, the learning rate will gradually increase with the increase of iteration times , Until the end of the warm-up phase, the learning rate reaches the preset value , And then do the follow-up training , This can avoid the sudden increase of learning rate and the sharp increase of training error .


Poly Strategy
Learning rate is a super parameter that has a great influence on the weight update of the model . Only when the initial learning rate is set reasonably can the model be optimized , Too small will lead to slow convergence , Too large will lead to instability or convergence failure . The learning rate needs to change with the degree of online training , Its change strategy is very important , There are many strategies in deep learning , Such as Fixed Strategy 、Poly Strategy and sigmoid Strategy . In this paper, the experimental results are as follows SGD The optimization strategy adds Poly Learning rate decay strategy , The current learning rate is 

边栏推荐
- [activiti] process variables
- "Statistical learning methods (2nd Edition)" Li Hang Chapter 14 clustering method mind map notes and after-school exercise answers (detailed steps) K-means hierarchical clustering Chapter 14
- Iotp2pgate two IOT devices point-to-point communication fast implementation scheme
- 西瓜书/南瓜书--第1,2章总结
- Two architectures of data integration: ELT and ETL
- 目标检测带标签数据增强代码
- How to quickly connect CRM system and ERP system to realize the automatic flow of business processes
- Erp+rpa opens up the enterprise information island, and the enterprise benefits are doubled
- Answers and analysis of some after-school exercises in signals and systems (Wujing)
- Machine learning (Zhou Zhihua) Chapter 3 Notes on learning linear models
猜你喜欢

《统计学习方法(第2版)》李航 第14章 聚类方法 思维导图笔记 及 课后习题答案(步骤详细) k-均值 层次聚类 第十四章

在网络中添加SE通道注意力模块

JVM系统学习

HAL_Delay()延时误差约1ms的问题

jestson安装ibus输入法

《统计学习方法(第2版)》李航 第17章 潜在语义分析 LSA LSI 思维导图笔记 及 课后习题答案(步骤详细)第十七章
![[MYCAT] related concepts of MYCAT](/img/44/99d413d47828252267b5242c64960b.png)
[MYCAT] related concepts of MYCAT
![[activiti] activiti introduction](/img/17/bd8f6fd8dd8918a984ca0a3793ec0b.jpg)
[activiti] activiti introduction

STM32 DSP库MDK VC5\VC6编译错误: 256, (const float64_t *)twiddleCoefF64_256, armBitRevIndexTableF64_256,

Jupyter notebook选择conda环境
随机推荐
Positional argument after keyword argument
[FatFs] migrate FatFs manually and transfer SRAM virtual USB flash disk
day-7 jvm完结
DeepSort 总结
《信号与系统》(吴京)部分课后习题答案与解析
[activiti] activiti process engine configuration class
json.dumps()函数解析
Draw contour cv2.findcontours function and parameter interpretation
‘Results do not correspond to current coco set‘
目标检测带标签数据增强代码
Common features of ES6
《统计学习方法(第2版)》李航 第16章 主成分分析 PCA 思维导图笔记 及 课后习题答案(步骤详细)PCA 矩阵奇异值 第十六章
Iotp2pgate two IOT devices point-to-point communication fast implementation scheme
On the concepts of "input channel" and "output channel" in convolutional neural networks
Thymeleaf快速入门学习
IoTP2PGate 两台物联网设备点对点通信快速实现方案
《机器学习》(周志华)第2章 模型选择与评估 笔记 学习心得
[USB host] stm32h7 cubemx porting USB host with FreeRTOS to read USB disk, usbh_ Process_ The OS is stuck. There is a value of 0xa5a5a5
[activiti] gateway
labelme转voc代码中的一个小问题