当前位置:网站首页>Deep learning final review
Deep learning final review
2022-06-22 03:16:00 【World War II National University of science and technology】




2.2 Gradient descent method of perceptron , Algorithm flow
- Determine initialization parameters w and b.
- Build a perceptron model .
- Using reverse algorithm , Complete the adjustment of weight coefficient .
notes : Initialization parameters can be set arbitrarily , Finally, the convergence will be completed according to the reverse algorithm .


3.2 What are the activation functions , Draw their pictures , And their respective application scenarios and characteristics




3.3 Neural network training process , Write out your ideas


3.4 Loss function , What are the commonly used loss functions , Form of expression ( The formula ), Applicable occasions

3.5 In the process of neural network training , Over fitting ( The model is too complex ( Too many parameters included ), So that the model fits the training set very well , But the prediction of unknown data is very poor ( Poor generalization ability ).)

(1) Increase the amount of training data
① Collect more data ;
② Data augmentation (image augmentation): Make a series of random changes to the existing data , To produce similar but different training samples , So as to expand the scale of the training data set .
(2) Reduce model complexity
① Reduce hidden layer
② Reduce the number of neurons
(3) Add regular items
tf.keras.regularizers.l2
Parameters used :
l: Penalty item , The default is 0.01.
(4) Early termination (Early Stopping) In the process of training , Record the best validation set results to date , As the continuous 10 individual Epoch( Or more times ) When the best results are not exceeded , It can be considered that the accuracy is no longer improved .
(5)Dropout
Training phase : Every step With a certain probability p Stop the neurons .
Prediction stage : All neurons work , But the ownership value needs to be multiplied by (1-p).






边栏推荐
- [summary of leetcode weekly competition] summary of the 298th weekly competition of leetcode (6.19)
- Jz59. print binary tree in zigzag order
- Sword finger offer 57 Next node of binary tree
- Library management system (PHP final report)
- Right and left vertical time axis with serial number
- Classic case of JS operation node (three-level linkage)
- fastdfs5.0.11安装
- 圖數據庫ONgDB Release v-1.0.2
- Conference chat room - development documents
- [Shangshui Shuo series] day two
猜你喜欢

C2-qt serial port debugging assistant 2021.10.21

FastDFS-6.0.6

C++ primer Chapter 2 summary of variables and basic types

Figure database ongdb release v-1.0.2

ACL 2022 | multilingual knowledge map reasoning based on self supervised graph alignment
![[3. binary integer and floating point number]](/img/82/6c3ef250b90d875cddaebc5bd4a4b8.png)
[3. binary integer and floating point number]

uv_ loop_ Init() process

图数据库ONgDB Release v-1.0.2

Unity3d post process volume profile

JS special effects in the construction of animated web pages
随机推荐
C1-qt idea of realizing simple calculator 2021.10.15
Flink CDC MongoDB Connector 的实现原理和使用实践
Project Management-软件开发之项目管理
【leetcode周赛总结】LeetCode第298场周赛总结(6.19)
Unity3d C # generates non repeated random numbers in the interval
tag动态规划-刷题预备知识-1.动态规划五部曲解题法 + lt.509. 斐波那契数/ 剑指Offer 10.I + lt.70. 爬楼梯彻底解惑 + 面试真题扩展
Select for i/0 multiplexing
Sword finger offer 58 Symmetric binary tree
你是一名技术管理者还是项目管理者?
Redis6.0新特性(下)
【NVMe2.0b 9】控制器初始化过程
【NVMe2.0b 8】NVMe 队列仲裁机制
Sword finger offer 28 Symmetric binary tree
银联支付 返回商户 Nignx post请求405
[go language] we should learn the go language in this way ~ a comprehensive learning tutorial on the whole network
Classification of traffic signs
Use the serialize common command
Day17QtQLcdNumber2021-10-22
[crawler notes 2] mouse events, screenshots and common attack methods
Day12QFile2021-09-27