当前位置:网站首页>Development history of convolutional neural network (part)
Development history of convolutional neural network (part)
2022-07-25 09:50:00 【Freak I life】
The development of convolution neural network
LeNet
Two layers of convolution + Two pooling layers ( The sample layer )+ Two fully connected layers

AlexNet
(2012)
Bigger and deeper
Relu、Dropout、 Maximum pool layer 、 Data to enhance


VGG
(2015)

VGG block : Use a lot of 3x3( Generally, no matter how big, the effect will be bad ) Stack the convolution layers of and add a pool layer to make VGG block , Then use these VGG Blocks form a network
NIN
(2014)
NiN The full connection layer has been completely eliminated .( NiN The block starts with an ordinary convolution , Then there are two 1×1 The convolution of layer . these two items. 1×1 The convolution layer acts as a band ReLU Activate the pixel by pixel full connection layer of the function . The convolution window shape of the first layer is usually set by the user . The subsequent convolution window shape is fixed to 1×1.( Nonlinearity is added to each pixel ))
NiN Use one NiN block , The number of output channels is equal to the number of label categories . Put one last Global average pooling layer (global average pooling layer), Generate a logarithmic probability (logits).NiN One advantage of the design is , It significantly reduces the number of parameters required by the model ( It's not easy to over fit , Fewer parameters . Improve generalization ). However , In practice , This design sometimes increases the time of training the model .( Convergence is too slow )
** thought :** Calculate an average value for all pixels of the characteristic map of each output channel , After global average pooling, we get a dimension =
= Number of categories Of Eigenvector , Then enter it directly into softmax layer ;
** effect :** Instead of Fully connected layer , Images of any size are acceptable
1) It can better correspond the category to the characteristic diagram of the last convolution layer ( Each channel corresponds to a category , In this way, each feature graph can be regarded as the category confidence graph corresponding to the category )
2) Reduce the amount of parameters , The global average pooling layer has no parameters , Prevents over fitting in this layer
3) Integrate global spatial information , For input pictures spatial translation More robust
(softmax Written in train In the function (loss function ), Not on the network .【train_ch6 Inside 】)

![[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-qfqJJfKm-1658474784355)(C:\Users\a1823.MMY\AppData\Roaming\Typora\typora-user-images\1647058281538.png)]](/img/f8/83d182f27c0da2991508c578b70297.png)
GoogLeNet
(2014)
Increase the depth and width of the network while reducing parameters .
Simultaneous convolution repolymerization on multiple sizes . It is convolution on multiple scales at the same time , It can extract features of different scales .
![[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-Lo8jg1fi-1658474784355)(C:\Users\a1823.MMY\AppData\Roaming\Typora\typora-user-images\1647139071375.png)]](/img/5b/603096c1db1050f4c7f884b8fa9f58.png)
![[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-1YYEJs2Q-1658474784355)(C:\Users\a1823.MMY\AppData\Roaming\Typora\typora-user-images\1647139057858.png)]](/img/0c/c11b4f1fcae410407722f0c3280d29.png)
Five paragraph , Nine inception block , It is the first network to reach hundreds of times .
Batch normalization
![[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-5PZ82Zmz-1658474784355)(C:\Users\a1823.MMY\AppData\Roaming\Typora\typora-user-images\1647329593838.png)]](/img/19/1bc25b65970d876581c7aa4f79644d.png)
It works on :
Full connection layer and convolution layer output , Before activating the function ;
Full connection layer and convolution layer input .
For the full connectivity layer , Acting on the feature dimension ;
For the convolution layer , Acting on the channel dimension .
Batch normalization fixes the mean and variance in a small batch
, Then learn the appropriate offset and zoom .
It can accelerate the convergence speed , But generally, the accuracy of the model is not changed .
ResNet
(2015)
[ Multiply large numbers by decimals , It could be a decimal ; Large number plus decimal , The result is still a large number ]

When the number of layers is too deep , The training effect of neural network becomes worse .
A network without residuals has information loss . and resnet No at all .


![[ Failed to transfer the external chain picture , The origin station may have anti-theft chain mechanism , It is suggested to save the pictures and upload them directly (img-gZqfcLc4-1658474784356)(C:\Users\a1823.MMY\AppData\Roaming\Typora\typora-user-images\1647690009909.png)]](/img/48/5ff31b63467ddef0cbfda0f7455ede.png)
边栏推荐
- 基于PackNet的演进——丰田研究院(TRI)深度估计文章盘点(上)
- [code source] add brackets to the daily question
- expect+sh实现自动交互
- First knowledge of opencv4.x --- image histogram equalization
- [code source] daily one question non decreasing 01 sequence
- 解决QTCreator使用VS编译中文乱码错误
- CDA Level1知识点总结之多维数据透视分析
- [code source] I have a big head for a problem every day
- 初识Opencv4.X----在图像上绘制形状
- MLOps专栏介绍
猜你喜欢

卷积神经网络发展历程(部分)

CDA LEVELⅠ2021新版模拟题二(附答案)

A picture explains SQL join left and right

*7-2 CCF 2015-09-2 date calculation
![[data mining] Chapter 3 basis of data analysis](/img/ec/5d988371ea19919148ab6ef4d0247d.png)
[data mining] Chapter 3 basis of data analysis

~1 CCF 2022-06-2 treasure hunt! Big adventure!

如何安装pytorch?—— 一种最简单有效的方法!

yolov5实现小数据集的目标检测--kolektor缺陷数据集

解决esp8266无法连接手机和电脑热点的问题

预测2021年:加速实现RPA以外的超自动化成果
随机推荐
Surfaceview flash screen (black problem)
T5论文总结
yolov5实现小数据集的目标检测--kolektor缺陷数据集
基于机智云平台的温湿度和光照强度获取
CDA Level1复盘总结
2022年的个人技术选型梳理
chmod和chown对挂载的分区的文件失效
Binary Cross Entropy真的适合多标签分类吗?
Creation of adjacency table of undirected connected graph output breadth depth traversal
鱼眼图像自监督深度估计原理分析和Omnidet核心代码解读
单目深度估计模型Featdepth实战中的问题和拓展
初识Opencv4.X----图像直方图匹配
解决esp8266无法连接手机和电脑热点的问题
[data mining] Chapter 3 basis of data analysis
Hyperautomation for the enhancement of automation in industries 论文翻译
【数据挖掘】第二章 认识数据
## 使用 Kotlin USE 简化文件读写
The shortest path problem Bellman Ford (single source shortest path) (illustration)
CDA Level1多选题精选
Raspberry sect door ban system based on face recognition