当前位置:网站首页>Classic model - Nin & googlenet
Classic model - Nin & googlenet
2022-06-26 03:28:00 【On the right is my goddess】
NiN
The problem of full connection layer : Contains a large number of parameters . It's easy to over fit .
Usually transport Enter into through Avenue Count × chart image ruler " × transport Out ruler degree Enter the number of channels \times Image size \times Output scale transport Enter into through Avenue Count × chart image ruler " × transport Out ruler degree
NiN The idea is : Don't connect all layers at all ;
One NiN block :
The convolution layer is followed by two 1x1 Convolution of , The stride is 1, No filling , The output shape is the same as the convolution output . It acts as a full connection layer ( Per pixel ).
NiN The architecture of :
- No full connection layer ;
- Use alternately NiN Blocks and strides are 2 The largest pool layer of ( Gradually reduce the height and width and increase the number of channels );
- Finally, the output is obtained by using the global average pooling layer ( The number of input channels is the number of categories );
If we want to get 1000 Class words , Finally, there is 1000 Channels , The confidence of the corresponding class of this channel is obtained by global average pooling .
summary :
- NiN Blocks use convolution layers +2 individual 1x1 Convolution layer , The latter adds nonlinearity to each pixel ;
- Global average pooling replaces VGG and AlexNet The full connection layer of , Few parameters , It's not easy to over fit .

Parameter used Alex That set , But I added some 1x1 Convolution of .
GoogleNet
How to choose the best super parameter ?
Convolution kernel 、 Pooling ways 、 The channel number ?
Inception block : Every convolution has to , Last concatenation( Height width unchanged , Channel number connection ).

You can see , The function of white blocks is to reduce the complexity of the model by changing the number of channels ( That is, the parameter quantity ). The blue block is used to extract information .
The design idea of first decreasing and then increasing is bottleneck The feeling of .
Inception A block is compared to a single 3x3 or 5x5 Compared to convolution , It has fewer parameters and computational complexity .
meanwhile Inception Blocks also increase the diversity of information learned from them .

Stage1 and Stage2 and VGG Agreement .GoogleNet Used a lot of NiN Thought , Use... In large quantities 1x1 Convolution reduces the amount of parameters .

Compared with AlexNet,GoogleNet The convolution kernel of is relatively small , This allows spatial information not to be compressed very quickly , Support information learning when the number of subsequent channels increases .
meanwhile , Spatial information is compressed , I think it is also a helpless move to increase the number of channels , The purpose is to reduce the number of parameters .

The third stage , You can see that the number of channels is still increasing , But every one of them Inception The parameters of the blocks are different . It is worth mentioning that ,3x3 Convolution is always the most allocated , This is because its parameters are not large , The effect of extracting information is also OK .
Inception There are many variants of the block follow-up ,V2 Joined the BN、V3 Modified convolution size 、V4 Residual connection is added .
边栏推荐
猜你喜欢

Google recommends using kotlin flow in MVVM architecture

How Inkscape converts PNG pictures to SVG pictures without distortion

点击事件

Review of the paper: unmixing based soft color segmentation for image manipulation

ArrayList#subList这四个坑,一不小心就中招

Oracle connectivity issues and Solutions

多媒体元素,音频、视频

类图

Cultivate children's creativity under the concept of project steam Education

浅谈虚拟内存与项目开发中的OOM问题
随机推荐
【哈希表】改进,拉链法哈希结构——直接用两个索引查找,不用每次都hash和%一遍
微信小程序开发准备工作
kitti2bag 安装出现的各种错误
浅谈虚拟内存与项目开发中的OOM问题
Add an "open search description" to the site to adapt to the browser's "site search"“
Butterknife unbinder uses flashback in fragment and viewpager
Kotlin quick start
云计算基础-0
Qixia fire department carries out fire safety training on construction site
工作室第3次HarmonyOS培训笔记
Using meta analysis to drive the development of educational robot
【QT】自定义控件-空气质量仪表盘
Golang regexp package use - 06 - other usage (Special Character conversion, find regular Common prefix, Switch gourmand mode, query regular Group NUMBER, query regular Group name, cut with regular, qu
Wealth freedom skills: commercialize yourself
Do you want to add a key to the applet or for sequence?
类图
Plug in installation and shortcut keys of jupyter notebook
MySQL数据库基础
国信金太阳靠谱吗?开证券账户安全吗?
Components and routing