当前位置:网站首页>Target detection series fast r-cnn
Target detection series fast r-cnn
2022-06-24 09:01:00 【Bald little Su】
Author's brief introduction : Bald Sue , Committed to describing problems in the most popular language
Looking back : Target detection series —— The work of the mountain RCNN The principle,
Near term goals : Have 10000 fans
Support Xiao Su : give the thumbs-up 、 Collection 、 Leaving a message.
List of articles
- Target detection series ——Fast R-CNN The principle,
- Write it at the front
- Candidate area generation
- Full image input network , The candidate frame is projected onto the feature graph to obtain the feature matrix
- The characteristic matrix is ROI pooling Layers are scaled to uniform size , Then flatten the characteristic map to get the prediction result
- Fast R-CNN Complete process
- Loss function
- Summary
- Reference link
Target detection series ——Fast R-CNN The principle,
Write it at the front
In the last article , We introduced RCNN Principle , Detail stamp *** Here's a brief overview RCNN The algorithm steps of :
- Candidate area generation
- Feature extraction by neural network
- SVM Classifier classification
- The regressor corrects the position of the candidate box
Following RCNN After the release of ,RGB The great God published again Fast R-CNN Let's take a look at the expression in the paper Fast R-CNN Pictures of the results , as follows :

About this classic picture , Now you just need another intuitive feeling , In depth analysis will follow . This article is more RCNN With a big hint ,Fast R-CNN The main steps are as follows :
- Candidate area generation
- Full image input network , The candidate frame is projected onto the feature graph to obtain the feature matrix
- The characteristic matrix is ROI pooling Layers are scaled to uniform size , Then flatten the characteristic map to get the prediction result
You can see , Only RCNN and Fast R-CNN From the steps of , There are still some differences between them , The details of these steps will be discussed below , Let's see
Candidate area generation
Candidate region generation and R-CNN There is no difference between , The same is true of SS Algorithm , I won't repeat it here , What is not clear can be referred to Part 1 R-CCN post
Full image input network , The candidate frame is projected onto the feature graph to obtain the feature matrix
Remember when R-CNN What do we input into the network ? No selling here , stay R-CNN The loser is Jing SS The algorithm gets 2000 Candidate box , This obviously requires a huge amount of computation ; And in the Fast R-CNN in , We only need to input the original image into the feature extraction network to get the feature map of the original image . Here you may have such a problem : Since the input network is the original image , How to use the candidate boxes generated in the original image in the first step ? In fact, this part is based on hekaiming's SPP-Net—— A candidate frame in the original image will be mapped to the corresponding position of the resulting feature map after passing through the neural network , This position is computable . For your convenience , I drew the following figure for your reference :

About the above mapping rules , You can refer to this blog :SPP-net
The characteristic matrix is ROI pooling Layers are scaled to uniform size , Then flatten the characteristic map to get the prediction result
stay Fast R-CNN in , We're not like R-CNN Forced scaling of the picture as in , But after we get the mapping on the characteristic graph ( That is, the candidate box ), Compare these candidate boxes ROI pooling Operation to uniformly scale candidate boxes of different sizes to a uniform size ,ROI pooling The operation is shown in the figure below : That is, regardless of the size of the original feature map , We all divide the feature map into 7*7=49 Equal parts , Then use maximum pooling or average pooling for each serving , Down sampling the original feature map into 7*7 Uniform size .
After the feature map becomes a unified size , It can be flattened and sent to the full connection layer , Then connect softmax Layer and the regressor Layer can output .
Fast R-CNN Complete process
Through the above description , Now I believe that if you look at this picture again, it will be more profound . The general process is consistent with the above , There is no narration here

Loss function
The loss function consists of two parts , Part is classified loss , One part is the boundary box regression loss .


Summary
Fast R-CNN That's all for the principle of , I hope it can help you . It will be updated later Faster_RCNN And related code explanation , Come on !!!
Reference link
If the article is helpful to you , It would be
Whew, whew, whew ~~duang~~ A great bai
Those interested in artificial intelligence can scan the code to join C Station artificial intelligence official Community , Welcome to join the exchange study , Progress together !!
边栏推荐
- The list of open source summer winners has been publicized, and the field of basic software has become a hot application this year
- 开源之夏中选名单已公示,基础软件领域成为今年的热门申请
- 【使用 PicGo+腾讯云对象存储COS 作为图床】
- “论解不了数独所以选择做个数独游戏这件事”
- Solution: Nan occurs in loss during model training
- Opencv maximum filtering (not limited to images)
- MySQL | 视图《康师傅MySQL从入门到高级》笔记
- 不能改变虚拟机电源状态报错解决方案
- Data midrange: analysis of full stack technical architecture of data midrange, with industry solutions
- 【LeetCode】415. 字符串相加
猜你喜欢

What is graph neural network? Figure what is the use of neural networks?

Pymysql inserts data into MySQL and reports an error for no reason

【牛客】HJ1 字符串最后一个单词的长度

【NOI模拟赛】给国与时光鸡(构造)

“不平凡的代理初始值设定不受支持”,出现的原因及解决方法

【LeetCode】541. Reverse string II

从华为WeAutomate数字机器人论坛,看政企领域的“政务新智理”

【LeetCode】541. 反转字符串 II

JS to find and update the specified value in the object through the key

4274. suffix expression
随机推荐
[team management] 25 tips for testing team performance management
所说的Get post:请求的区别,你真的知道了吗??????
原生小程序用画布制作海报,等比例缩放,和uniapp差不多就是写法有点不同
Qingcloud based "real estate integration" cloud solution
阿里资深软件测试工程师推荐测试人员必学——安全测试入门介绍
4274. 后缀表达式
Idea another line shortcut
MySQL | store notes of Master Kong MySQL from introduction to advanced
Data middle office: a collection of middle office construction architectures of large domestic factories
什么是图神经网络?图神经网络有什么用?
Spark - the number of leftouterjoin results is inconsistent with that of the left table
Liunx Mysql安装
Huawei Router: IPSec Technology
Squid代理服务器应用
数据中台:数据采集和抽取的技术栈详解
Installation of sophus package in slam14 lecture
打印出来的对象是[object object],解决方法
How to configure environment variables and distinguish environment packaging for multi terminal project of uniapp development
数据中台:中台实践与总结
Mba-day25 best value problem - application problem