当前位置:网站首页>15+ urban road element segmentation application, this segmentation model is enough
15+ urban road element segmentation application, this segmentation model is enough
2022-06-24 04:09:00 【Paddlepaddle】
This article is already in Flying propeller The official account is issued , Please check the link :
15+ Application of urban road element segmentation , Using this segmentation model is enough !
Image semantic segmentation is a classical and challenging task in computer vision . It aims to provide detailed pixel level image classification , It is equivalent to assigning semantic labels to each pixel . This technology is now widely used in urban security 、 System fields such as road condition judgment , For example, the application of map navigation is to recognize buildings by segmentation 、 wall 、 Road elements such as pavement conditions , So as to capture the key information of pavement more accurately .
In order to let everyone get started more quickly , Baidu visual technology department is based on Flying propeller Image segmentation development kit PaddleSeg It provides a complete set of practical examples of Urban Street View Road element segmentation industry , Provides data preparation from 、 The whole process scheme of model training and optimization , Lower the threshold of industrial landing . In this project, we need to put 19 Split the key objectives , So what is our specific plan idea ?

chart 1 Segmentation example
Click on GET Project links
https://aistudio.baidu.com/aistudio/projectdetail/4038141?contributionType=1
All source code and tutorials have been open source , Welcome to use .
Project difficulties
The goal is complex
The road is complicated : Include straight line , Turn a corner , Traffic lights, intersections, etc ;
The environment is complex : Adapt to the day 、 Night 、 Foggy and rainy days, etc ;
Scene is complicated : Urban Rd 、 rural 、 Expressway and other scenes are quite different .
The sample is unbalanced
There are many categories : Including pavement 、 Sidewalk 、 building 、 wall 、 fence 、 Pole 、 traffic lights 、 traffic sign 、 Vegetation 、 ground 、 sky 、 people 、 Cyclists 、 vehicle 、 truck 、 The bus 、 train 、 The motorcycle 、 Bicycle ;
unbalanced : At most... Will appear in each image 15 Cars and 30 A pedestrian , Sometimes there will be 2 Cars and no pedestrians , And various degrees of occlusion and truncation .
Model selection
The mainstream semantic segmentation schemes include the following series :
FCN(Fully Convolution Network): Total convolution network , As a precedent of using deep learning for image segmentation , Its symbolic significance is greater than its practical significance .
U-Net series : stay UNet Before , The main partition networks are straight - barrel , Only the top-level or later layers of information are used for up sampling reconstruction . and UNet Is a convolution layer directly connected to the input .
DeepLab series :DeepLab In the field of image segmentation is another series , There are already several versions , And before UNet Compared with the series , The main difference is in the processing of the input image and the structure of the network .DeepLab The image pyramid is mainly used 、 Cavity convolution 、SPP Space Pyramid pooling 、 Can separate convolution and other methods to improve the effect of segmentation .
HRNet series :HRNet yes 2019 A new neural network was proposed by Microsoft Research Asia in , Different from the previous convolutional neural network , The network can still maintain high resolution in the deep layer of the network , Therefore, the predicted semantic information is more accurate , It is also more accurate in space .
Transformer series : since Transformer Since it was introduced into computer vision , It gave birth to a large number of related research and applications . In the direction of image segmentation , Emerged like SETR、TransUNet、SegFormer、MaskFormer Based on Transformer Semantic segmentation network model . It breaks the restriction of convolution structure on the access of image global information .
Because the segmentation target is complex , We selected the one with better accuracy HRNet In the series MscaleOCRNet Follow up experiments on the model , it SOTA Of mIoU Reached 87%. Compared with HRNet Network structure , It calculates a relation weight between each pixel and other pixels of the image on the result of segmentation , A superposition with the original feature constitutes OCRNet The Internet , Then based on the OCRNet Carry out layered and multi-scale training to form the final MscaleOCRNet, The multi-scale training and reasoning method is shown in the figure below .

chart 2 MscaleOCRNet programme
Algorithm optimization
In order to further improve the accuracy , Solve the problem of sample imbalance , We provide the following optimization ideas :
Modify the pre training model : take mapillary Pre training changed to Cityscapes Pre training model , Migrate to KITTI-STEP Data set training can effectively improve the segmentation effect ;
Add multi-scale training : from [0.5,1.0] The two scales are increased to [0.5,1.0,2.0] Three scales ;
Modify the input size : Modify the input dimension by 1024x512 Change to original drawing size 1248x384.

Using tools
This project uses PaddleSeg Development complete .PaddleSeg Is based on Flying propeller PaddlePaddle Developed an end-to-end image segmentation development kit , It covers a large number of high-quality segmentation models in different directions such as high precision and lightweight . Through modular design , Provides configurable drivers and API Call two applications , Help developers more easily complete the whole process of image segmentation applications from training to deployment . Provide semantic segmentation 、 Interactive segmentation 、 Panoramic segmentation 、Matting Four image segmentation capabilities .
Model deployment
Use Flying propeller Native inference base Paddle Inference, Used for server-side model deployment , In general, it is divided into three steps :
1. establish PaddlePredictor, Set the exported model path ;
2. Create a for input PaddleTensor, The incoming to PaddlePredictor in ;
3. Get output PaddleTensor, Take out the results .

If you want to know more details , Welcome to our live course , The whole process of teaching is waiting for you .
Wonderful course preview
In order to make the kids more convenient to use the example tutorial , Baidu senior R & D Engineer will On 6 month 23 Japan ( Thursday )20:00 spot Prepare data for in-depth analysis 、 The whole development process from scheme design to model optimization deployment , Hand in hand to teach you code practice .
Focus on Flying propeller Official account to sign up for live broadcast class
Join the technology exchange group
References : chart 2 Quote from “Hierarchical Multi-Scale Attention for Semantic Segmentation”
Focus on 【 Flying propeller PaddlePaddle】 official account
Get more technical content ~
This article is shared in Blog “ Flying propeller PaddlePaddle”(CSDN).
If there is any infringement , Please contact the [email protected] Delete .
Participation of this paper “OSC Source creation plan ”, You are welcome to join us , share .
边栏推荐
- Clang代码覆盖率检测(插桩技术)
- Browser rendering mechanism
- Several good books for learning data
- 抢先报名丨新一代 HTAP 数据库如何在云上重塑?TiDB V6 线上发布会即将揭晓!
- What is a 1U server? What industries can 1U servers be used in?
- How to set the domain name on the server what is the role of the domain name
- "The first share of Chinese member e-commerce" gathered in the anti reptile attack and defense war | talk with industrial security experts
- Real time monitoring of water conservancy by RTU of telemetry terminal
- JVM调优简要思想及简单案例-怎么调优
- The collection method of penetration test, and which methods can be used to find the real IP
猜你喜欢
随机推荐
Tell you about mvcc
开源之夏2022中选结果公示,449名高校生将投入开源项目贡献
flutter系列之:flutter中的offstage
How to modify the channel name registered by the camera in the easygbs national standard platform?
"The first share of Chinese member e-commerce" gathered in the anti reptile attack and defense war | talk with industrial security experts
Methods of creating and modifying shell script files in batch
3. go deep into tidb: perform optimization explanation
Halcon knowledge: contour operator on region (2)
golang clean a slice
SQL注入绕过安全狗思路一
Do you understand TLS protocol?
hprofStringCache
Submit sitemap to Baidu
祝贺钟君成为 CHAOSS Metric Model 工作组的 Maintainer
[code Capriccio - dynamic planning] t392 Judgement subsequence
2021 graphic design trend: aesthetic response to chaos
How to draw the flow chart of C language structure, and how to draw the structure flow chart
Wide & deep model and optimizer understand code practice
How to spell the iframe address of the video channel in easycvr?
Structure size calculation of C language struct




![[numpy] numpy's judgment on Nan value](/img/aa/dc75a86bbb9f5a235b1baf5f3495ff.png)




