当前位置:网站首页>Context encoders: feature learning by painting paper notes
Context encoders: feature learning by painting paper notes
2022-07-24 05:00:00 【Magic__ Conch】
Computer Vision and Pattern Recognition Nov 2016
Abbreviation of the network CE, The overall architecture of the model is an encoder - decoder .
Model structures,
Network structure
- Encoder Composed of convolution ( from AlexNet Separated from ), Used to extract multiple feature map. 227 × 227 227\times227 227×227 Code to 6 × 6 × 256 6\times6\times256 6×6×256.
- The full connection layer is used in multiple feature map To send messages ( Light use Encoder The convolution layer of cannot connect all positions in a characteristic graph ).
In order to reduce the amount of calculation , This network uses the channel full connection layer . That is, for multiple characteristic graphs , Not all of them are connected by one brain , Instead, each feature graph is connected separately .
In this case , If there is m individual n × n n\times n n×n Characteristic graph , Then the parameter quantity will change from the original m 2 n 4 m^2n^4 m2n4 Reduce to m n 4 mn^4 mn4.
- Decoder It is mainly composed of deconvolution and upper sampling layer ( Coincide with Encoder contrary ), Is used to feature map enlarge To original size .
Loss function
It's made up of two parts . Use only L2 Loss function doesn't work , Because it does not have translation invariance , It makes it easier for the generated pixel values to be biased towards the average . So the author adds here adversarial loss.
- reconstruction loss (L2)
L r e c ( x ) = ∥ M ^ ⊙ ( x − F ( ( 1 − M ^ ) ⊙ x ) ) ∥ 2 2 \mathcal{L}_{r e c}(x)=\|\hat{M} \odot(x-F((1-\hat{M}) \odot x))\|_{2}^{2} Lrec(x)=∥M^⊙(x−F((1−M^)⊙x))∥22
effect : Capture the overall structure of the repair area and its consistency with the surrounding visible area .
M M M yes mask,x It's the input image , ⊙ \odot ⊙ Is the product of elements ,F yes context encoder The result .
- adversarial loss( suffer GAN inspire )
effect : Make the prediction of the repaired area look more realistic .
GAN It is usually composed of generator and discriminator , This article makes the previous Encoder-Decoder The result is a generator , Therefore, it is only necessary to define an additional discriminator .
L a d v = max D E x ∈ X [ log ( D ( x ) ) + log ( 1 − D ( F ( ( 1 − M ^ ) ⊙ x ) ) ) ] \begin{aligned} \mathcal{L}_{a d v}=\max _{D} &\ \mathbb{E}_{x \in \mathcal{X}}[\log (D(x))+\log (1-D(F((1-\hat{M}) \odot x)))] \end{aligned} Ladv=Dmax Ex∈X[log(D(x))+log(1−D(F((1−M^)⊙x)))]
During training , function F And D A combination of SGD( Stochastic gradient descent ) To optimize .
Joint loss function
Is to add the two loss functions after weighting .
L = λ r e c L r e c + λ a d v L a d v . \mathcal{L}=\lambda_{r e c} \mathcal{L}_{r e c}+\lambda_{a d v} \mathcal{L}_{a d v} . L=λrecLrec+λadvLadv.
appendix
About L2 The loss cannot be directly used , original text .
边栏推荐
- MapReduce concept
- [essay] goodbye to Internet Explorer, and the mark of an era will disappear
- LabVIEW主VI冻结挂起
- How to make the words on the screen larger (setting method to make the text more comfortable on the large screen)
- Introduction to MapReduce
- [cornerstone of high concurrency] multithreading, daemon thread, thread safety, thread synchronization, mutual exclusion
- Summary of the development process and key and difficult points of the Listening Project
- Little black leetcode journey: 100 same trees
- Web3 product manager's Guide: how to face the encryption world
- 力。操处于业务低峰期。进口调用会帮您准备时,每个字
猜你喜欢

微信朋友圈的高性能架构设计
![[essay] goodbye to Internet Explorer, and the mark of an era will disappear](/img/1f/1fa596cf89bbade3079271dc1c324f.png)
[essay] goodbye to Internet Explorer, and the mark of an era will disappear

greatest common divisor

-Bash: wget: command not found

链接预测中训练集、验证集以及测试集的划分(以PyG的RandomLinkSplit为例)

Yolov7 -- brief introduction of the paper

Chapter V communication training

mapreduce概念

Quick reference manual for the strongest collation of common regular expressions (glory Collection Edition)

Icml2022 | rock: causal reasoning principle on common sense causality
随机推荐
Kingbase V8R6集群安装部署案例---脚本在线一键扩容
打印1000年到2000年之间的闰年
What if the computer can't take screenshots? The solution to the problem that the shortcut screen capture key of the computer cannot be used
How to play the Microsoft twin tool twinsonot? Introduction to twin test tool twinornot
What is the proper resolution of the computer monitor? Introduction to the best resolution of monitors of various sizes and the selection of different wallpapers
Division of training set, verification set and test set in link prediction (take randomlinksplit of pyg as an example)
Summary of the development process and key and difficult points of the Listening Project
power. The operation is in the low peak period of business. Import call will help you prepare each word
What does the red five pointed star in the lower right corner of sina Weibo avatar mean? How to become a master of sina Weibo?
力。操处于业务低峰期。进口调用会帮您准备时,每个字
Unable to delete the file prompt the solution that the file cannot be deleted because the specified file cannot be found
Little black gnawing leetcode:589. Preorder traversal of n-ary tree
微信朋友圈的高性能架构设计
[network counting experiment report] Cisco LAN Simulation and simple network test
Sort - quicksort
Chapter 0 Introduction to encog
The x-fkgom supporting the GOM engine key.lic is authorized to start
Face algorithms
Sword finger offer special assault edition day 7
节都需能有问题制端口, 第一个下标 。很多机器成