当前位置:网站首页>Development Series III of GaN (lapgan, srgan)
Development Series III of GaN (lapgan, srgan)
2022-07-24 17:33:00 【51CTO】
GAN Development Series III of (LapGAN、SRGAN)
We have already introduced it in the previous article GAN The introduction to generating countermeasure networks and some GAN series , In the following album will continue to introduce some of the more classic GAN.
GAN Introduction to generating countermeasure network
GAN The development of the series one (CGAN、DCGAN、WGAN、WGAN-GP、LSGAN、BEGAN)
GAN The development of series 2 (PGGAN、SinGAN)
One 、 LapGAN
The paper :《Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks》
Address of thesis :https://arxiv.org/abs/1506.05751
1、 The basic idea
LapGAN It's based on GAN and CGAN On the basis of , use Laplacian Pyramid The pyramid of Laplace To generate images from thick to thin , So as to generate high-resolution images . At each level of the pyramid, there are learning residuals with adjacent levels , By constantly stacking CGAN Get the final resolution .CGAN As we mentioned in the previous article, it is in GAN Add conditional constraints on the basis of , To alleviate the original GAN The generator generates samples too freely .
original GAN The formula of is :

CGAN The formula of is :

2、 The pyramid of Laplace
Laplacian pyramid is the result of continuous up sampling of images in scale space , Gaussian pyramid is the result of continuous down sampling of images in scale space . First build Gaussian pyramid , To image I0 For continuous K Next sampling , obtain

Is the first K The Laplace pyramid on level is

The Laplace pyramids on other levels are :

Laplacian pyramid No k The layer is equal to the Gaussian pyramid k Layer minus Gaussian pyramid k+1 Upper sampling of layer .
Use the Laplace pyramid to restore the image :

3、LapGAN principle
With K=3 For example , At this time, the pyramid of Laplace is 4 The layer structure , contain 4 A generator G0、G1、G2、G3, Generate... Separately 4 A resolution image 64x64、32x32、16x16、8x8, The lowest resolution image to train the original GAN, Input only noise , Later, higher resolution image training CGAN, Input the image sampled on the Gaussian pyramid image with noise and the same level .
LapGAN Through a series of CGAN In series , Constantly generate higher resolution .


LAPGAN stay CIFAR10、STL10 and LSUN Experiments were conducted on three data sets , The generated image is as follows :

Two 、 SRGAN
The paper 《Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network》
Address of thesis :https://arxiv.org/pdf/1609.04802.pdf
Code address :https://github.com/OUCMachineLearning/OUCML/blob/master/GAN/srgan_celebA/srgan.py
1、 The basic idea
SRGAN Yes, it will GAN Applied to the field of image super-resolution ,CNN Convolutional neural network has achieved very good results in the traditional super-resolution reconstruction , High peak signal-to-noise ratio can be achieved PSNR, With MSE Is the objective function of minimization .SRGAN Is the first to recover 4 The algorithm framework of down sampling image , The author proposes a perceptual loss function , Including confrontation loss and content loss , The counter loss comes from the discriminator , It is used to distinguish the real image from the generated super-resolution image , Content loss focuses on visual similarity .

Usually, image super-resolution algorithm uses the mean square error between the reconstructed super-resolution image and the real image MSE As an objective function , Optimize MSE So as to improve PSNR, however MSE and PSNR The value of is not a good indicator of the visual effect , The following figure PSNR The vision with the highest value is not good .

2、 Network structure
Usually per pixel MSE Due to excessive smoothing, it is difficult to deal with the super-resolution details of the image , This paper designs a new loss function , Will be per pixel MSE Replace loss with content loss . Perceived loss is expressed as the weighted sum of content loss and adversarial loss ,

Content Loss It is the loss per pixel of the feature map of a certain layer as the content loss ,

Adversarial Loss Against the loss

The network structure proposed by the author is as follows , The generator consists of a residual structure Residual blocks form ,

The author uses sub-pixel Network as a generative network , use VGG As a discriminant network GAN Got very good results , But this uses the difference per pixel as the loss function . after , The author tries to use the perceptual loss function proposed by himself as the optimization goal , although PSNR and SSIM Not high , But the visual effect is better than other networks , Avoid the over smooth characteristics of other methods .



边栏推荐
- Canvas from getting started to persuading friends to give up (graphic version)
- Atcoder beginer 202 e - count descendants (heuristic merge on heavy chain split tree for offline query)
- Logical operation of image pixels
- Is it safe for Mr. qiniu to open a securities account? Can I drive it?
- 微信朋友圈的高性能复杂度分析
- opencv自带颜色操作
- Memory allocation and recycling strategy
- Nearly 30 colleges and universities were named and praised by the Ministry of education!
- JSP custom tag library -- select tag
- [wechat official account H5] authorization
猜你喜欢
![[wechat official account H5] authorization](/img/d1/2712f87e134c0b8b8fdeaab9e30492.png)
[wechat official account H5] authorization

Demonstration experiment of scrollbar for adjusting image brightness

Heuristic merging (including examples of general formula and tree heuristic merging)
![[GNN report] Tencent AI Lab Xu TingYang: graph generation model and its application in molecular generation](/img/5f/c790baf8f8e62fca36fdb4492c38b2.png)
[GNN report] Tencent AI Lab Xu TingYang: graph generation model and its application in molecular generation

Colleges and universities have introduced 23 Philippine doctors with heavy funds, and the relevant departments have launched an investigation!

Array learning navigation

portmap 端口转发

别再到处乱放配置文件了!试试我司使用 7 年的这套解决方案,稳的一秕

Coldplay weekly issue 10

Atcoder Beginner 202 E - Count Descendants(离线查询 重链剖分树上启发式合并)
随机推荐
微信朋友圈的高性能复杂度分析
Is computer monitoring true? Four experiments to find out
ShardingSphere数据库读写分离
List of stringutils and string methods
Bring 120W goods in 15 seconds. You can also shoot such a popular video
awk从入门到入土(17)awk多行写法
2022 Niuke summer multi school K - link with bracket sequence I (linear DP)
一个实际使用SwiftUI 4.0中ViewThatFits自适应视图的例子
2022 ranking list of database audit products - must see!
Open source Invoicing system, 10 minutes to complete, it is recommended to collect!
Rare earth Developer Conference | Apache pulsar committee Liu Dezhi shares the way of cloud native technology transformation
Df2net 3D model deployment
[GNN report] Tencent AI Lab Xu TingYang: graph generation model and its application in molecular generation
数论整除分块讲解 例题:2021陕西省赛C
Iftnews | Christie's launched its venture capital department, aiming at Web3 and metauniverse industries
hcip第三天
ufw 端口转发
Stop littering configuration files everywhere! Try our 7-year-old solution, which is stable
What should we pay attention to in the resume of software testing?
Analog electricity - what is the resistance?