当前位置:网站首页>Effect evaluation of generative countermeasure network
Effect evaluation of generative countermeasure network
2022-07-24 22:46:00 【Ashes little fish man Timo】
Scene description
GAN It is one of the most popular image generation models nowadays ; We can see in many papers that different GAN Generate clear and realistic images . However , If only the naked eye is used to subjectively evaluate the image quality , Obviously, the performance of a model cannot be scientifically evaluated , We need to use appropriate methods to measure quantitatively GAN Generation capacity of , Accurately describe the quality and diversity of the generated samples , Measure the difference between the generated distribution and the real distribution . At present, we often pass IS (Inception Score)、FID ( Frechet Inception Distance ) Evaluate the quality and diversity of the generated samples .
IS and FID Principle
Is It is often used to evaluate the quality of the generated image , In its name Inception originate InceptionNet, Because calculation IS You need a stay ImageNet Pre trained on the dataset Inception-v3 Classification of network .IS Actually, I'm making a KL Divergence calculation , The specific formula is

among ,p(y|x) It refers to a given generated image x, Input it into the pre trained Inception-v3 The category probability output after classifying the network ; p(y) Is marginal distribution , For all generated images , This pre trained classification network outputs the expectation of the probability of the category . If the generated image contains meaningful and clearly identifiable targets , Then the classification network should judge the image as a specific category with high confidence , therefore p(y|x) It should have a small entropy . Besides , To generate images with diversity ,p(y) It should have a large entropy . If p(y) The entropy of is larger ,p(y|x) Small entropy , That is, the generated image contains a lot of categories , And the category of each image is clear and has high confidence , here p(y|x) And p(y) Of KL Great divergence . It can be seen that ,IS The real sample is not compared with the generated sample , It only quantifies the quality and diversity of the generated samples .
FID To make up for it IS Deficiency , The comparison between real samples and generated samples is added . It also inputs the generated samples into the classification network , The difference is ,FID Not the output probability of the last layer of the network p(y|x) To operate , Instead, it operates on the response of the penultimate layer of the network, that is, the characteristic graph . say concretely ,FID It is calculated by comparing the mean and variance of the characteristic graph of the real sample and the generated sample :

among ,
and
Represent the mean and covariance matrix of real samples respectively ,
and
And respectively represent the mean and covariance matrices of the generated samples ,Tr(.) Trace representing matrix .FID The lower the value , It shows that the closer the statistics of the generated sample and the real sample are . However ,FID The characteristic graph is approximated as Gaussian distribution , The way of calculating mean and variance is too intertwined , Unable to evaluate image details .
summary
IS and FID Is currently the GAN The two most widely used evaluation methods in the field of image .IS And FID Realized with GAN Quantitative evaluation of generative capacity , But they are all descriptions of the overall performance , There is no way from diversity 、 Independent measurement of a single generated sample from the perspective of quality . in addition , They all depend on using ImageNet Pre trained classification network , For other types of data sets ( Such as facial images or medical imaging data ) Not very suitable. .
except IS and FID, There are other assessments GAN Methods of generating capabilities , Such as mode score (Mode Score )、 Maximum mean difference 、 Nearest neighbor two sample test ( C2ST) 、 section W- distance ( Sliced Wassecrtein Distance, SWD).
source : In depth learning
边栏推荐
- 图结构的实现,从点到边再到图
- 买收益百分之6的理财产品,需要开户吗?
- JUC concurrent programming - Advanced 05 - lock free of shared model (CAS | atomic integer | atomic reference | atomic array | field updater | atomic accumulator | unsafe class)
- "Fundamentals of program design" Chapter 10 function and program structure 7-2 Hanoi Tower problem (20 points)
- 新手哪个证券开户最好 开户最安全
- Multi task face attribute analysis based on deep learning (based on paddlepaddle)
- CA证书制作实战
- GB、MB、KB分别是什么意思。大小分别是多少
- Filter list
- Let‘s Encrypt
猜你喜欢

老杜Servlet-JSP

图结构的实现,从点到边再到图

暴力递归——N皇后详解 && 如何用位运算进行优化

Network Security Learning (III) basic DOS commands

从暴力递归到动态规划,记忆化搜索

JUC concurrent programming - Advanced 05 - lock free of shared model (CAS | atomic integer | atomic reference | atomic array | field updater | atomic accumulator | unsafe class)

Some analysis of slow MySQL query

First engineering practice, or first engineering thought—— An undergraduate's perception from learning oi to learning development

Implementation of graph structure, from point to edge and then to graph

What is a video content recommendation engine?
随机推荐
QT6 with vs Code: compiling source code and basic configuration
DDos攻击分类
IndexTree2D
[which is better to use, apopost or apifox? Just read this!]
EL & JSTL: JSTL summary
激光雷达障碍物检测与追踪实战——cuda版欧式聚类
新手哪个证券开户最好 开户最安全
PCL point cloud processing ply file reading and saving (54)
单调栈结构练习——子数组最小值的累加和
On the open and closed principle
The size of STM32 stack
Network Security Learning (III) basic DOS commands
聊聊 Redis 是如何进行请求处理
Implementation of graph structure, from point to edge and then to graph
Ranking of engineering project management software
Solve the problem that JSP cannot use session.getattribute()
IndexTree
[cloud native kubernetes] kubernetes cluster advanced resource object staterulesets
Monotonic stack structure exercise -- cumulative sum of minimum values of subarrays
The accuracy of float