当前位置:网站首页>【CVPR 2021】DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort

【CVPR 2021】DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort

2022-06-26 09:24:00 _ Summer tree

 Insert picture description here

Speed reading summary

DatasetGAN:
A process of automatically generating a large number of high-quality semantic segmentation image data sets , Need the least manpower . Only a few labeled samples are needed to train decoder Generate the rest of the potential space , Thus, an infinite annotation data generator . The resulting dataset can then be used to train any computer vision architecture .

Annotation cost is the bottleneck of data scale .
Our goal is to synthesize large and high-quality label data sets , Only a few examples of labels are needed .
In our work , We show the latest and most advanced image generation models to learn very powerful potential representations , It can be used for complex pixel level tasks .

We introduced DatasetGAN, It can generate a large number of high-quality semantic segmentation image data sets , Need the least manpower .
The key to our approach is to observe , Trained to synthesize images GANs Must acquire rich semantic knowledge , To present diverse and realistic examples of objects .
Our key point is , Training a successful decoder requires only a small number of labeled images , Thus, an infinite annotation data set generator .
Because we only need to mark a few examples , therefore We annotate the image in great detail , And generate data sets with rich objects and partial segmentation .

We are 7 Image segmentation tasks generate data sets , These include 34 Personal face pieces and 32 Pixel level labels for car parts . Our approach is significantly superior to all semi supervised baselines , And it is equivalent to the method of full supervision , Although in some cases you need two orders of magnitude less annotation data .
In our work , We show the animation of the object 3D The reconstruction , There we use our method to generate detailed part tags .

 Insert picture description here
DATASETGAN Composite image annotation pairs , Large high-quality datasets with detailed pixel level labels can be generated . Figure shows this 4 A step .(1,2). utilize StyleGAN, Only a few composite images are annotated . Train an efficient branch to generate labels .(3). Automatically generate a huge synthetic annotation image data set .(4). Train your favorite methods with synthetic datasets , And test it on real images .

 Insert picture description here chart 2:DATASETGAN The overall architecture of . We from StyleGAN Upsampling features are mapped to the highest resolution , Construct pixel level feature vectors for all pixels on the composite image . Then train MLP The set of classifiers , The semantic knowledge in the pixel feature vector is interpreted into its component label .

 Insert picture description here chart 3“: Small human annotated face and car datasets . Most datasets used for semantic segmentation (MS-COCO [33], ADE [56], cityscape[11]) It's too big , The user cannot check every training image . In this picture , We showed all the marked faces (a-c) And cars (d-f) Split training example .a) Shows an example of a segmentation mask and associated tags ,b) Shows the complete set of training images (GAN sample ),c) Shows a partial list of dimensions and the number of instances in the dataset . An interesting fact is , Please note that , There are more tags in a single image than in a dataset .

 Insert picture description here

chart 4: come from DATASETGAN Examples of synthetic images and labels of faces and cars .StyleGAN For backbone 1024 Zhang 1024 Resolution CelebA-HQ (faces) Images and 512 Zhang 384 Resolution LSUN CAR (cars) Image training .DATASETGAN use 16 An annotated example for training . // This is annotated What label is it ?

 Insert picture description here
chart 5: come from DATASETGAN The birds of China 、 cat 、 Examples of composite images and labels for bedrooms .StyleGAN stay NABirds(10241024 A picture )、LSUN CAT(256256 A picture ) and LSUN Bedroom(256256 A picture ) To be trained on .DATASETGAN stay 30 Only annotated bird samples 、30 A cat and 40 Training in a bedroom .

 Insert picture description here
chart 6: The number of training examples is the same as mIOU We compare... On the benchmark ADE-Car- 12 Test set . The red dotted line indicates the full supervision method , It makes use of information from ADE20k Of 2.6k Training examples . // mIOU What is it? ?

 Insert picture description here

Method

The key insight of DATASETGAN is that generativemodels such as GANs that are trained to synthesize highlyrealistic images must acquire semantic knowledge in theirhigh dimensional latent space.

DATASET-GAN aims to utilize these powerful properties of imageGANs. Intuitively, if a human provides a labeling corre-sponding to one latent code, we expect to be able to effec-tively propagate this labeling across the GAN’s latent space.

Specifically, we synthesize a small num-ber of images by utilizing a GAN architecture, StyleGANin our paper, and record their corresponding latent featuremaps.

By sampling latent codeszand passing eachthrough the entire architecture, we have an infinite datasetgenerator!

This video explanation is not bad : https://www.bilibili.com/video/av502581865/

原网站

版权声明
本文为[_ Summer tree]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/02/202202170551501780.html