当前位置:网站首页>[paper reading] temporary binding for semi-superior learning
[paper reading] temporary binding for semi-superior learning
2022-07-24 13:08:00 【The next day is expected 1314】
1. Abstract
In this paper , We propose a simple and effective method , For training deep neural networks in a semi supervised environment , Only a small part of the training data is marked . We introduced Self integration , We use the training output of the network in different periods to form a consensus prediction of unknown tags , most important of all , Under different regularization and input enhancement conditions . Compared with the network output in the recent training period , This integrated prediction can be expected to be a better predictor of unknown tags , Therefore, it can be used as the goal of training .
Notice: The article states straight to the point that this is a method in semi supervised deep neural network . The main contribution is to propose for Model disturbance The idea of , Two models are proposed , Π \mathbf{\Pi} Π model, Temporal ensembling.
2. Algorithm description
2.1. Π \mathbf{\Pi} Π model


Through the flow chart and pseudo code in the paper , We can clearly understand the general flow of the algorithm . Some of the small details , It may need to be found when it reappears , The words here , Just record your questions , If you look back later, read carefully to answer .Q1: The depth dependence of the proposed model is expressed in the paper Input Augment and Dropout, stay Π \Pi Π Model Between perturbed model and undisturbed model Input Augment Is it consistent .Q2: Why is there a difference between the parameters of supervised loss and unsupervised loss in pseudo code loss C C C, among C C C Indicates the number of data labels .
2.2. Temporal ensembling


Journal entry : First of all, the description of the paper is very clear , You can clearly understand the general flow of the algorithm only by looking at the pseudo code . The second is with Π \Pi Π model comparison ,Temporal ensembling The unsupervised loss of is based on the previous model epoch The error between the output of and the current output . It is pointed out in the article that ,Temporal ensembling than Π \Pi Π model faster , as a result of Temporal ensembling Every batch Just do a forward operation , and Π \Pi Π model There are two forward operations . In fact, the essence of faster speed here is Space for time , Similar to caching .
TODO: There are some places in the paper trick The author did not explain , We should acquiesce to the knowledge that everyone knows , But I don't know , You can get to know . for instance :
Z ← α Z + ( 1 − α ) z (1) Z \leftarrow \alpha Z + (1-\alpha)z \tag{1} Z←αZ+(1−α)z(1)
z ~ ← Z / ( 1 − α t ) (2) \tilde{z} \leftarrow Z/(1-\alpha^{t}) \tag{2} z~←Z/(1−αt)(2).
边栏推荐
- Redis(13)----浅谈Redis的主从复制
- sql的where+or的用法丢失条件
- EfficientFormer:轻量化ViT Backbone
- Why does 2.tostring() report an error
- [datasheet] PHY ksz9031 gigabit network chip interpretation
- ESP32ADC
- Prototype inheritance
- 34. Add two numbers
- The core capability of accelerating enterprise data application innovation flexibility
- Take chef and ansible as examples to get started with server configuration
猜你喜欢

SSM online campus album management platform

Implement is by yourself_ default_ constructible

About the concept of thread (1)

sql的where+or的用法丢失条件
EfficientFormer:轻量化ViT Backbone

Usage of swipemenurecyclerview

【C语言】详细的文件操作相关知识

SQL JOIN 入门使用示例学习左连接、内连接、自连接

Wang Ping, co-founder of Denglin Technology: Innovation + self research "dual core" drive, gpu+ enabling AI takes root | quantum bit · viewpoint sharing review

27. Longest increasing subsequence
随机推荐
Writing browser plug-ins
Atcoder beginer contest 261e / / bitwise thinking + DP
26. Reverse linked list II
25. Middle order traversal of binary tree
flinksql 在yarn上面怎么 以 perjob 模式跑啊,我在sqlclient 提交任务之
27. Longest increasing subsequence
基于boost库的搜索引擎
Mobilevit: challenge the end-to-side overlord of mobilenet
SSM在线租房售房平台多城市版本
Use of PageHelper
About packaging objects
AtCoder Beginner Contest 261E // 按位思考 + dp
如何使用autofs挂载NFS共享
Roller_ Block default behavior_ Zero roll event compatible
28. Rainwater connection
Redis (13) -- on master-slave replication of redis
cookie
34. Add two numbers
Implementation of dynamic columns in EAS BOS doc list
Square root of 33.x