当前位置:网站首页>[paper reading] temporary binding for semi-superior learning
[paper reading] temporary binding for semi-superior learning
2022-07-24 13:08:00 【The next day is expected 1314】
1. Abstract
In this paper , We propose a simple and effective method , For training deep neural networks in a semi supervised environment , Only a small part of the training data is marked . We introduced Self integration , We use the training output of the network in different periods to form a consensus prediction of unknown tags , most important of all , Under different regularization and input enhancement conditions . Compared with the network output in the recent training period , This integrated prediction can be expected to be a better predictor of unknown tags , Therefore, it can be used as the goal of training .
Notice: The article states straight to the point that this is a method in semi supervised deep neural network . The main contribution is to propose for Model disturbance The idea of , Two models are proposed , Π \mathbf{\Pi} Π model, Temporal ensembling.
2. Algorithm description
2.1. Π \mathbf{\Pi} Π model


Through the flow chart and pseudo code in the paper , We can clearly understand the general flow of the algorithm . Some of the small details , It may need to be found when it reappears , The words here , Just record your questions , If you look back later, read carefully to answer .Q1: The depth dependence of the proposed model is expressed in the paper Input Augment and Dropout, stay Π \Pi Π Model Between perturbed model and undisturbed model Input Augment Is it consistent .Q2: Why is there a difference between the parameters of supervised loss and unsupervised loss in pseudo code loss C C C, among C C C Indicates the number of data labels .
2.2. Temporal ensembling


Journal entry : First of all, the description of the paper is very clear , You can clearly understand the general flow of the algorithm only by looking at the pseudo code . The second is with Π \Pi Π model comparison ,Temporal ensembling The unsupervised loss of is based on the previous model epoch The error between the output of and the current output . It is pointed out in the article that ,Temporal ensembling than Π \Pi Π model faster , as a result of Temporal ensembling Every batch Just do a forward operation , and Π \Pi Π model There are two forward operations . In fact, the essence of faster speed here is Space for time , Similar to caching .
TODO: There are some places in the paper trick The author did not explain , We should acquiesce to the knowledge that everyone knows , But I don't know , You can get to know . for instance :
Z ← α Z + ( 1 − α ) z (1) Z \leftarrow \alpha Z + (1-\alpha)z \tag{1} Z←αZ+(1−α)z(1)
z ~ ← Z / ( 1 − α t ) (2) \tilde{z} \leftarrow Z/(1-\alpha^{t}) \tag{2} z~←Z/(1−αt)(2).
边栏推荐
- Step of product switching to domestic chips, stm32f4 switching to gd32
- ESP32ADC
- 3.实现蛇和基本游戏界面
- SSM online examination system including documents
- Finclip's "applet export app" function has been updated again by the company
- About the concept of thread (1)
- Vscode solves the problem of terminal Chinese garbled code
- cookie
- [C language] detailed knowledge of document operation
- 猿人学第七题
猜你喜欢

leetcode第 302 场周赛复盘

Finclip's "applet export app" function has been updated again by the company

2022.07.21

3. Realize snake and basic game interface

Getting started with SQL join use examples to learn left connection, inner connection and self connection

I 用c I 实现 大顶堆

English grammar_ Indefinite pronouns - Overview

SSM在线租房售房平台多城市版本

Implementation of dynamic columns in EAS BOS doc list

Speech processing based on MATLAB
随机推荐
SSM online campus album management platform
Leetcode's 302 weekly rematch
[stm32] internal independent watchdog iwdg
Summary of recent interviews
About thread (4) thread interaction
How to mount NFS shares using autofs
Windivert: capture and modify packages
[datasheet] interpretation of cs5480 data book of metering chip
【C语言】详细的文件操作相关知识
setAttribute、getAttribute、removeAttribute
English grammar_ Indefinite pronouns - Overview
3.实现蛇和基本游戏界面
FinClip 「小程序导出 App 」功能又双叒叕更新了
I realize large top stack with C I
Redis(13)----浅谈Redis的主从复制
C code specification
开山之作造假!Science大曝Nature重磅论文学术不端,恐误导全球16年
Promise
猿人学第六题
MobileViT:挑战MobileNet端侧霸主