当前位置:网站首页>[paper reading] temporary binding for semi-superior learning
[paper reading] temporary binding for semi-superior learning
2022-07-24 13:08:00 【The next day is expected 1314】
1. Abstract
In this paper , We propose a simple and effective method , For training deep neural networks in a semi supervised environment , Only a small part of the training data is marked . We introduced Self integration , We use the training output of the network in different periods to form a consensus prediction of unknown tags , most important of all , Under different regularization and input enhancement conditions . Compared with the network output in the recent training period , This integrated prediction can be expected to be a better predictor of unknown tags , Therefore, it can be used as the goal of training .
Notice: The article states straight to the point that this is a method in semi supervised deep neural network . The main contribution is to propose for Model disturbance The idea of , Two models are proposed , Π \mathbf{\Pi} Π model, Temporal ensembling.
2. Algorithm description
2.1. Π \mathbf{\Pi} Π model


Through the flow chart and pseudo code in the paper , We can clearly understand the general flow of the algorithm . Some of the small details , It may need to be found when it reappears , The words here , Just record your questions , If you look back later, read carefully to answer .Q1: The depth dependence of the proposed model is expressed in the paper Input Augment and Dropout, stay Π \Pi Π Model Between perturbed model and undisturbed model Input Augment Is it consistent .Q2: Why is there a difference between the parameters of supervised loss and unsupervised loss in pseudo code loss C C C, among C C C Indicates the number of data labels .
2.2. Temporal ensembling


Journal entry : First of all, the description of the paper is very clear , You can clearly understand the general flow of the algorithm only by looking at the pseudo code . The second is with Π \Pi Π model comparison ,Temporal ensembling The unsupervised loss of is based on the previous model epoch The error between the output of and the current output . It is pointed out in the article that ,Temporal ensembling than Π \Pi Π model faster , as a result of Temporal ensembling Every batch Just do a forward operation , and Π \Pi Π model There are two forward operations . In fact, the essence of faster speed here is Space for time , Similar to caching .
TODO: There are some places in the paper trick The author did not explain , We should acquiesce to the knowledge that everyone knows , But I don't know , You can get to know . for instance :
Z ← α Z + ( 1 − α ) z (1) Z \leftarrow \alpha Z + (1-\alpha)z \tag{1} Z←αZ+(1−α)z(1)
z ~ ← Z / ( 1 − α t ) (2) \tilde{z} \leftarrow Z/(1-\alpha^{t}) \tag{2} z~←Z/(1−αt)(2).
边栏推荐
- 3.实现蛇和基本游戏界面
- Analysis of ISP one click download principle in stm32
- The basis of point graph in the map of life information and knowledge
- 29. Right view of binary tree
- flinksql 在yarn上面怎么 以 perjob 模式跑啊,我在sqlclient 提交任务之
- setAttribute、getAttribute、removeAttribute
- 中国消费者和产业链都很难离开苹果,iPhone的影响力太大了
- C language course design -- hotel management system
- 27. Longest increasing subsequence
- 26. Reverse linked list II
猜你喜欢
随机推荐
Are there any useful and free redis client tools recommended?
flinksql 在yarn上面怎么 以 perjob 模式跑啊,我在sqlclient 提交任务之
Summary of recent interviews
基于matlab的语音处理
猿人学第六题
Experience on how to improve the anti-interference of TTL (UART) communication
Proxy
About packaging objects
Step of product switching to domestic chips, stm32f4 switching to gd32
2022.07.15 暑假集训 个人排位赛(十)
基于Kubernetes v1.24.0的集群搭建(一)
1.9. touch pad test
[datasheet] interpretation of cs5480 data book of metering chip
SSM online rental and sales platform multi city version
I 用c I 实现 大顶堆
Modification of EAS login interface
The EAS BOS development environment client cannot be started, but the server does show that it is ready
28. Rainwater connection
Raspberry pie self built NAS cloud disk -- automatic data backup
34. Add two numbers






