University of Toronto Doctoral Dissertation: Training Efficiency and Robustness in Deep Learning
2022-06-27 19:49:00 【Zhiyuan community】
Thesis link: https://arxiv.org/abs/2112.01423
Deep learning models are inefficient to train: they learn by processing millions of training examples many times over, and they need powerful computing resources to process large batches of data in parallel rather than sequentially. Deep learning models also have unexpected failure modes: they can be fooled into making incorrect predictions.
In this thesis, we study approaches to improve the training efficiency and robustness of deep learning models. In the context of learning visual-semantic embeddings, we find that prioritizing learning on more informative training data increases convergence speed and improves generalization performance on test data. We formalize a simple trick called hard negative mining as a modification of the learning objective function that adds no computational overhead. Next, we seek to improve optimization speed in general-purpose optimization methods for deep learning. We show that a redundancy-aware modification of training data sampling improves training speed, and we develop an efficient method for detecting the diversity of the training signal, namely gradient clustering. Finally, we study adversarial robustness in deep learning and approaches to achieving maximal adversarial robustness without training on additional data. For linear models, we prove that maximal robustness is guaranteed solely by an appropriate choice of optimizer, regularization, or architecture.
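The abstract does not spell out the modified objective, so the following is only a minimal sketch of how hard negative mining can be written as a change to a triplet ranking loss. It assumes a batch similarity matrix `sim` whose diagonal holds the matched image-caption pairs; the function name `triplet_losses`, the `margin` value, and the toy numbers are illustrative assumptions, not taken from the thesis. The sum variant penalizes every violating negative, while the hard-negative variant keeps only the single worst negative per query.

```python
import numpy as np


def triplet_losses(sim, margin=0.2):
    """Return (sum-of-hinges loss, hardest-negative loss) for one batch.

    Assumption: sim[i, j] is the similarity between image i and caption j,
    and matched pairs sit on the diagonal.
    """
    n = sim.shape[0]
    pos = np.diag(sim)  # similarity of each matched pair

    # Hinge cost of caption j acting as a negative for image i.
    cost_cap = np.maximum(0.0, margin + sim - pos[:, None])
    # Hinge cost of image i acting as a negative for caption j.
    cost_img = np.maximum(0.0, margin + sim - pos[None, :])

    # A matched pair is not a negative for itself: zero out the diagonal.
    off_diag = ~np.eye(n, dtype=bool)
    cost_cap = cost_cap * off_diag
    cost_img = cost_img * off_diag

    # Baseline: sum over all negatives in the batch.
    sum_loss = cost_cap.sum() + cost_img.sum()
    # Hard negative mining: keep only the worst negative per query.
    hard_loss = cost_cap.max(axis=1).sum() + cost_img.max(axis=0).sum()
    return sum_loss, hard_loss


# Toy batch of three image-caption pairs (made-up similarities).
sim = np.array([
    [0.9, 0.8, 0.1],
    [0.2, 0.7, 0.3],
    [0.4, 0.6, 0.5],
])
sum_loss, hard_loss = triplet_losses(sim)
print(sum_loss, hard_loss)
```

Both variants reuse the same hinge costs, and the only difference is a `max` in place of a `sum`, which is consistent with the abstract's statement that the modification adds no computational overhead.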