Google | ICML 2022: The State of Sparse Training in Deep Reinforcement Learning
2022-06-22 19:53:00 【Zhiyuan community】
【Title】 The State of Sparse Training in Deep Reinforcement Learning
【Authors】 Laura Graesser, Utku Evci, Erich Elsen, Pablo Samuel Castro
【Publication date】 2022.6.17
【Paper link】 https://arxiv.org/pdf/2206.10369.pdf
【Why we recommend it】 In recent years, the use of sparse neural networks has grown rapidly across deep learning, especially in computer vision. Their appeal lies mainly in the smaller number of parameters needed for training and storage, and in improved learning efficiency. Somewhat surprisingly, few efforts have explored their use in deep reinforcement learning (DRL). In this work, the authors systematically investigate how several existing sparse training techniques apply across a variety of DRL agents and environments. Their findings corroborate those of sparse training in computer vision: in DRL as well, sparse networks perform better than dense networks for the same parameter count. The authors analyze in detail how the various components of DRL are affected by the use of sparse networks, and conclude by suggesting promising avenues for improving the effectiveness of sparse training methods and advancing their use in DRL.
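To make "the same parameter count" concrete, here is a minimal sketch in plain Python. It is not the authors' code, and the summary above does not say which sparse training techniques were studied. The sketch assumes magnitude-based masking as one common way to sparsify a network, and the layer sizes are hypothetical values chosen only for illustration. It shows a wider network being masked down until it holds exactly as many nonzero weights as a narrower dense network.

```python
import random


def dense_param_count(layer_sizes):
    """Number of weights in a fully connected network (biases ignored)."""
    return sum(a * b for a, b in zip(layer_sizes, layer_sizes[1:]))


def magnitude_mask(weights, n_keep):
    """Return a 0/1 mask that keeps the n_keep largest-magnitude weights."""
    # Indices sorted by descending absolute value; the first n_keep survive.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]), reverse=True)
    keep = set(order[:n_keep])
    return [1 if i in keep else 0 for i in range(len(weights))]


# Hypothetical layer sizes (assumptions, not taken from the paper).
small_dense = [64, 128, 128, 8]   # narrow network, all weights active
wide = [64, 512, 512, 8]          # wide network, to be sparsified

# Parameter budget: the number of weights in the small dense network.
budget = dense_param_count(small_dense)

# Random stand-in weights for the wide network, flattened into one list.
rng = random.Random(0)
wide_weights = [rng.gauss(0.0, 1.0) for _ in range(dense_param_count(wide))]

# Mask the wide network so it keeps exactly `budget` nonzero weights.
mask = magnitude_mask(wide_weights, budget)
sparsity = 1.0 - sum(mask) / len(mask)

print("dense parameters: ", budget)
print("sparse nonzero:   ", sum(mask))
print("sparsity of wide: ", round(sparsity, 4))
```

Both networks end up with the same number of active weights, which is the kind of like-for-like comparison the summary refers to. Actual sparse training methods also decide when and how the mask changes during training, which this sketch leaves out.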