Reinforcement Learning Weekly, Issue 50: SafeRL-Kit, GMI-DRL, RP-SDRL & Offline Meta-Reinforcement Learning
2022-06-22 20:34:00 [BAAI Community (智源社区)]
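Good news: Reinforcement Learning Weekly now offers a subscription feature, so each new issue will be pushed to you automatically. To subscribe:
1. Register a BAAI Community account.
2. In the author bar at the top left of the weekly's page, click "Reinforcement Learning Weekly" (see the figure below) to open the weekly's home page.
3. Click "Follow" (see the figure below).
4. You are now subscribed, and the BAAI Community will automatically send you each new issue of Reinforcement Learning Weekly.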
Recommended Papers
Title: Fast Population-Based Reinforcement Learning on a Single Machine (InstaDeep Ltd: Arthur Flajolet)
Summary:
https://arxiv.org/pdf/2206.08888.pdf
Title: Logic-based Reward Shaping for Multi-Agent Reinforcement Learning (University of Virginia: Ingy ElSayed-Aly)
Summary:
https://arxiv.org/pdf/2206.08881.pdf
Title: Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning (Peking University: Yaodong Yang)
Summary:
https://arxiv.org/pdf/2206.08686.pdf
Title: SafeRL-Kit: Evaluating Efficient Reinforcement Learning Methods for Safe Autonomous Driving (Tsinghua University Shenzhen Research Institute & JD.com: Xueqian Wang & Li Shen)
Summary:
https://arxiv.org/pdf/2206.08528.pdf
Title: GMI-DRL: Empowering Multi-GPU Deep Reinforcement Learning with GPU Spatial Multiplexing (University of California, Santa Barbara: Yuke Wang)
Summary:
https://arxiv.org/pdf/2206.08482.pdf
Title: Bootstrapped Transformer for Offline Reinforcement Learning (Shanghai Jiao Tong University: Kerong Wang)
Summary:
https://arxiv.org/pdf/2206.08569.pdf
Title: Micro-behaviour with Reinforcement Knowledge-aware Reasoning for Explainable Recommendation (Donghua University: Shaohua Tao)
Summary:
https://www.sciencedirect.com/science/article/pii/S0950705122006529
Title: Neural H₂ Control Using Continuous-Time Reinforcement Learning (CINVESTAV-IPN: Adolfo Perrusquia)
Summary:
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9269440
Title: Residual Physics and Post-Posed Shielding for Safe Deep Reinforcement Learning Method (National University of Singapore: Qingang Zhang)
Summary:
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9796122
Title: Blockchain and Federated Deep Reinforcement Learning Based Secure Cloud-Edge-End Collaboration in Power IoT (North China Electric Power University: Sunxuan Zhang)
Summary:
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9801730
Title: Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning (Peking University: Haoqi Yuan | ICML 2022)
Summary:
https://arxiv.org/pdf/2206.10442.pdf
Title: The State of Sparse Training in Deep Reinforcement Learning (Google: Laura Graesser | ICML 2022)
Summary:
https://arxiv.org/pdf/2206.10369.pdf
Title: Multi-UAV Planning for Cooperative Wildfire Coverage and Tracking with Quality-of-Service Guarantees (Georgia Institute of Technology: Esmaeil Seraj)
Summary:
https://arxiv.org/pdf/2206.10544.pdf
Title: Graph Convolutional Recurrent Networks for Reward Shaping in Reinforcement Learning (Concordia University: Hani Sami)
Summary:
https://www.sciencedirect.com/science/article/pii/S0020025522006442
Title: Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches (Qatar University: Omar Elharrouss)
Summary:
https://arxiv.org/pdf/2206.08016.pdf
Title: Reinforcement Learning based Recommender Systems: A Survey (University of Calgary: M. Mehdi Afsar)
Summary:
https://dl.acm.org/doi/pdf/10.1145/3543846

