当前位置:网站首页>VPT模型视频讲解
VPT模型视频讲解
2022-06-27 10:32:00 【智源社区】
Minecraft is one of the harder challenges any RL agent could face. Episodes are long, and the world is procedurally generated, complex, and huge. Further, the action space is a keyboard and a mouse, which has to be operated only given the game's video input. OpenAI tackles this challenge using Video PreTraining, leveraging a small set of contractor data in order to pseudo-label a giant corpus of scraped footage of gameplay. The pre-trained model is highly capable in basic game mechanics and can be fine-tuned much better than a blank slate model. This is the first Minecraft agent that achieves the elusive goal of crafting a diamond pickaxe all by itself.
OUTLINE:
0:00 - Intro
3:50 - How to spend money most effectively?
8:20 - Getting a large dataset with labels
14:40 - Model architecture
19:20 - Experimental results and fine-tuning
25:40 - Reinforcement Learning to the Diamond Pickaxe
30:00 - Final comments and hardware
Blog: https://openai.com/blog/vpt/
Paper: https://arxiv.org/abs/2206.11795
Code & Model weights: https://github.com/openai/Video-Pre-Training
边栏推荐
- Future & CompletionService
- C語言學習-Day_04
- 3D mobile translate3d
- Oracle连接MySQL报错IM002
- [methodot topic] what kind of low code platform is more suitable for developers?
- Leetcode 729. 我的日程安排表 I(牛逼,已解决)
- “全班29人24人成功读研”冲上热搜!剩下的5个人去哪了?
- 使用Karmada实现Helm应用的跨集群部署【云原生开源】
- Analysis of mobile ar implementation based on edge computing (Part 2)
- Red envelope rain: a wonderful encounter between redis and Lua
猜你喜欢
随机推荐
Analysis of mobile ar implementation based on edge computing (Part 2)
【TcaplusDB知识库】Tmonitor单机安装指引介绍(一)
mysql数据库汉字模糊查询出现异常
[tcapulusdb knowledge base] tcapulusdb Model Management Introduction
ECMAScript 6(es6)
C语言学习-Day_04
CPU设计(单周期和流水线)
2021 CSP J2 entry group csp-s2 improvement group round 2 video and question solution
基于swiftadmin极速后台开发框架,我制作了菜鸟教程[专业版]
torchvision. models._ utils. Intermediatelayergetter tutorial
21:第三章:开发通行证服务:4:进一步完善【发送短信,接口】;(在【发送短信,接口】中,调用阿里云短信服务和redis服务;一种设计思想:BaseController;)
记一次 .NET 某物管后台服务 卡死分析
JS client storage
C apprentissage des langues - jour 12.
软件系统架构的演变
学习笔记之——数据集的生成
Review of last week's hot spots (6.20-6.26)
Red envelope rain: a wonderful encounter between redis and Lua
oracle触发器 存储过程同时写入
【Methodot 专题】什么样的低代码平台更适合开发者?








