当前位置:网站首页>Richard Sutton, the father of reinforcement learning, paper: pursuing a general model for intelligent decision makers
Richard Sutton, the father of reinforcement learning, paper: pursuing a general model for intelligent decision makers
2022-06-24 13:16:00 【Zhiyuan community】
The idea of this paper is to strengthen and deepen this premise by putting forward views on decision makers , This view in Psychology 、 Artificial intelligence 、 economics 、 Control theory and neuroscience have substantial and extensive applications , I call it a generic model for intelligent agents . The generic model does not include anything specific to any organism 、 Anything in the world or in the field of application . The generic model does include all aspects of the decision-maker's interaction with his world ( There must be input and output , And a goal ) And the internal components of the decision maker ( For perception 、 Decision making 、 Internal assessment and world model ). I identified these aspects and components , Note that they are given different names in different disciplines , But essentially it means the same idea , The challenges and benefits of designing a neutral term that can be used across disciplines are discussed . It is time to recognize and establish the integration of multiple different disciplines on the substantive general model of intelligent agents .
边栏推荐
- 【概率论期末抱佛脚】概念+公式(不含参数估计)
- The pod is evicted due to insufficient disk space of tke node
- Opengauss kernel: simple query execution
- LVGL库入门教程 - 颜色和图像
- 实现领域驱动设计 - 使用ABP框架 - 创建实体
- Another prize! Tencent Youtu won the leading scientific and technological achievement award of the 2021 digital Expo
- What should I do if I fail to apply for the mime database? The experience from failure to success is shared with you ~
- Use abp Zero builds a third-party login module (I): Principles
- 简述聚类分析
- 初中级开发如何有效减少自身的工作量?
猜你喜欢
脚本之美│VBS 入门交互实战
使用 Abp.Zero 搭建第三方登录模块(一):原理篇
快速了解常用的消息摘要算法,再也不用担心面试官的刨根问底
[data mining] final review (sample questions + a few knowledge points)
Comparator 排序函数式接口
华为AppLinking中统一链接的创建和使用
使用 Abp.Zero 搭建第三方登录模块(一):原理篇
CVPR 2022 - Interpretation of selected papers of meituan technical team
MySQL foreign key impact
‘高并发&高性能&高可用服务程序’编写及运维指南
随机推荐
谁是鱼谁是饵?红队视角下蜜罐识别方式汇总
“我这个白痴,招到了一堆只会“谷歌”的程序员!”
申请MIMIC数据库失败怎么办?从失败到成功的经验分享给你~
使用 Abp.Zero 搭建第三方登录模块(一):原理篇
"I, an idiot, have recruited a bunch of programmers who can only" Google "
Leetcode 1218. 最长定差子序列
Attack Science: DDoS (Part 2)
A hero's note stirred up a thousand waves across 10 countries, and the first-line big factories sent people here- Gwei 2022 Singapore
How can ffmpeg streaming to the server save video as a file through easydss video platform?
Open source monitoring system Prometheus
16 safety suggestions from metamask project to solid programmers
The pod is evicted due to insufficient disk space of tke node
What is SCRM? What is the difference between SCRM and CRM
Reset the password, and the automatic login of the website saved by chrome Google browser is lost. What is the underlying reason?
Getting started with the lvgl Library - colors and images
Reading notes of returning to hometown
实现领域驱动设计 - 使用ABP框架 - 创建实体
Understanding openstack network
用一个软件纪念自己故去的母亲,这或许才是程序员最大的浪漫吧
Continuous testing | key to efficient testing in Devops Era