当前位置:网站首页>Zhiyuan community weekly 86: Gary Marcus talks about three linguistic factors that can be used for reference in large model research; Google puts forward the Wensheng graph model parti which is compar
Zhiyuan community weekly 86: Gary Marcus talks about three linguistic factors that can be used for reference in large model research; Google puts forward the Wensheng graph model parti which is compar
2022-06-24 13:58:00 【Zhiyuan community】
Must see every week AI Point of view 、 Research and various resources , Don't miss an important piece of information ! Welcome to click here , Subscribe to Zhiyuan community AI weekly .
Point of view
“ If we want to focus on the missing elements of the pre training big model , There are three key factors that should be considered :
1. Reference resources (Reference): Words and sentences do not exist in isolation . Language is a word / The connection between sentences and the outside world , The word sequences in large language models and their lack of connection with the outside world .
2. Cognitive models (Cognitive models): The ultimate goal of the language system is to update the world , Continuous but dynamic perception . Large models do not produce such cognitive models , At least there is no such recognition that people can reliably use it .
3. form (Compositionality): A complex whole , in the majority of cases , Be able to systematically explain the part where it is , And how these parts are organized together . image DALL-E When it comes to the composition of such a system , Face significant challenges . for example ,GPT...... Can not produce a reflection of the structural relationship between sentences 、 An interpretable representation .”
—— In recent days, , When it comes to the defects of the pre training large model , Professor, New York University Gary Marcus I think we can learn three important factors from linguistics .( Extended reading )
“( In this paper ) I propose a general model called agent (Common Model of the Intelligent Agent) The concept of , Such decision makers (Decision Maker) It can be substantially and widely applied to psychology 、 Artificial intelligence 、 economics 、 Control theory and neuroscience ...... This generic model includes many aspects : Decision makers interact directly with them , You need input 、 Output and goals , And the system composition within the decision-maker , For perception 、 Decision making 、 Internal assessment , And a world model . I noticed that they have different names in different disciplines , But essentially the same concept ...... Now is the time to endorse and build a substantial generic agent model , It can span and integrate multiple fields .”
—— In a new paper this year , The father of reinforcement learning Richard Sutton A general model of intelligent decision maker is proposed , Think it can unify the research of multiple disciplines .( Extended reading )
Scientist trends
6 month 20 Japan , Professor at the University of Texas at Austin Scott Aaronson Announced in OpenAI Work for a year , Its main responsibility is to think about AI security and alignment (AI Safety and Alignment) Theoretical basis .Scott Araonson Professor of computer science at the University of Texas at Austin , Director of quantum information center , His research areas include the performance and limitations of quantum computers , More generalized computational complexity theory, etc .2020 In, he was awarded a prize for his contribution in the field of quantum computing ACM Calculation Award .
Oren Etzioni He is an honorary professor at the University of Washington , He was a professor in the Department of computer science and engineering . At present, he will continue to serve as CEO until this year 9 month 30 Japan , Then he served as a member of the board of directors and a consultant .AI2 By the late co-founder of Microsoft Paul Allen On 2014 An artificial intelligence research institute established in the United States in , The development includes NLP And other artificial intelligence research and engineering projects , Well known projects include academic search engines Semantic Scholar etc. .
In the past two years , Scientists who have resigned from large domestic and foreign technology companies , There are two main development paths : One is to return from industry to academia , The second is to leave the big factory 、 Start your own business . This article takes stock of AI Domestic start-up companies joined by scientists , Like little ice 、 Innovative wisdom 、 Circular intelligence, etc , And the development of these scientists .
Research frontier
- Google's proposal is based on Pathways Autoregressive Wensheng graph model Parti, The effect is comparable to Imgen
- The father of reinforcement learning Richard Sutton writing : A general model for pursuing intelligent decision makers
- Tsinghua tianjixin X Chip boarding Science Robotics
- OpenAI Propose a video pre training model VPT, You can play Minecraft game
Mechanism dynamics
- OpenAI Three products (GPT-3、Copilot、DALL-E) The number of registered users has exceeded one million ,DALL-E This goal has been achieved in less than three months
- CIFAR Announce the second phase of Pan Canada AI strategic , Will provide more than... Within ten years 4.43 Billion dollars in funding
- Cohere、OpenAI、AI21 Three best practice guidelines for jointly publishing deployment models
Activities
- Video playback | 2022 The video of the opening ceremony of Zhiyuan conference and sub forum was online
- Event registration | China Artificial Intelligence Society :2022 China International Intelligent Driving forum - Intelligent driving of technological change (6 month 25 Japan )
- Event registration | MIT、 Wisconsin 、UMass、 Researchers such as the University of Utah :MLNLP The eighth academic seminar (6 month 26 Japan )
- Event registration | University of Illinois at urbana - Champagne (UIUC) Li Bo : The combination of machine learning and knowledge reasoning in trusted machine learning (6 month 30 Japan )
resources
- FlagAI Feizhi :AI Basic model open source project , Support one click call OPT Wait for the model
- NATO group study report : Knowledge representation and reasoning - Overview of current technology and future opportunities
- 2021 China deep learning software framework Market Research Report
- CVPR2022 Microsoft 《 Progress of visual language pre training 》 course
View Pre Workout 、 Weekly content in areas such as intensive learning , Welcome to click here .
Weekly clue collection and cooperation , Please contact the :[email protected]
边栏推荐
- 图扑软件数字孪生海上风电 | 向海图强,奋楫争先
- #21Set经典案例
- 【R语言数据科学】(十四):随机变量和基本统计量
- 远程办公之:在家露营办公小工具| 社区征文
- Kotlin coordination channel
- 2022 Quality Officer - Equipment direction - post skills (Quality Officer) recurrent training question bank and online simulation examination
- Explain kubernetes backup and recovery tools velero | learn more about carina series phase III
- 10 reduce common "tricks"
- kotlin 协程通道
- [AI player cultivation record] use AI to identify what kind of wealth is next door
猜你喜欢
SAP Marketing Cloud 功能概述(四)
Google waymo proposed r4d: remote distance estimation using reference target
融云通信“三板斧”,“砍”到了银行的心坎上
Daily question 8-515 Find the maximum value in each tree row
《中国数据库安全能力市场洞察,2022》报告研究正式启动
SAP Marketing Cloud 功能概述(三)
**Unity中莫名其妙得小问题-灯光和天空盒
一键生成大学、专业甚至录取概率,AI填报志愿卡这么神奇?
谷歌WayMo提出R4D: 采用参考目标做远程距离估计
[R language data science] (XIV): random variables and basic statistics
随机推荐
吉时利静电计宽测量范围
源碼解析 Handler 面試寶典
Jerry's test mic energy automatic recording automatic playback reference [article]
Seven challenges faced by data scientists and Solutions
How to manage tasks in the low code platform of the Internet of things?
Eight major trends in the industrial Internet of things (iiot)
SAP Marketing Cloud 功能概述(四)
Use of kotlin arrays, collections, and maps
The first open source MySQL HTAP database in China will be released soon, and the three highlights will be notified in advance
The research on the report "market insight into China's database security capabilities, 2022" was officially launched
Tupu software is the digital twin of offshore wind power, striving to be the first
谷歌WayMo提出R4D: 采用参考目标做远程距离估计
Kotlin asynchronous flow
[5g NR] 5g NR system architecture
Kotlin language features
How to avoid serious network security accidents?
Jerry's seamless looping [chapter]
HarmonyOS-3
Mysql题目篇
2022年江西省安全员B证考试题库模拟考试平台操作