当前位置:网站首页>How to understand the mechanism of memory and attention in deep learning? This 212 page doctoral thesis of Deakin University gives you answers
How to understand the mechanism of memory and attention in deep learning? This 212 page doctoral thesis of Deakin University gives you answers
2022-07-16 09:09:00 【Zhiyuan community】

Intelligence needs memory . There is no memory , Human beings cannot complete all kinds of important tasks , For example, reading novels 、 Play games or solve math problems . The ultimate goal of machine learning is to develop intelligent systems that automatically learn and act like humans , Therefore, the memory construction of the machine is inevitable . The artificial neural network connects the calculation units through weights , Modeling neurons and synapses in the brain , This is a typical machine learning algorithm similar to memory structure . Their descendants have more sophisticated modeling techniques ( Deep learning ), It has been successfully applied to many practical problems , And proved the importance of memory in the process of machine system learning . In recent years , The research of memory modeling in deep learning mainly focuses on the external memory structure , Inspired by computational Turing model and biological neuron system . Attention mechanism is produced to support the acquisition and retention of external memory . Despite the lack of theoretical basis , These methods have shown the hope of helping the machine system reach a higher level of intelligence . The purpose of this paper is to improve the understanding of memory and attention in deep learning . Its contributions include : (i) Present a collection of memory categories ,(ii) Construct a new memory enhancing neural network supporting multiple control and memory units (MANN), (iii) Introduce variability through memory in the sequence generation model ,(iv) Find the best write operation in the slot based memory network to maximize the memory ability ;(v) A new type of external memory through neural network —— Neural memory program memory to simulate general-purpose Turing machine .
The contents of the catalog :
1 Introduction
2 Taxonomy for Memory in RNNs 9
3 Memory-augmented Neural Networks 40
4 Memory Models for Multiple Processes 57
5 Variational Memory in Generative Models 89
6 Optimal Writing Memory 103
7 Neural Stored-Program Memory 120
8 Conclusions 134
In this paper , We introduce several memory types of Neural Networks , Especially recurrent neural networks (RNNs). We emphasize memory as RNN The concept of external storage , among rnn You can learn to read and write external memory , To support their working memory ( The first 2 Chapter ). We reviewed the solution training RNN Difficult progress , Such as gating and attention mechanism , Especially slot based MANN, This is the main content of the new model proposed in Chapter 3 . Our main contributions are in four aspects . First , We MANN As a multi process multi viewpoint model to deal with complex problems , Such as sequence-to-sequence Mapping and multi view sequence learning ( Chapter four ). We further expand MANNs As a model of discrete sequence, the variability and consistency of session data are required ( The first 5 Chapter ). We also explain to memory last , We introduce a new class MANN, It follows the principle of storing programs , Different functions can be performed by switching the program of the controller .

边栏推荐
猜你喜欢

HCIP第四天笔记

HCIP第六天笔记

Memory mapping principle and detailed explanation (very practical)

How to set various displays in 3dmax2021?

Audio and video learning (VII) -- aac audio code stream

mac通过brew方式安装mysql5.7

vscode 输入 !不提示,没法自动补全的解决方法(最新)

Preorder and inorder traversal sequences determine a binary tree (restore binary tree)

用 AnimatedBuilder 分离组件和动画,实现动效复用

redis实现分布式锁
随机推荐
ODOO form视图详解(一)
032 SQL知识点小记
Emqx cloud update: add redis and JWT external authentication authorization
np.unravel_index() 求出数组某元素(或某组元素)拉成一维后的索引值在原本维度(或指定新维度)中对应的索一般与np.argmax(A) 或 np.argmin(A) 配合使用
常用邮箱访问协议
Security of odoo
Selected multiple choice questions of spark final exam
Hcip third day notes
Séquence de traversée de l'ordre initial et de l'ordre moyen pour déterminer un arbre binaire (restauration de l'arbre binaire)
通过网上的客户经理办理炒股开户,安全吗?
Cao Jie spark programming Scala version after class exercise answers
排序——基数排序
js中树形结构的深度遍历与广度遍历
林子雨spark scala版编程小结
1,jvm类加载机制
渲染流程,代码是如何变为页面的(一)
HCIP第三天笔记
Qstring regular expression of QT
判断两棵二叉树是否同构,三种实现方式(递归、队列、堆栈)
Hcip day 6 notes