A survey on dynamic neural networks for natural language processing, University of California
2022-06-24 16:32:00 【Zhiyuan community】
Authors: Canwen Xu, Julian McAuley
Brief introduction: Effectively scaling large Transformer models is a main driver of recent advances in natural language processing. Dynamic neural networks are an emerging research direction: by dynamically adjusting the computational path according to the input, they can scale up a neural network with only a sub-linear increase in computation and time. They may be a promising answer to the growing parameter counts of pretrained language models, allowing both model pretraining with trillions of parameters and faster inference on mobile devices. In this survey, the authors summarize progress on three types of dynamic neural networks in natural language processing: skimming, mixture of experts, and early exit. The authors also highlight the current challenges of dynamic neural networks and directions for future research.
Paper download: https://arxiv.org/pdf/2202.07101.pdf
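To make the idea of an input-dependent computation path concrete, here is a minimal sketch of early-exit inference in plain Python. It is an illustration only, not code from the survey: the function name `early_exit_predict`, the toy layers and classifiers, and the confidence threshold of 0.9 are all assumptions. A real model would use Transformer layers with learned exit classifiers, and other exit criteria are possible.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def softmax(logits: Sequence[float]) -> Vector:
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def early_exit_predict(
    x: Vector,
    layers: Sequence[Callable[[Vector], Vector]],
    classifiers: Sequence[Callable[[Vector], Vector]],
    threshold: float = 0.9,  # hypothetical confidence threshold
) -> Tuple[int, int]:
    """Run layers one by one and stop once an exit classifier is confident.

    Returns (predicted_class, number_of_layers_used).
    """
    if not layers or len(layers) != len(classifiers):
        raise ValueError("need one exit classifier per layer")
    hidden = x
    for depth, (layer, clf) in enumerate(zip(layers, classifiers), start=1):
        hidden = layer(hidden)
        probs = softmax(clf(hidden))
        confidence = max(probs)
        if confidence >= threshold:
            break  # confident enough: skip the remaining layers
    return probs.index(confidence), depth


# Toy stand-ins (assumptions): each "layer" doubles the hidden vector,
# and each "classifier" reads the hidden vector directly as logits.
layers = [lambda h: [2.0 * v for v in h]] * 3
classifiers = [lambda h: list(h)] * 3

easy = early_exit_predict([2.0, 0.0], layers, classifiers)
hard = early_exit_predict([0.1, 0.0], layers, classifiers)
print(easy, hard)
```

In this toy setup the first input becomes confident after one layer, while the second never reaches the threshold and uses all three layers, so the amount of computation depends on the input.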