当前位置:网站首页>A survey on dynamic neural networks for natural language processing, University of California
A survey on dynamic neural networks for natural language processing, University of California
2022-06-24 16:32:00 【Zhiyuan community】
author :Canwen Xu, Julian McAuley
brief introduction : Scale large objects effectively Transformer Models are the main driving force behind the latest advances in natural language processing . Dynamic neural network is a new research direction , It can dynamically adjust the calculation path of neural network according to the input , Thus, the calculation amount and time can be increased in a sub linear way . Dynamic neural networks may be a promising solution , It can solve the increasing number of parameters of the pre training language model , It can use trillions of parameters for model pre training , Faster reasoning on mobile devices . In this review , The author summarizes the progress of three dynamic neural networks in natural language processing : skimming (skimming)、 Hybrid expert model (mixture of experts) And early exit reasoning (early exit). The author also emphasizes the current challenges and future research direction of dynamic neural network .





Paper download :https://arxiv.org/pdf/2202.07101.pdf
边栏推荐
- Use Google search like a professional
- A set of very good H3C and Tianrongxin Internet cutover scheme templates, with word document download
- One Minute! No code! Add [statistical analysis] to the website
- How do HPE servers make RAID5 arrays? Teach you step by step today!
- Introduction to new features of ECMAScript 2019 (ES10)
- 50 growers | closed door meeting of marketing circle of friends ス gathering Magic City thinking collision to help enterprise marketing growth
- My network relationship with "apifox"
- Introduction of thread pool and sharing of practice cases
- [go] runtime package for concurrent programming and its common methods
- What is the difference between get and post? After reading it, you won't be confused and forced, and you won't have to fight with your friends anymore
猜你喜欢

There are potential safety hazards Land Rover recalls some hybrid vehicles

A new weapon to break the memory wall has become a "hot search" in the industry! Persistent memory enables workers to play with massive data + high-dimensional models

C. K-th Not Divisible by n(数学+思维) Codeforces Round #640 (Div. 4)

Ui- first lesson
MySQL Advanced Series: Locks - Locks in InnoDB
![[download attached] installation and simple use of Chinese version of awvs](/img/3b/f26617383690c86edff465c9a1099e.png)
[download attached] installation and simple use of Chinese version of awvs

Cognition and difference of service number, subscription number, applet and enterprise number (enterprise wechat)

ZOJ——4104 Sequence in the Pocket(思维问题)
![[application recommendation] the hands-on experience and model selection suggestions of apifox & apipost in the recent fire](/img/dd/24df91a8a1cf1f1b9ac635abd6863a.png)
[application recommendation] the hands-on experience and model selection suggestions of apifox & apipost in the recent fire

My network relationship with "apifox"
随机推荐
Global and Chinese market of insect proof clothing 2022-2028: Research Report on technology, participants, trends, market size and share
2021-04-27: if the adjacent position of a character does not have the same character
Istio FAQ: sidecar startup sequence
There are potential safety hazards Land Rover recalls some hybrid vehicles
Interpretation of swin transformer source code
Pytorch transpose convolution
How does the effective date of SAP PP ECM affect the work order?
Applet - use of template
Nature publishes significant progress in quantum computing: the first quantum integrated circuit implementation in history
Go deep into the implementation principle of go language defer
Handling of communication failure between kuberbetes pod
Load MySQL table data consumption quick installation configuration through kafka/flink
6 things all engineers should know before FEA
企业安全攻击面分析工具
D. Solve the maze (thinking +bfs) codeforces round 648 (Div. 2)
How do HPE servers make RAID5 arrays? Teach you step by step today!
Global and Chinese market of computer protective film 2022-2028: Research Report on technology, participants, trends, market size and share
Serial of H3CNE experiment column - spanning tree STP configuration experiment
[tke] analysis of CLB loopback in Intranet under IPVS forwarding mode
期货怎么开户安全些?哪些期货公司靠谱些?