当前位置:网站首页>A survey on dynamic neural networks for natural language processing, University of California
A survey on dynamic neural networks for natural language processing, University of California
2022-06-24 16:32:00 【Zhiyuan community】
author :Canwen Xu, Julian McAuley
brief introduction : Scale large objects effectively Transformer Models are the main driving force behind the latest advances in natural language processing . Dynamic neural network is a new research direction , It can dynamically adjust the calculation path of neural network according to the input , Thus, the calculation amount and time can be increased in a sub linear way . Dynamic neural networks may be a promising solution , It can solve the increasing number of parameters of the pre training language model , It can use trillions of parameters for model pre training , Faster reasoning on mobile devices . In this review , The author summarizes the progress of three dynamic neural networks in natural language processing : skimming (skimming)、 Hybrid expert model (mixture of experts) And early exit reasoning (early exit). The author also emphasizes the current challenges and future research direction of dynamic neural network .





Paper download :https://arxiv.org/pdf/2202.07101.pdf
边栏推荐
- Abnormal dockgeddon causes CPU 100%
- Introduction to new features of ECMAScript 2019 (ES10)
- Tencent blue whale Zhiyun community version v6.0.3 was officially released together with the container management platform!
- Cognition and difference of service number, subscription number, applet and enterprise number (enterprise wechat)
- There are potential safety hazards Land Rover recalls some hybrid vehicles
- Wechat official account debugging and natapp environment building
- 2021-05-02: given the path of a file directory, write a function
- How does easydss, an online classroom / online medical live on demand platform, separate audio and video data?
- Web page live broadcast on demand RTMP streaming platform easydss newly added virtual live broadcast support dash streaming function
- If only 2 people are recruited, can the enterprise do a good job in content risk control?
猜你喜欢
![[application recommendation] the hands-on experience and model selection suggestions of apifox & apipost in the recent fire](/img/dd/24df91a8a1cf1f1b9ac635abd6863a.png)
[application recommendation] the hands-on experience and model selection suggestions of apifox & apipost in the recent fire

A new weapon to break the memory wall has become a "hot search" in the industry! Persistent memory enables workers to play with massive data + high-dimensional models

Ui- first lesson

ZOJ - 4104 sequence in the pocket

My network relationship with "apifox"

C. K-th Not Divisible by n(数学+思维) Codeforces Round #640 (Div. 4)
![[go] concurrent programming channel](/img/6a/d62678467bbc6dfb6a50ae42bacc96.jpg)
[go] concurrent programming channel

Cognition and difference of service number, subscription number, applet and enterprise number (enterprise wechat)

ZOJ——4104 Sequence in the Pocket(思维问题)

There are potential safety hazards Land Rover recalls some hybrid vehicles
随机推荐
Development trend of CAE simulation analysis software
Batch BOM Bapi test
SQL multi table updating data is very slow
Bitwise Operators
Nature publishes significant progress in quantum computing: the first quantum integrated circuit implementation in history
A new weapon to break the memory wall has become a "hot search" in the industry! Persistent memory enables workers to play with massive data + high-dimensional models
An error is reported during SVN uploading -svn sqlite[s13]
There are potential safety hazards Land Rover recalls some hybrid vehicles
AI video structured intelligent security platform easycvr intelligent security monitoring scheme for protecting community residents
The million bonus competition is about to start, and Ti-One will be upgraded to help you win the championship!
2021-05-03: given a non negative integer num, how to avoid circular statements,
A very good educational man and resource center planning scheme, with word file download
Global and Chinese markets of stainless steel barbecue ovens 2022-2028: Research Report on technology, participants, trends, market size and share
MySQL timestamp format conversion date format string
A memory leak caused by timeout scheduling of context and goroutine implementation
Video structured intelligent analysis platform easycvr video recording plan function optimization / regularly delete expired videos
Siggraph 2022 | truly restore the hand muscles. This time, the digital human hands have bones, muscles and skin
Load MySQL table data consumption quick installation configuration through kafka/flink
sql 多表更新数据非常慢
Fastjson 漏洞利用技巧