当前位置:网站首页>Gary Marcus发文:AI研究者需要知道的三个来自语言学家的观点
Gary Marcus发文:AI研究者需要知道的三个来自语言学家的观点
2022-06-23 11:20:00 【智源社区】
Everybody knows that large language models like GPT-3 and LaMDA have made tremendous strides, at least in some respects, and powered past many benchmarks, and Cosmo recently described DALL-E but most in the field also agree that something is still missing. A group of engineers at Facebook, for example, wrote in 2019 that:
A growing body of evidence shows that state-of-the-art models learn to exploit spurious statistical patterns in datasets... instead of learning meaning in the flexible and generalizable way that humans do."
Since then, the results on benchmarks have gotten better, but there’s still something missing.
If we had to put our finger on what is still missing, we would focus on these three key elements:
Reference: Words and sentence don’t exist in isolation. Language is about a connection between words (or sentence) and the world; the sequences of words that large language models utter lack connection to the external world.
Cognitive models: The ultimate goal of a language system should be to update a persisting but dynamic sense of the world. Large language models don’t produce such cognitive models, at least not in a way that anybody has been able to make reliable use of.
Compositionality: Complex wholes are (mostly) systematically interpreted in terms of their parts, and how these parts are arranged. Systems like DALL-E face clear challenges when it comes to compositionality. (LLM’s like GPT produce well-formed prose but do not produce interpretable representations of utterances that reflect structured relationships between the parts of those sentences.)
In our view, inadequate attention to these three factors has serious consequences, including:
(a) the tendency of large language models to lose coherence over time, drifting into “empty” language with no clear connection to reality;
(b) the difficulty of large language models in distinguishing truth from falsehoods;
(c) the struggle in these models to avoid perpetuating bias and toxic speech.
Now here’s the thing: none of these three elements we have been stressing are news to linguists. In fact, at least since the work of Gottlob Frege in the late 19th century, they have been pretty central to what many linguists worry about. To be sure, none of these three issues has been solved so far; for example, there is still debate about “how much” of our everyday language use actually relies on compositionality, and what the right cognitive models of language should be. But we do think that linguistics has a lot to offer in terms of formulating and thinking about these questions.
边栏推荐
- Esp32-cam high cost performance temperature and humidity monitoring system
- 【ML】QuantileRegressor
- 强化责任意识和底线思维 全力筑牢抗洪抢险“安全堤”
- 最简单DIY基于STM32的远程控制电脑系统②(无线遥杆+按键控制)
- 视频数据标注工具与平台(数据标注公司)
- 运行时应用自我保护(RASP):应用安全的自我修养
- Creating neural networks using tensorflow2
- 电容参数哪里找!?
- ESP32-CAM高性价比温湿度监控系统
- The simplest DIY pca9685 steering gear control program based on the integration of upper and lower computers of C # and 51 single chip microcomputer
猜你喜欢

最简单DIY基于STM32F407探索者开发板的MPU6050陀螺仪姿态控制舵机程序

坚持五件事,带你走出迷茫困境!

The simplest DIY serial port Bluetooth hardware implementation scheme

最简单DIY基于51单片机的舵机控制器
![[golden section] and [Fibonacci series]](/img/6a/69dba98951d37cdb4793c3d49cbb1a.png)
[golden section] and [Fibonacci series]

最简单DIY基于STM32的远程控制电脑系统①(电容触摸+按键控制)

ESP32-CAM无线监控智能网关的设计与实现

The simplest DIY pca9685 steering gear control program based on the integration of upper and lower computers of C # and 51 single chip microcomputer

程序中创建一个子进程,然后父子进程各自独自运行,父进程在标准输入设备上读入小写字母,写入管道。子进程从管道读取字符并转化为大写字母。读到x结束

开发增效利器—2022年VsCode插件分享
随机推荐
运行时应用自我保护(RASP):应用安全的自我修养
某问答社区App x-zse-96签名分析
Win10 微软输入法(微软拼音) 不显示 选字栏(无法选字) 解决方法
Explain in detail the method of judging the size end
Parity of UART
How to implement a distributed lock with redis
程序中创建一个子进程,然后父子进程各自独自运行,父进程在标准输入设备上读入小写字母,写入管道。子进程从管道读取字符并转化为大写字母。读到x结束
php 正则表达式
What does NFTs, Web3 and metauniverse mean for digital marketing?
电感有极性吗?
Attack and defense drill collection | 3 stages, 4 key points, interpretation of the blue team defense whole process outline
最简单DIY串口蓝牙硬件实现方案
今天14:00 | 12位一作华人学者开启 ICLR 2022
Analysis of LinkedList source code
PHP regular expression
Is the online security of securities account opening high
Is it difficult to register stocks and open accounts online? Is it safe to open an account online now?
Installation and use of binabsinspector, an open source binary file static vulnerability analysis tool
互联网奇迹-小米究竟是怎么盈利
Flutter series: wrap in flutter