当前位置:网站首页>AI writes its own code to let agents evolve! The big model of openai has the flavor of "human thought"
AI writes its own code to let agents evolve! The big model of openai has the flavor of "human thought"
2022-06-25 03:42:00 【QbitAl】
Jin Lei Dream morning From the Aofei temple
qubits | official account QbitAI
Do things !
AI“ see ” I took a look GitHub How do humans submit updates on the Internet (commit) Of , Then imitate human beings Programmers modify code ……
Final , This AI Also successful “ Training ” There's a agent robot :

No kidding , such Consider very fear Things about , stay OpenAI In a newly published study , It really happened ……

Originally , What the researchers want to solve is a genetic programming (GP) problem —— Let an intelligent robot learn to move .
(GP It is a special field in evolutionary computation , It is mainly aimed at automatically building programs to solve problems independently .)
but OpenAI The sword moves sideways , Put your own large-scale language model (LLM) Put it in , The result is a big “ Never in my wildest dreams ”.
Previously, in the process of agent evolution , Human researchers need to be involved in making some detailed adjustments 、 Determine the evolution direction , Let agents develop in a good direction .
Ok now , These jobs are all taken care of by the big model , Learn by yourself 、 Write your own code 、 Go by yourself “ Training ”:

As soon as this matter is discussed Joel Lehman Exposure on the Internet , It immediately attracted a lot of attention from netizens :

A programmer netizen called directly after reading “ To keep pace with ( technology ) The pace of development ” 了 :

even to the extent that OpenAI I have said in my research that :
It has bridged the gap between evolutionary algorithm and human thought .
So this one “ Magic ” Things about ,AI How did you do it ?
Take a look at GitHub,AI Knock the code yourself
Design mobile robot in virtual environment , It is a very popular project in genetic algorithm research .
especially Sodarace The competition needs less calculation , Easy visualization of processes is very popular .

The rules are simple , from “ The joints ” and “ muscle ” The robot races on all kinds of terrain .
OpenAI It also specially rewrites the entire competition program from a special genetic code to Python edition , To demonstrate the versatility of the new approach to modern programming languages .
For example, such a paragraph Python Code , Can be used as the initial seed robot .

Define the four vertex joints of a square 、 End joint , Both of them use “ muscle ” After connecting , give the result as follows .

However, such a square structure cannot move , Next, we need to modify the code by genetic algorithm .
According to the research team , Modify the code with traditional genetic algorithm VS Human programmers do it themselves , There are still two gaps in efficiency :
One is that the software is becoming more and more complex , Humans can do modular code reuse to deal with , At present, the most advanced genetic algorithm can not do this in the programming language used by human beings .
The other is that almost all genetic algorithms rely on random mutation (mutation), Every time a human programmer changes his code, he has a purpose , Or add functions 、 Or improve efficiency 、 Or repair bug.
So is there any way to make AI Learn how humans modify code ?
There are , All the required training data exist GitHub On .
Good programmers write code every time they submit it commit describe , Make it clear what has been modified in this submission .

commit Describe the code comparison before and after the submission diff data , Namely AI Great learning materials .

The researchers sifted through some descriptions with clear intent 、 Submit data to train a small amount of modified code GPT-3 Architecturally AI Model .
It's equivalent to letting AI Learn from human programmers how to purposefully modify a piece of code .
The model used in this paper does not need a full version GPT-3 Of 1750 Billion parameters , The highest 7.5 Billion parameters are enough .

Thus the basic AI Model , It will play the role of mutation operator in genetic algorithm .
Next let's AI The process of designing a new robot is divided into three steps .

First step , Start with the classic MAP-Elites The algorithm generates a set of initial robots .

This is a kind of QD( Quality diversity ) Algorithm , It can ensure that robots have different behaviors and high quality .

The second step , Use the initial data generated in the first step for pre training , Give Way AI First learn to design a robot within the training data distribution .
That is, the moving picture that amazed everyone on the Internet at the beginning , It shows AI How to move the immovable “ square ” Transformed into a mobile robot with alternating legs .

The third step , Combined with reinforcement learning algorithm to make fine adjustment , Give Way AI It can generate robots that can adapt to the environment according to different terrain conditions .


Final , The researchers selected the robot evolved from the first three seeds to show the effect .
It can be seen that their structure and movement are completely different .

The netizen exclaimed “ The train of thought is strange ”
Once the study was published , It can be said that one stone aroused thousands of waves .
Many netizens are amazed at this “ Big model + Evolutionary algorithm ” A novel way of combining :

Researchers who have done related work also said , I never thought I could use a big model to diffs To learn about mutation :

In addition to the discussion of the research form and itself , There are also netizens with this picture :

Emmm…… It's a little like that .
team introduction
The team members of this study are all from OpenAI.
The first paper is Joel Lehman, Is a machine learning scientist . Its focus areas include artificial intelligence security 、 Reinforcement learning and open search algorithms .

meanwhile ,Joel Lehman Previously, he wrote a scientific book based on his thinking on the development of artificial intelligence 《 Why can't greatness be planned : The secret of objectivity 》:

And the next step for this study ,Joel Lehman I said :
There is another important issue , It is how much the model can be applied to other environments .
GP The mutation effect in can now be achieved by ELM Greatly improve , This will stimulate a wide range of new applications and research directions .
So does this study have any new implications for you ?
Welcome to comment and discuss ~
Reference link :
[1]https://arxiv.org/abs/2206.08896
[2]https://twitter.com/joelbot3000/status/1538770905119150080?s=21&t=l8AASYjgC6RAEEimcQaFog
— End —
「 qubits · viewpoint 」 Live registration
With the development of biometric technology , How to build enhanced identity authentication in the digital age ? How to realize the electronic license with both security and privacy ? How will the essence and real value of artificial intelligence be reflected in industrial digitalization ?
6 month 23 On Thursday , Participate in the live broadcast , Answer for you ~

Focus on me here , Remember to mark the star ~
边栏推荐
- Is it safe to open an account with flush securities?
- 程序员真人秀又来了!呼兰当主持挑灯狂补知识,SSS大佬本科竟是药学,清华朱军张敏等加入导师团...
- Cloud native database vs traditional database
- Program. Launch (xxx) open file
- CVPR大会现场纪念孙剑博士,最佳学生论文授予同济阿里,李飞飞获黄煦涛纪念奖...
- What if Alipay is controlled by risk for 7 days? Payment solution
- Void* pointer
- 孙武玩《魔兽》?有图有真相
- 浏览器下载的文件属性里都有保护,如何去掉
- 在Microsoft Exchange Server 2007中安装SSL证书的教程
猜你喜欢

Before the age of 36, Amazon transgender hackers were sentenced to 20 years' imprisonment for stealing data from more than 100million people!

AI越进化越跟人类大脑像!Meta找到了机器的“前额叶皮层”,AI学者和神经科学家都惊了...

腾讯开源项目「应龙」成Apache顶级项目:前身长期服务微信支付,能hold住百万亿级数据流处理...
![[FPGA] serial port controls temperature acquisition by command](/img/63/19b909437b59867c577b39b1138f8f.png)
[FPGA] serial port controls temperature acquisition by command

可能是拿反了的原因

马斯克被诉传销索赔2580亿美元,台积电公布2nm制程,中科院发现月壤中含有羟基形式的水,今日更多大新闻在此...

完美洗牌问题

How to play well in the PMP Exam?

大咖说*计算讲谈社|如何提出关键问题?

AI自己写代码让智能体进化!OpenAI的大模型有“人类思想”那味了
随机推荐
Performance rendering of dSPACE
发布功能完成02《ivx低代码签到系统制作》
Insurance app aging service evaluation analysis 2022 issue 06
威马招股书拆解:电动竞争已结束,智能排位赛刚开始
DateTimeFormat放到@RequestBody下是无效的
Is it safe for tonghuashun securities to open an account
后台页制作01《ivx低代码签到系统制作》
Is it safe for Guoxin golden sun to open an account in the steps of opening new bonds
什么是SSL证书,拥有一个SSL证书有什么好处?
支付宝被风控7天怎么办?付解决方案
Is it safe to open an account on your mobile phone?
XML modeling
Is it safe to open an account in the way of winning 100% of the new bonds
大咖说*计算讲谈社|如何提出关键问题?
Is it safe to open an account on the compass? Is it reliable?
TensorFlow,危!抛弃者正是谷歌自己
The era of copilot free is over! Student party and defenders of popular open source projects can prostitute for nothing
How to play well in the PMP Exam?
Skywalking implements cross thread trace delivery
[proteus simulation] Arduino uno+ nixie tube display 4X4 keyboard matrix keys