当前位置:网站首页>The five minute demonstration "teaches" actors to speak foreign languages and can seamlessly switch languages. This AI dubbing company has just received a round a financing of 20million US dollars
The five minute demonstration "teaches" actors to speak foreign languages and can seamlessly switch languages. This AI dubbing company has just received a round a financing of 20million US dollars
2022-06-25 06:26:00 【QbitAl】
Line early From the Aofei temple
qubits | official account QbitAI
It only takes five minutes of actors' voice material , You can make him speak another language in the movie ?
I don't believe it until I see this video , Listen to the effect of this passage :
This video is taken from 《 Son of Bodo 》( English name Every Time I Die), Is a english Thriller .
But we can see in the broadcast , Just one click , You can convert English into Spanish at any time , And it still sounds like the voice of the original actor .
Even talking in horror 、 The trembling details were faithfully inherited , Show us a AI The magical power of dubbing .
Of course , This wave of operation has not surprisingly moved many investors .
The company that made this paragraph Deepdub ( Deep dubbing ), Recently in A Got... In the round of financing 2000 Thousands of dollars . Among the investors are the former president of Fox TV studio 、Snyk Co-founder of 、Meta Vice president of engineering, etc .
AI Dubbing impacts the traditional mode
AI Why is dubbing so expected ? Because it contains huge business opportunities .
Need to know , English audiences in the United States and other places are not used to watching subtitles . therefore , Facing some excellent works in non English , They have a strong Localization needs , That is, the English dubbing version .
For example, some time ago, the fire broke out Korean dramas 《 Squid game 》, At the premiere 28 Days. , The total viewing time is 16.5 100 million hours , Add up to 18.2 In ten thousand, . Become at one stroke Netflix The number one program in history .
But such a big cake , From a traditional point of view , It's very hard to eat .
△ Figure note :《 Squid game 》 Play volume , The first row in the right column
for example , Local publishers have to spend money translating the script , We have to hire a voice actor to play the role 、 Rent space and equipment 、 Complete a lot of dubbing and recording , Finally, we need to splice the dubbing into the original video .
There are also many cultural differences .
This one comes down , According to the market, we should 15-20 Zhou .
and Deepdub Of AI The dubbing method only requires the original actor to record five minutes of random text , Let the neural network learn the actor's voice and express it in another language .
It sounds like the original actor learned another language , And the same workload can be completed in only four weeks , Including translation 、 Adaptation 、 Mixing, etc .
In terms of technical details ,Deepdub Not much public , Maybe it can be used in GitHub On fire Mocking Bird Make reference .
It only takes five seconds , You can clone any Chinese voice , Then use the same voice color to synthesize other voice content , Realize the process from voice to text and then to voice .
The model structure is mainly composed of the speaker encoder (Speaker encoder)、 Synthesizer (Synthesizer) Harmony coder (Vocoder) form .

The speaker encoder ( green ) Extract the feature vector of the speaker's voice , Learn timbre .
Then the traditional TTS(Text-to-Speech) link :
In the synthesizer ( Blue ) The speech features are integrated into the specified text , Take the Mel spectrum as the intermediate variable , Transmit the generated speech spectrum to the vocoder ( Red ).
Finally, the depth autoregressive model WaveNet As a vocoder , Use the spectrum to generate the final speech .
however ,Deepdub Although he didn't disclose his technical details , But they claim to have taken the lead in this field of academic research .
This is also a bit credible , From their products 、 The investment obtained and the background of brother founders can also be seen :
Younger brother Nir Krakowski Yes 25 Years of professional R & D experience , brother Ofir Krakowski He also worked in the machine learning Department of the Israeli air force ……
AI There are many racing cars in dubbing track
Of course , There are more than just people who like this market Deepdub a , It's just a little different in strategy .
Deepdub It is the way to modify the audio , The video content remains intact . They plan to use this round of financing to expand the team's marketing 、 Research and engineering department , And is talking about cooperation with Hollywood .
British companies Papercup Methods adopted and Deepdub similar , Also focus on audio , Redeploy the original actor's voice through the flip , Use synthetic sound , Keep the video the same .
And the other one Flawless In audio, we also rely on dubbing actors , But I can edit the face and mouth shape in the video , It looks more like speaking the target language .
Like the others , Amazon and other technology giants are also doing relevant research , But there is no product yet .
So it seems , Maybe we can really create the video industry in the future “ Babel Tower ”, Make barrier free communication in online drama .
Or, , Some individual actors really don't have to memorize their lines ?
Reference link :
[1]https://techcrunch.com/2022/02/10/deepdub-raises-20m-for-a-i-powered-dubbing-that-uses-actors-original-voices/
[2]https://venturebeat.com/2022/02/10/deepdub-closes-fresh-financing-round-for-ai-that-dubs-movies-shows-and-games/
边栏推荐
- JS dynamic table creation
- Guess the size of the number
- Metauniverse in 2022: robbing people, burning money and breaking through the experience boundary
- Detailed explanation of @jsoninclude annotation in Jackson
- An interview question record about where in MySQL
- Gb28181 protocol -- timing
- C switch nested syntax
- The sum problem
- @The difference between notempty, @notnull and @notblank
- Hands on deep learning (III)
猜你喜欢

Day21 JMeter usage basis

Es11 new methods: dynamic import(), bigint, globalthis, optional chain, and null value merging operator

Getting started with Silverlight development 1

JS to determine whether an element exists in the array (four methods)

Methods for obtaining some information of equipment
![[kicad image] download and installation](/img/88/cebf8cc55cb8904c91f9096312859a.jpg)
[kicad image] download and installation

Exercise: completion
![[Suanli network] technological innovation of Suanli Network -- Key Technologies of green and security](/img/52/7dedc5b6e213839fbf5cee3963ac99.jpg)
[Suanli network] technological innovation of Suanli Network -- Key Technologies of green and security

Three tier architecture experiment

ctfshow-misc
随机推荐
Why can't GC () free memory- Why does gc() not free memory?
Arm register (cortex-a), coprocessor and pipeline
Analysis report on global and Chinese pharmaceutical excipients industry competition and marketing model 2022-2028
Mongodb delete data
证券如何在线开户?在线开户是安全么?
Rational investment and internationalism
The elephant turns around and starts the whole body. Ali pushes Maoxiang not only to Jingdong
PHP output (print) log to TXT text
@Detailed explanation of valid annotation usage
CTFSHOW
How to chain multiple different InputStreams into one InputStream
Hands on deep learning (III)
John
Analysis report on investment and financing status and operation benefits of global and Chinese dental industry (2022 Edition)
Understand what MSS is
Ping command – test network connectivity between hosts
Gb28181 protocol -- timing
Understand what MTU is
Forecast report on output demand and supply scale of global and Chinese structural ceramics market for semiconductor equipment (2022 Edition)
Global and China financial guarantee marketing strategy and channel dynamic construction report 2022
