当前位置:网站首页>Voiceprint Technology (V): voiceprint segmentation and clustering technology
Voiceprint Technology (V): voiceprint segmentation and clustering technology
2022-06-25 09:05:00 【u013250861】
5.1 Segmentation clustering : Better understand the voice of dialogue
5.1.1 About name and history
Voiceprint segmentation clustering (speaker diarization) It is second only to voiceprint recognition in the field of voiceprint , It is much more difficult than voiceprint recognition . The problems solved by voiceprint recognition can be summarized as ——“ Who said that ”, This includes a hypothesis , That is, the known speech to be recognized , There is only one speaker's voice . In the voiceprint segmentation and clustering problem , We have overturned this assumption , in other words , A speech can contain the voice of multiple speakers speaking alternately . therefore , The problems solved by voiceprint segmentation and clustering can be summarized as ——“ Who said it at what time ”(who spoke when).
In English diarization The word" , From words diary, That is, diary or diary . from diary To verb diarize, And then to nouns diarization, Literally , It can be understood as “ send …… Become a log ”, Or say “ Log ”. generally speaking , A journal is usually recorded in the time of the day , Who did what at what time . Then it is extended to speaker diarization, Naturally, it can be understood as “ Who said what at what time ”.
About speaker diarization The earliest origin of this name , It is difficult to study . Some early literature directly referred to this problem as speaker segmentation and clustering [114,115], This is why many Chinese documents translate it into “ Voiceprint segmentation clustering ”[116]. But with the development of this field , Especially in recent years, the supervised method ( see 5.5 section ) Even end-to-end models ( see 5.5.6 section ) Appearance ,“ Segmentation clustering ” The name is no longer appropriate . Whether it is segmentation or clustering , Can be replaced by other methods . Another Chinese translation that I prefer is “ Voiceprint time sharing archive ”
边栏推荐
- Jmeter中的断言使用讲解
- Object.defineProperty也能监听数组变化?
- 1、 Construction of single neural network
- How can games copied from other people's libraries be displayed in their own libraries
- Unity--configurable joint -- a simple tutorial to get you started with configurable joints
- 四、卷积神经网络(Convolution Neural Networks)
- 对常用I/O模型进行比较说明
- Unity发布webGL的时候JsonConvert.SerializeObject()转换失败
- WebGL谷歌提示内存不够(RuntimeError:memory access out of bounds,火狐提示索引超出界限(RuntimeError:index out of bounds)
- (翻译)采用字母间距提高全大写文本可读性的方式
猜你喜欢
Unity--configurable joint -- a simple tutorial to get you started with configurable joints
How to become a software testing expert? From 3K to 17k a month, what have I done?
compiling stm32f4xx_it.c... “.\Objects\BH-F407.axf“ - 42 Error(s), 1 Warning(s).
浅谈Mysql底层索引原理
Easyplayer streaming media player plays HLS video. Technical optimization of slow starting speed
C # startup program loses double quotation marks for parameters passed. How to solve it?
wav文件(波形文件)格式分析与详解
关掉一个线程
cazy长安战役八卦迷宫
Compile time annotations for custom annotations (retentionpolicy.class)
随机推荐
Notes on key words in the original English work biography of jobs (VI) [chapter three]
Jmeter接口测试,关联接口实现步骤(token)
JMeter interface test, associated interface implementation steps (token)
C#程序终止问题CLR20R3解决方法
一、单个神经元网络构建
《乔布斯传》英文原著重点词汇笔记(五)【 chapter three 】
106. 简易聊天室9:使用 Socket 传递音频
Notes on key words in the original English work biography of jobs (II) [chapter one]
[untitled] * * database course design: complete the student information management system in three days**
Socket programming -- epoll model
六级易混词整理
How to solve the 10061 error of MySQL in Linux
Swiperefreshlayout+recyclerview failed to pull down troubleshooting
《乔布斯传》英文原著重点词汇笔记(四)【 chapter two 】
Stimulsoft ultimate presents reports and dashboards
jmeter中csv参数化
matplotlib matplotlib中axvline()和axhline()函数
Abbreviations of common English terms for IC R & D
获取扫码的客户端是微信还是支付宝
Voiceprint Technology (VII): the future of voiceprint Technology