当前位置:网站首页>Voiceprint Technology (II): Fundamentals of audio signal processing
Voiceprint Technology (II): Fundamentals of audio signal processing
2022-06-25 09:05:00 【u013250861】
2.1 To understand voiceprint , Learn audio first
In terms of discipline classification , Voiceprint technology is a branch of speech signal processing , and Speech signal processing belongs to the category of audio signal processing .
Voice signals and sound signal , The difference between the two is :
- Voice signals It refers to the voice with social significance when human beings speak ,
- sound signal It generally refers to all the sounds that human beings can hear . For example, the sound of an instrument , Sounds made by animals , The sound of a car engine , And people snore 、 Sneeze 、 The sound of coughing , These are audio signals in a broad sense , But they are not voice signals , Therefore, it is usually not within the scope of voiceprint technology research .
Many basic concepts and knowledge in audio signal processing , It is very important for learning voiceprint technology .
Any voiceprint system , No matter how advanced the model is , How sophisticated the algorithm is , Can not do without dealing with sound . Only when the correct audio signal is connected , The meaningful feature representation is extracted from it , The latter model can play its role to the greatest extent .
So this chapter , We will specifically and systematically learn these concepts and knowledge related to sound . This chapter covers a wide range , Involving human auditory perception 、 Audio interface 、 Coding technology 、 Discrete signal processing and many other sub fields . At first glance, these sub areas , It doesn't seem to have much to do with each other . However , When we really embark on research or engineering projects in the field of voiceprint , You will find that all the knowledge in these sub domains will inevitably be used . In an enterprise or research institution , Yes
边栏推荐
- 【无标题】**数据库课设:三天完成学生信息管理系统**
- Matplotlib decision boundary drawing function plot in Matplotlib_ decision_ Boundary and plt Detailed explanation of contour function
- C language: find all integers that can divide y and are odd numbers, and put them in the array indicated by B in the order from small to large
- Is it safe to open a stock account through the account opening QR code of the account manager? Or is it safe to open an account in a securities company?
- How can games copied from other people's libraries be displayed in their own libraries
- 五、项目实战---识别人和马
- Prepare for the 1000 Android interview questions and answers that golden nine silver ten must ask in 2022, and completely solve the interview problems
- mysql之Unknown table ‘COLUMN_STATISTICS‘ in information_schema (1109)
- Is it safe for Huatai Securities to open a stock account on it?
- A 35 year old Tencent employee was laid off and sighed: a suite in Beijing, with a deposit of more than 7 million, was anxious about unemployment
猜你喜欢
5、 Project practice --- identifying man and horse
matplotlib matplotlib中plt.grid()
Make a skylearn high-dimensional dataset_ Circles and make_ moons
Emergency administrative suspension order issued Juul can continue to sell electronic cigarette products in the United States for the time being
Explanation of assertions in JMeter
Cazy eight trigrams maze of Chang'an campaign
Where are the hotel enterprises that have been under pressure since the industry has warmed up in spring?
2、 Training fashion_ MNIST dataset
微服务调用组件Ribbon底层调用流程分析
(translation) the use of letter spacing to improve the readability of all capital text
随机推荐
【OpenCV】—输入输出XML和YAML文件
C language: bubble sort
C language: count the number of words in a paragraph
(translation) the use of letter spacing to improve the readability of all capital text
How can games copied from other people's libraries be displayed in their own libraries
Benefits and types of cloud network technology
OpenFOAM:底层
二、训练fashion_mnist数据集
A 35 year old Tencent employee was laid off and sighed: a suite in Beijing, with a deposit of more than 7 million, was anxious about unemployment
How to become a software testing expert? From 3K to 17k a month, what have I done?
Oracle one line function Encyclopedia
RTOS 多线程下hardfault问题总结
sklearn 高维数据集制作make_circles 和 make_moons
¥3000 | 录「TBtools」视频,交个朋友&拿现金奖!
Mapping mode of cache
【OpenCV】—离散傅里叶变换
(翻译)采用字母间距提高全大写文本可读性的方式
紧急行政中止令下达 Juul暂时可以继续在美国销售电子烟产品
Matplotlib plt grid()
自定义注解之编译时注解(RetentionPolicy.CLASS)