Speech breakpoint detection (short time improved subband spectral entropy)
2022-06-21 22:42:00 【qq-120】
1. Audio analysis
1. Output the speech segmentation time points, expressed in milliseconds;
2. Split the speech into multiple WAV files.
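As a rough illustration of the second task, the sketch below cuts a WAV file into one file per segment using Python's standard `wave` module. The function name `split_wav`, the `(start_ms, end_ms)` tuple format, and the output naming scheme are assumptions made for this example, not taken from the original MATLAB code.

```python
import wave

def split_wav(src_path, segments_ms, out_prefix):
    """Write one WAV file per (start_ms, end_ms) pair and return the paths."""
    paths = []
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        rate = src.getframerate()
        for k, (start_ms, end_ms) in enumerate(segments_ms, start=1):
            first = int(start_ms * rate / 1000)        # first sample of the segment
            count = int(end_ms * rate / 1000) - first  # number of samples to copy
            src.setpos(first)
            data = src.readframes(count)
            path = f"{out_prefix}_{k:03d}.wav"         # hypothetical naming scheme
            with wave.open(path, "wb") as dst:
                dst.setparams(params)   # frame count in the header is fixed on close
                dst.writeframes(data)
            paths.append(path)
    return paths
```

For example, `split_wav("PHONE_001.wav", [(0, 250), (500, 1000)], "seg")` would write `seg_001.wav` and `seg_002.wav`, assuming the input is at least one second long.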
Endpoint detection (Speech Endpoint Detection): determine the start and end time of each sentence, ignoring a small number of non-speech frames in the middle, so that the result can be used for speech recognition.
Entropy is a quantity from information theory that measures information. The more random an event is, that is, the higher its uncertainty, the greater its entropy and the more information it carries.
This work uses the spectral entropy method for speech endpoint detection.
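To make the link between uncertainty and entropy concrete, the standard Shannon entropy of a discrete distribution $p_1, \dots, p_n$ is shown below, followed by a small worked example. The coin example is only an illustration and is not part of the original experiment.

$$
H = -\sum_{i=1}^{n} p_i \log_2 p_i
$$

For a fair coin, $p = (0.5, 0.5)$ gives $H = 1$ bit. For a biased coin, $p = (0.9, 0.1)$ gives $H = -(0.9 \log_2 0.9 + 0.1 \log_2 0.1) \approx 0.469$ bits. The more predictable outcome has the lower entropy.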
2. Spectral entropy method
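A minimal sketch of one common formulation of sub-band spectral entropy follows. It groups the power spectrum of a frame into sub-bands, adds a small constant to each sub-band energy, normalises the energies into a probability distribution, and takes its entropy. The function names, the sub-band width of 4 spectral lines, and the constant `k_const = 0.5` are assumptions for illustration; the original `vad.m` may use different values or a different variant. The DFT here is a naive pure-Python one, so it is only suitable for short demo frames.

```python
import cmath
import math

def power_spectrum(frame):
    """Naive DFT power spectrum for bins 0..N/2 (O(N^2), fine for a short demo)."""
    n = len(frame)
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spec.append(abs(acc) ** 2)
    return spec

def subband_spectral_entropy(frame, band_width=4, k_const=0.5):
    """Entropy (in nats) of the normalised sub-band energies of one frame."""
    spec = power_spectrum(frame)
    # Group adjacent spectral lines into sub-bands and sum their energy.
    bands = [sum(spec[i:i + band_width]) for i in range(0, len(spec), band_width)]
    # The added constant (an assumed value) keeps silent frames well defined.
    total = sum(e + k_const for e in bands)
    probs = [(e + k_const) / total for e in bands]
    return -sum(p * math.log(p) for p in probs)
```

With this definition, a frame whose energy is concentrated in a few sub-bands (such as a pure tone) yields a low value, while a frame with energy spread evenly across sub-bands yields a value close to the logarithm of the number of sub-bands.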


3. Preprocessing
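Preprocessing for frame-based endpoint detection typically removes the DC offset, normalises the amplitude, and cuts the signal into overlapping windowed frames. The sketch below assumes that pipeline; the frame length, hop size, Hamming window, and the function name `preprocess` are illustrative choices and may differ from what `main_VAD.m` actually does.

```python
import math

def preprocess(samples, frame_len=200, hop=80):
    """Remove DC, normalise the peak to 1, then cut Hamming-windowed frames."""
    mean = sum(samples) / len(samples)
    centred = [s - mean for s in samples]
    peak = max(abs(s) for s in centred) or 1.0   # avoid dividing by zero
    norm = [s / peak for s in centred]
    window = [0.54 - 0.46 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    frames = []
    # Only complete frames are kept; trailing samples are discarded.
    for start in range(0, len(norm) - frame_len + 1, hop):
        frames.append([norm[start + n] * window[n] for n in range(frame_len)])
    return frames
```

At a sampling rate of 8 kHz, the assumed defaults correspond to 25 ms frames with a 10 ms hop.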

4. Double threshold method endpoint detection
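The general idea of double-threshold detection can be sketched as follows: a run of frames counts as speech if its score stays above a low threshold throughout and exceeds a high threshold at least once. The sketch assumes a per-frame score where larger means more speech-like. Speech frames typically have lower spectral entropy than noise frames, so one option is to use the difference between a noise-level entropy estimate and the frame entropy as the score. The function name and thresholds are hypothetical and not taken from `vad.m`.

```python
def double_threshold(scores, low, high):
    """Return inclusive (start, end) frame index pairs of detected segments.

    A segment is a maximal run of frames with score > low that contains
    at least one frame with score > high.
    """
    segments = []
    i, n = 0, len(scores)
    while i < n:
        if scores[i] > low:
            start = i
            confirmed = False
            while i < n and scores[i] > low:
                confirmed = confirmed or scores[i] > high
                i += 1
            if confirmed:                 # runs that never pass `high` are dropped
                segments.append((start, i - 1))
        else:
            i += 1
    return segments
```

The low threshold sets where a segment begins and ends, while the high threshold filters out weak runs that are more likely to be noise.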

5. Experimental results





Information obtained by processing PHONE_001.wav:
(1) time.csv: segment time information for the speech;
(2) PHONE_001_vad.wav: the speech segments after VAD processing, concatenated into a single WAV file;
(3) segmentation folder: the individual speech segments produced by the segmentation;
(4) main_VAD.m: the main function;
(5) vad.m: the double-threshold endpoint detection function;
(6) houzhichuli.m: the interval-length decision function;
(7) frame2time.m: the function that converts frames to time.
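The last two helpers can be approximated in a few lines. The sketch below is a guess at their roles rather than a port of the original MATLAB files. `frame_to_ms` maps a 0-based frame index to the time of the frame centre in milliseconds, and `merge_segments` merges segments separated by short pauses and drops very short ones. The names, the gap and length limits, and the centre-of-frame convention are all assumptions.

```python
def frame_to_ms(index, frame_len, hop, rate):
    """Centre time in milliseconds of frame `index` (0-based)."""
    return (index * hop + frame_len / 2) * 1000.0 / rate

def merge_segments(segments_ms, max_gap_ms=200, min_len_ms=100):
    """Merge segments separated by short pauses, then drop very short ones."""
    merged = []
    for start, end in sorted(segments_ms):
        if merged and start - merged[-1][1] <= max_gap_ms:
            merged[-1][1] = max(merged[-1][1], end)   # extend the previous segment
        else:
            merged.append([start, end])
    return [(s, e) for s, e in merged if e - s >= min_len_ms]
```

The resulting `(start_ms, end_ms)` pairs are the kind of rows one would expect in a file such as time.csv.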