当前位置:网站首页>Speech signal processing - concept (II): amplitude spectrum (STFT spectrum), Mel spectrum [the deep learning of speech mainly uses amplitude spectrum and Mel spectrum] [extracted with librosa or torch
Speech signal processing - concept (II): amplitude spectrum (STFT spectrum), Mel spectrum [the deep learning of speech mainly uses amplitude spectrum and Mel spectrum] [extracted with librosa or torch
2022-06-27 07:28:00 【u013250861】
One 、 What kind of spectrum is used in the deep learning of speech ?
answer : With amp spec and melspec Mainly , Usually it can be used librosa Library or torchaudio Library for extraction .
Two 、 that Fbank、MFCC Well ?
answer : Because it contains less information , It is no longer suitable for this big data era . But some tasks, because of their special nature , Still use MFCC Spectrum . Such as emotional voice conversion task .
边栏推荐
- 【软件工程】山东大学软件工程复习提纲
- The interviewer of a large front-line factory asked: do you really understand e-commerce order development?
- 语音信号处理-概念(二):幅度谱(短时傅里叶变换谱/STFT spectrum)、梅尔谱(Mel spectrum)【语音的深度学习主要用幅度谱、梅尔谱】【用librosa或torchaudio提取】
- DMU software syntax highlighting VIM setting -- Learning Notes 6
- Winow10 installation nexus nexus-3.20.1-01
- Window right click management
- 从5秒优化到1秒,系统飞起来了...
- VNC Viewer方式的远程连接树莓派
- Delay queue `delayqueue`
- 专业四第二周自测
猜你喜欢
How to download opencv? How to configure opencv after downloading?
一线大厂面试官问:你真的懂电商订单开发吗?
Oppo interview sorting, real eight part essay, abusing the interviewer
[openairinterface5g] rrcsetupcomplete for RRC NR resolution
一个人管理1000台服务器?这款自动化运维工具一定要掌握
POI replacing text and pictures in docx
再见了,敏捷Scrum
IDEA连接数据库报错
云服务器配置ftp、企业官网、数据库等方法
MySQL
随机推荐
How torch. gather works
How to download opencv? How to configure opencv after downloading?
Idea method template
Window right click management
RNA SEQ data analysis in R - investigate differentially expressed genes in the data!
Goodbye, agile Scrum
请问如何将数据从oracle导入fastDFS?
POI replacing text and pictures in docx
Gérer 1000 serveurs par personne? Cet outil d'automatisation o & M doit être maîtrisé
Winow10 installation nexus nexus-3.20.1-01
Websocket database listening
SQL injection bypass (I)
R language calculates Spearman correlation coefficient in parallel to speed up the construction of co occurrence network
云服务器配置ftp、企业官网、数据库等方法
面试官:用分库分表如何做到永不迁移数据和避免热点问题?
Unrecognized VM option ‘‘
Yolov6's fast and accurate target detection framework is open source
Date database date strings are converted to and from each other
NoViableAltException([email protected][2389:1: columnNameTypeOrConstraint : ( ( tableConstraint ) | ( columnNameT
Tar: /usr/local: cannot find tar in the Archive: due to the previous error, it will exit in the last error state