Speech signal processing concepts (II): amplitude spectrum (STFT spectrum) and Mel spectrum [deep learning for speech mainly uses the amplitude spectrum and the Mel spectrum] [extracted with librosa or torch

2022-06-27 07:28:00 u013250861

1. Which spectra are used in deep learning for speech?
Answer: Mainly the amplitude spectrum (amp spec) and the Mel spectrum (melspec). Both can usually be extracted with the librosa or torchaudio library.

2. What about Fbank and MFCC?
Answer: They contain less information, so they are no longer well suited to the current big-data era. Some tasks still use MFCC features because of their particular nature, for example emotional voice conversion.

Copyright notice
This article was written by [u013250861]. Please include a link to the original when reposting. Thank you.
https://yzsam.com/2022/178/202206270658217841.html