Train-clean-100 dataset
2022-07-24 07:40:00 【yn20000227】
LibriSpeech:
LibriSpeech is a corpus of read speech based on LibriVox public-domain audiobooks. It is intended for training and testing automatic speech recognition (ASR) systems.
The corpus is split into several parts so that users can download only the subsets they need. Subsets with "clean" in their name are considered, at least on average, cleaner than the rest of the audio and closer to American English accents. This classification was obtained by very crude automated means and should not be considered completely reliable. The subsets are disjoint: each speaker's audio is assigned to exactly one subset.
The corpus is structured as follows:
* dev-clean, test-clean - development and test sets with "clean" speech
* train-clean-100 - training set of about 100 hours of "clean" speech
* train-clean-360 - training set of about 360 hours of "clean" speech
* dev-other, test-other - development and test sets with speech that was automatically selected as more "challenging" to recognize
* train-other-500 - training set of about 500 hours of speech that was not classified as "clean"
* intro - subset containing only the LibriVox intro disclaimers for some of the readers
* mp3 - the original MP3-encoded audio on which the corpus is based
* texts - the texts corresponding to the audio in the corpus
1. Directory structure of the train-clean-100 dataset:

Where:
【19】: the speaker's ID
【198/227】: the IDs of the chapters read by that speaker
【*.trans.txt】: contains the transcript text for the corresponding chapter; the FLAC files are the audio recordings for that chapter.
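The layout described above can be walked with a few lines of Python. This is only a minimal sketch: it assumes that each chapter directory sits at `<root>/<speaker ID>/<chapter ID>/`, that every line of a `*.trans.txt` file has the form `<utterance ID> <transcript>`, and that the matching audio file is named `<utterance ID>.flac`. The function names `read_chapter` and `iter_dataset` are made up for this illustration and are not part of any library.

```python
from pathlib import Path


def read_chapter(chapter_dir):
    """Return (utterance_id, flac_path, transcript) tuples for one chapter directory."""
    chapter_dir = Path(chapter_dir)
    entries = []
    for trans_file in sorted(chapter_dir.glob("*.trans.txt")):
        with trans_file.open(encoding="utf-8") as handle:
            for line in handle:
                line = line.strip()
                if not line:
                    continue
                # Assumed line format: "<utterance ID> <transcript text>"
                utt_id, _, text = line.partition(" ")
                # Assumed audio naming: "<utterance ID>.flac" next to the transcript
                entries.append((utt_id, chapter_dir / f"{utt_id}.flac", text))
    return entries


def iter_dataset(root):
    """Walk <root>/<speaker ID>/<chapter ID>/ and yield (speaker, chapter, entry)."""
    root = Path(root)
    for speaker_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        for chapter_dir in sorted(p for p in speaker_dir.iterdir() if p.is_dir()):
            for entry in read_chapter(chapter_dir):
                yield speaker_dir.name, chapter_dir.name, entry
```

To use it, pass the path of the extracted train-clean-100 folder to `iter_dataset` and loop over the results; each item gives the speaker ID, the chapter ID, and one utterance with its audio path and transcript. The sketch does not check that the FLAC files actually exist on disk.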