ASVP audio datasets

Posted by: Yanxiong Li    Published: 2024-07-17

Contributors: Wenchang Cao, Wei Xie, Yanxiong Li

Three audio datasets, FSC-89, NSynth-100 and LS-100, were constructed as experimental data for evaluating the performance of audio classification methods. Each dataset is described in detail on its own web page [1]-[3], where it can also be downloaded and used freely for research purposes.

The FSC-89 dataset has 89 sound events (audio classes) covering a diverse range of real-world sounds, from human and animal sounds to natural, musical and miscellaneous sounds. The NSynth-100 dataset has 100 sounds (musical notes) from different musical instruments. The LS-100 dataset consists of speech samples uttered by 100 different speakers (classes).
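For readers setting up experiments, the class counts above can be kept in a small lookup table, for example to size a classifier's output layer. The following is only a minimal sketch: the names `AudioDataset`, `DATASETS` and `num_classes` are hypothetical and not part of any released tooling, and treating the 100 NSynth-100 sounds as 100 classes is an assumption drawn from the description above. The URLs are those given in [1]-[3].

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AudioDataset:
    """Hypothetical record describing one of the three datasets."""
    name: str
    num_classes: int
    class_kind: str
    url: str


# Values copied from the description above and the reference list [1]-[3].
# Assumption: each of the 100 NSynth-100 sounds is treated as one class.
DATASETS = {
    "FSC-89": AudioDataset(
        "FSC-89", 89, "sound events",
        "https://www.modelscope.cn/datasets/pp199124903/FSC-89/summary",
    ),
    "NSynth-100": AudioDataset(
        "NSynth-100", 100, "musical notes",
        "https://www.modelscope.cn/datasets/pp199124903/NSynth-100/summary",
    ),
    "LS-100": AudioDataset(
        "LS-100", 100, "speakers",
        "https://www.modelscope.cn/datasets/pp199124903/LS-100/summary",
    ),
}


def num_classes(name: str) -> int:
    """Return the number of classes for a dataset; raises KeyError if unknown."""
    return DATASETS[name].num_classes


if __name__ == "__main__":
    for ds in DATASETS.values():
        print(f"{ds.name}: {ds.num_classes} {ds.class_kind} -> {ds.url}")
```

Keeping the counts in one place avoids hard-coding 89 or 100 in several parts of an experiment script.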

[1] https://www.modelscope.cn/datasets/pp199124903/LS-100/summary

[2] https://www.modelscope.cn/datasets/pp199124903/FSC-89/summary

[3] https://www.modelscope.cn/datasets/pp199124903/NSynth-100/summary