Resource Overview
A sound vocabulary and dataset. AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos.
Code Snippets and File Information