Language-based audio retrieval DCASE 2022 evaluation dataset

Description

This is the evaluation dataset for Task 6 (Subtask B), Language-based Audio Retrieval, in DCASE 2022 Challenge. This evaluation dataset is meant to be used for the purposes of the Subtask B in the Task 6 at the scientific challenge 2022. This dataset is not meant to be used for developing language-based audio retrieval methods. For developing language-based audio retrieval methods, you should use the development dataset, i.e., the Clotho v2.1 dataset, which can be found also in Zenodo, at: https://zenodo.org/record/4783391. == License == The audio files in the archives: retrieval_audio.7z and the associated meta-data in the CSV file: retrieval_audio_metadata.csv are under the corresponding licenses of Freesound [1] platform, mentioned explicitly in the CSV file for each of the audio files. That is, each audio file in the 7z archives is listed in the CSV file with the meta-data. The meta-data for each file are: File name Keywords URL for the orignal audio file Start and end samples for the excerpt that is used in the dataset Uploader/user in the Freesound platform (manufacturer) Link to the license of the file The caption queries in the file: retrieval_captions.csv are under the Tampere University license, described in the LICENSE file. ==References== [1] Frederic Font, Gerard Roma, and Xavier Serra. 2013. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 411-412. DOI: https://doi.org/10.1145/2502081.2502245
Show more

Year of publication

2022

Type of data

Authors

Samuel Lipping - Creator

Zenodo - Publisher

Project

Other information

Fields of science

Computer and information sciences

Language

English

Open access

Open

License

License Not Specified

Keywords

Computer and information sciences

Subject headings

Temporal coverage

undefined

Related to this research data