The DIRHA-ENGLISH corpus and related tasks for remote speech recognition in domestic environments
DOI:
https://doi.org/10.17469/O2102AISV000017
Keywords:
distant speech recognition, microphone arrays, corpora, Kaldi, DNN
Abstract
This paper describes the contents and possible uses of the DIRHA-ENGLISH multi-microphone corpus, created under the EC DIRHA project. The reference scenario is a domestic environment equipped with a large number of microphones distributed in space. The corpus comprises both real and simulated material and includes utterances from 12 US and 12 UK native English speakers. Each speaker uttered different sets of phonetically rich sentences, newspaper articles, conversational speech, keywords, and commands. From this material, a large set of 1-minute sequences was generated, which also includes typical domestic background noise and inter/intra-room reverberation effects. Development and test sets were derived. The paper reports a first set of baseline results obtained using different techniques, including Deep Neural Networks (DNN), in line with the international state of the art. Various tasks and Kaldi recipes have already been developed.
License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.