The audio data, the annotations for the 6ch track, and some baseline software (acoustic simulation and an additional speech enhancement tool) are distributed only via the LDC as part of the CHiME-3 package LDC2017S24.
In addition, the annotations for the 1ch and 2ch tracks, the list of cross-correlation coefficients between channels, and the original enhancement and ASR baseline can be downloaded here. Note that this enhancement and ASR baseline differs from both the CHiME-3 baseline and the CHiME-4 recipe in the latest version of Kaldi. It is available under the Apache License, Version 2.0.
- CHiME4_diff (v1.0) - download
To refer to these data and these baselines in a publication, please cite:
- Emmanuel Vincent, Shinji Watanabe, Aditya Arie Nugraha, Jon Barker, and Ricard Marxer
An analysis of environment, microphone and data simulation mismatches in robust speech recognition
Computer Speech and Language, 2017.