The CHiME-6 dataset has been used in CHiME challenges from CHiME-6 onwards. It is a version of the original CHiME-5 dataset in which the alignment between the audio channels has been corrected. We recommend that you use CHiME-6 instead of CHiME-5 for any new work.

CHiME-6 can be downloaded from the Open Speech and Language Resources (OpenSLR) site.

For publications that use the data, please cite the following paper:

  • Barker, J., Watanabe, S., Vincent, E., Trmal, J. (2018) The Fifth ‘CHiME’ Speech Separation and Recognition Challenge: Dataset, Task and Baselines. Proc. Interspeech 2018, 1561-1565, doi: 10.21437/Interspeech.2018-1768

You can use the following BibTeX entry:

  author={Jon Barker and Shinji Watanabe and Emmanuel Vincent and Jan Trmal},
  booktitle={Proc. Interspeech 2018},