Programme
The 2024 CHiME Workshop will take place on September 6, 2024 at the Kos International Convention Centre (KICC) on Kos Island, Greece in the AKESO (ΑΚΕΣΩ) room on the ground floor.
The Programme will proceed as follows:
8:30-9:00 | Registration, with coffee and pastries |
9:00-9:10 | Welcome |
9:10-9:45 | Task 1 overview and spotlights |
9:45-10:20 | Task 2 overview and spotlights |
10:20-10:55 | Task 3 overview and spotlights |
10:55-11:30 | Coffee break |
11:30-12:30 | Poster session 1 |
12:30-14:00 | Lunch |
14:00-15:00 | Invited speaker: Prof. Hung-yi Lee |
15:00-15:30 | Coffee break |
15:30-16:30 | Poster session 2 |
16:30-17:30 | Pitching/feedback session + Closing session |
All oral sessions will take place in the AKESO (ΑΚΕΣΩ) room on the ground floor, with posters in the atrium outside of the room.
9:10-9:45 Task 1 overview and spotlights
Time | Title | Authors |
9:10-9:35 | The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization [paper] [slides] [poster] | Samuele Cornell (Carnegie Mellon University); Tae Jin Park (NVIDIA); He Huang (Nvidia); Christoph Boeddeker (Paderborn University); Xuankai Chang (Carnegie Mellon University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Paola Garcia (Johns Hopkins University); Shinji Watanabe (Carnegie Mellon University) |
9:35-9:40 | STCON System for the CHiME-8 Challenge [paper] [slides] [poster] | Anton Mitrofanov (STC-innovations Ltd); Tatiana Prisyach (STCON LLC); Tatiana Timofeeva (STC); Sergey Novoselov (ITMO University); Maxim Korenevsky (Speech Technology Center); Yuri Khokhlov (Speech Technology Center); Artem Akulov (STC Innovations); Aleksandr Anikin (Speech Technology Center); Roman Khalili (STC); Iurii Lezhenin (Speech Technology Center); Aleksandr Melnikov (STC); Dmitriy Miroshnichenko (STC); Nikita Mamaev (ITMO University); Ilya Odegov (STC); Olga Rudnitskaya (STC); Aleksei Romanenko (Speech Technology Center) |
9:40-9:45 | NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge [paper] [slides] [poster] | Naoyuki Kamo (NTT); Naohiro Tawara (NTT); Atsushi Ando (NTT Corporation); Takatomo Kano (NTT Corporation); Hiroshi Sato (NTT); Rintaro Ikeshita (NTT); Takafumi Moriya (NTT); Shota Horiguchi (NTT Corporation); Kohei Matsuura (NTT); Atsunori Ogawa (NTT Corporation); Alexis Plaquet (IRIT); Takanori Ashihara (NTT Corp.); Tsubasa Ochiai (NTT); Masato Mimura (NTT corporation); Marc Delcroix (NTT); Tomohiro Nakatani (NTT Communication Science Laboratories); Taichi Asami (NTT); Shoko Araki (NTT Corporation) |
9:45-10:20 Task 2 oral presentations
Time | Title | Authors |
9:45-10:10 | NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription [paper] [slides] [poster] | Alon Vinnikov (Microsoft), Amir Ivry (Microsoft), Aviv Hurvitz (Microsoft), Igor Abramovski (Microsoft), Sharon Koubi (Microsoft), Ilya Gurvich (Microsoft), Shai Pe‘er (Microsoft), Xiong Xiao (Microsoft), Benjamin Martinez Elizalde (Microsoft), Naoyuki Kanda (Microsoft), Xiaofei Wang (Microsoft), Shalev Shaer (Microsoft), Stav Yagev (Microsoft), Yossi Asher (Microsoft), Sunit Sivasankaran (Microsoft), Yifan Gong (Microsoft), Min Tang (Microsoft), Huaming Wang (Microsoft), Eyal Krupka (Microsoft) |
10:10-10:15 | The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge [paper] | Shutong Niu (University of Science and Technology of China ); Ruoyu Wang (University of Science and Technology of China); Jun Du (University of Science and Technology of China); Gaobin Yang (University of Science and Technology of China); Yanhui Tu (iFlytek); Siyuan Wu (iFlytek Research); Shuangqing Qian (iFlytek); Huaxin Wu (iFlytek Research); Haitao Xu (iFlytek Research); Xueyang Zhang (iFlytek Research); Guolong Zhong (iFlytek Research); Xindi Yu (iFlytek Research); Jieru Chen (iFlytek Research); Mengzhi Wang (iFlytek Research); Di Cai (iFlytek Research); Tian Gao (iFlytek Research); Genshun Wan (iFlytek Research); Feng Ma (iFlytek Research); Jia Pan (iFlytek Research); Jianqing Gao (iFLYTEK) |
10:15-10:20 | BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge [paper] [slides] [poster] | Alexander Polok (Brno University of Technology); Dominik Klement (Brno University of Technology); Jiangyu Han (Brno University of Technology); Šimon Sedláček (Brno University of Technology); Bolaji Yusuf (Bogazici University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Lukáš Burget (Brno University of Technology) |
10:20-10:55 Task 3 oral presentations
Time | Title | Authors |
10:20-10:45 | The CHiME-8 MMCSG Challenge: Multi-modal conversations in smart glasses [paper] [slides] [poster] | Kateřina Žmolíková (Meta AI); Simone Merello (Meta AI); Kaustubh Kalgaonkar (Meta AI); Ju Lin (Meta); Niko Moritz (Meta); Pingchuan Ma (Meta AI); Ming Sun (Meta); Honglie Chen (Meta); Antoine Saliou (Meta AI); Stavros Petridis (Meta AI); Christian Fuegen (Meta); Michael Mandel (Meta) |
10:45-10:50 | The NPU-TEA System Report for the CHiME-8 MMCSG Challenge [paper] | Kaixun Huang (NWPU); Wei Rao (Tencent); Yue Li (Northwestern Polytechnical University, China); Hongji Wang (Tencent); Yannan Wang (Tencent); Shen Huang (Tencent Research); Lei Xie (NWPU) |
10:50-10:55 | The SEUEE System for the CHiME-8 MMCSG Challenge [paper] [slides] [poster] | Cong Pang (Southeast University); Feifei Xiong (Alibaba Group); Ye Ni (southeast university); Lin Zhou (Southeast University); Jinwei Feng ( Alibaba Group) |
11:30-12:30 Poster Session
Poster ID | Title | Authors | Track |
1 | The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization [paper] [slides] [poster] | Samuele Cornell (Carnegie Mellon University); Tae Jin Park (NVIDIA); He Huang (Nvidia); Christoph Boeddeker (Paderborn University); Xuankai Chang (Carnegie Mellon University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Paola Garcia (Johns Hopkins University); Shinji Watanabe (Carnegie Mellon University) | Task 1 DASR (Overview) |
2 | NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription [paper] [slides] [poster] | Alon Vinnikov (Microsoft), Amir Ivry (Microsoft), Aviv Hurvitz (Microsoft), Igor Abramovski (Microsoft), Sharon Koubi (Microsoft), Ilya Gurvich (Microsoft), Shai Pe‘er (Microsoft), Xiong Xiao (Microsoft), Benjamin Martinez Elizalde (Microsoft), Naoyuki Kanda (Microsoft), Xiaofei Wang (Microsoft), Shalev Shaer (Microsoft), Stav Yagev (Microsoft), Yossi Asher (Microsoft), Sunit Sivasankaran (Microsoft), Yifan Gong (Microsoft), Min Tang (Microsoft), Huaming Wang (Microsoft), Eyal Krupka (Microsoft) | Task 2 NOTSOFAR (Overview) |
3 | The CHiME-8 MMCSG Challenge: Multi-modal conversations in smart glasses [paper] [slides] [poster] | Kateřina Žmolíková (Meta AI); Simone Merello (Meta AI); Kaustubh Kalgaonkar (Meta AI); Ju Lin (Meta); Niko Moritz (Meta); Pingchuan Ma (Meta AI); Ming Sun (Meta); Honglie Chen (Meta); Antoine Saliou (Meta AI); Stavros Petridis (Meta AI); Christian Fuegen (Meta); Michael Mandel (Meta) | Task 3 MMCSG (Overview) |
4 | STCON System for the CHiME-8 Challenge [paper] [slides] [poster] | Anton Mitrofanov (STC-innovations Ltd); Tatiana Prisyach (STCON LLC); Tatiana Timofeeva (STC); Sergey Novoselov (ITMO University); Maxim Korenevsky (Speech Technology Center); Yuri Khokhlov (Speech Technology Center); Artem Akulov (STC Innovations); Aleksandr Anikin (Speech Technology Center); Roman Khalili (STC); Iurii Lezhenin (Speech Technology Center); Aleksandr Melnikov (STC); Dmitriy Miroshnichenko (STC); Nikita Mamaev (ITMO University); Ilya Odegov (STC); Olga Rudnitskaya (STC); Aleksei Romanenko (Speech Technology Center) | Task 1 DASR |
5 | BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge [paper] [slides] [poster] | Alexander Polok (Brno University of Technology); Dominik Klement (Brno University of Technology); Jiangyu Han (Brno University of Technology); Šimon Sedláček (Brno University of Technology); Bolaji Yusuf (Bogazici University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Lukáš Burget (Brno University of Technology) | Task 2 NOTSOFAR |
6 | ToTaTo System Descriptions for NOTSOFAR1 Challenge [paper] | Joonas Kalda (Tallinn University of Technology); Tanel Alumae (Tallinn University of Technology); Séverin BAROUDI (LIS); Martin Lebourdais (IRIT/CNRS); Hervé Bredin (CNRS); Ricard Marxer (Université de Toulon, Aix Marseille Univ, CNRS, LIS, Toulon) | Task 2 NOTSOFAR |
7 | System Description of NJU-AALab’s submission for The CHiME-8 NOTSOFAR-1 Challenge [paper] [poster] | Qinwen Hu (Nanjing University); Tianchi Sun (Nanjing University); Xinan Chen (Nanjing University); Xiaobin Rong (Nanjing University); Jing Lu (Nanjing University) | Task 2 NOTSOFAR |
8 | The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge [paper] | Shutong Niu (University of Science and Technology of China); Ruoyu Wang (University of Science and Technology of China); Jun Du (University of Science and Technology of China); Gaobin Yang (University of Science and Technology of China); Yanhui Tu (iFlytek); Siyuan Wu (iFlytek Research); Shuangqing Qian (iFlytek); Huaxin Wu (iFlytek Research); Haitao Xu (iFlytek Research); Xueyang Zhang (iFlytek Research); Guolong Zhong (iFlytek Research); Xindi Yu (iFlytek Research); Jieru Chen (iFlytek Research); Mengzhi Wang (iFlytek Research); Di Cai (iFlytek Research); Tian Gao (iFlytek Research); Genshun Wan (iFlytek Research); Feng Ma (iFlytek Research); Jia Pan (iFlytek Research); Jianqing Gao (iFLYTEK) | Task 2 NOTSOFAR |
9 | The NPU-TEA System Report for the CHiME-8 MMCSG Challenge [paper] | Kaixun Huang (NWPU); Wei Rao (Tencent); Yue Li (Northwestern Polytechnical University, China); Hongji Wang (Tencent); Yannan Wang (Tencent); Shen Huang (Tencent Research); Lei Xie (NWPU) | Task 3 MMCSG |
10 | The USTC-NERCSLIP Systems for the CHiME-8 MMCSG Challenge [paper] [poster] | Ya Jiang (University of Science and Technology of China); Jun Du (University of Science and Technology of China); Qing Wang (University of Science and Technology of China); Hongbo Lan (University of Science and Technology of China); Shutong Niu (University of Science and Technology of China ) | Task 3 MMCSG |
11 | THE FOSAFER SYSTEM FOR THE CHiME-8 MMCSG CHALLENGE [paper] | Shangkun Huang (Beijing Fosafer Information Technology Co., Ltd.) | Task 3 MMCSG |
12 | The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection [paper] [poster] | Gabriel Bibbó (University of Surrey); Thomas Deacon (University of Surrey); Arshdeep Singh (University of Surrey); Mark D. Plumbley (University of Surrey) | General |
14:00-15:00 Keynote
- Hung-yi Lee, Professor of the Department of Electrical Engineering and the Department of Computer Science & Information Engineering of National Taiwan University
Teaching Foundation Models New Skills: Insights and Experiences [abstract] [slides]
15:30-16:30 Poster Session 2
Poster ID | Title | Authors | Track |
1 | The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization [paper] [slides] [poster] | Samuele Cornell (Carnegie Mellon University); Tae Jin Park (NVIDIA); He Huang (Nvidia); Christoph Boeddeker (Paderborn University); Xuankai Chang (Carnegie Mellon University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Paola Garcia (Johns Hopkins University); Shinji Watanabe (Carnegie Mellon University) | Task 1 DASR (Overview) |
2 | NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription [[paper]]https://www.isca-archive.org/interspeech_2024/vinnikov24_interspeech.html](papers/NOTSOFAR_alon_slides.pdf) [poster] | Alon Vinnikov (Microsoft), Amir Ivry (Microsoft), Aviv Hurvitz (Microsoft), Igor Abramovski (Microsoft), Sharon Koubi (Microsoft), Ilya Gurvich (Microsoft), Shai Pe‘er (Microsoft), Xiong Xiao (Microsoft), Benjamin Martinez Elizalde (Microsoft), Naoyuki Kanda (Microsoft), Xiaofei Wang (Microsoft), Shalev Shaer (Microsoft), Stav Yagev (Microsoft), Yossi Asher (Microsoft), Sunit Sivasankaran (Microsoft), Yifan Gong (Microsoft), Min Tang (Microsoft), Huaming Wang (Microsoft), Eyal Krupka (Microsoft) | Task 2 NOTSOFAR (Overview) |
3 | The CHiME-8 MMCSG Challenge: Multi-modal conversations in smart glasses [paper] [slides] [poster] | Kateřina Žmolíková (Meta AI); Simone Merello (Meta AI); Kaustubh Kalgaonkar (Meta AI); Ju Lin (Meta); Niko Moritz (Meta); Pingchuan Ma (Meta AI); Ming Sun (Meta); Honglie Chen (Meta); Antoine Saliou (Meta AI); Stavros Petridis (Meta AI); Christian Fuegen (Meta); Michael Mandel (Meta) | Task 3 MMCSG (Overview) |
13 | NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge [paper] [slides] [poster] | Naoyuki Kamo (NTT); Naohiro Tawara (NTT); Atsushi Ando (NTT Corporation); Takatomo Kano (NTT Corporation); Hiroshi Sato (NTT); Rintaro Ikeshita (NTT); Takafumi Moriya (NTT); Shota Horiguchi (NTT Corporation); Kohei Matsuura (NTT); Atsunori Ogawa (NTT Corporation); Alexis Plaquet (IRIT); Takanori Ashihara (NTT Corp.); Tsubasa Ochiai (NTT); Masato Mimura (NTT corporation); Marc Delcroix (NTT); Tomohiro Nakatani (NTT Communication Science Laboratories); Taichi Asami (NTT); Shoko Araki (NTT Corporation) | Task 1 DASR |
14 | UWB-NOTSOFAR: A System for Distant Meeting Transcription with a Single Device Abstract | Jan Lehečka (University of West Bohemia); Zbyněk Zajíc ( University of West Bohemia); Marie Kunešová (University of West Bohemia) | Task 2 NOTSOFAR |
15 | The Fano Labs System for the CHiME-8 NOTSOFAR-1 Challenge Single-channel Track [paper] | Samuel J Broughton (Fano Labs); Lahiru T Samarakoon (Fano Labs, Hong Kong); Harrison Zhu (Imperial College London) | Task 2 NOTSOFAR |
16 | The NPU-TEA System for the CHiME-8 NOTSOFAR-1 Challenge [paper] [poster] | Kaixun Huang (NWPU); Yue Li (Northwestern Polytechnical University, China); Ziqian Wang (Northwestern Polytechnical University); Hongji Wang (None); Wei Rao (Tencent); Zhaokai Sun (NWPU); Zhiyuan Tang (Tencent); Shen Huang (Tencent Research); Yannan Wang (Tencent); Tao Yu (Tencent); Lei Xie (NWPU); Shi-dong Shang (tencent) | Task 2 NOTSOFAR |
17 | The NAIST System for the CHiME-8 NOTSOFAR-1 Task [paper] [poster] | Yuta Hirano (Nara Institute of Science and Technology); Mau Dinh Nguyen (Japan Advanced Institute of Science and Technology); Kakeru Azuma (Nara Institute of Science and Technology); Jan Meyer Saragih (Nara Institute of Science and Technology); Sakriani Sakti (Nara Institute of Science and Technology / Japan Advanced Institute of Science and Technology) | Task 2 NOTSOFAR |
18 | The SEUEE System for the CHiME-8 MMCSG Challenge [paper] [slides] [poster] | Cong Pang (Southeast University); Feifei Xiong (Alibaba Group); Ye Ni (southeast university); Lin Zhou (Southeast University); Jinwei Feng ( Alibaba Group) | Task 3 MMCSG |