Programme

The 2024 CHiME Workshop will take place on September 6, 2024, at the Kos International Convention Centre (KICC) on Kos Island, Greece, in the AKESO (ΑΚΕΣΩ) room on the ground floor.

The Programme will proceed as follows:

8:30-9:00 Registration, with coffee and pastries
9:00-9:10 Welcome
9:10-9:45 Task 1 overview and spotlights
9:45-10:20 Task 2 overview and spotlights
10:20-10:55 Task 3 overview and spotlights
10:55-11:30 Coffee break
11:30-12:30 Poster session 1
12:30-14:00 Lunch
14:00-15:00 Invited speaker: Prof. Hung-yi Lee
15:00-15:30 Coffee break
15:30-16:30 Poster session 2
16:30-17:30 Pitching/feedback session + Closing session

All oral sessions will take place in the AKESO (ΑΚΕΣΩ) room on the ground floor, with posters in the atrium outside of the room.

9:10-9:45    Task 1 overview and spotlights

Time Title Authors
9:10-9:35 The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization [paper] [slides] [poster] Samuele Cornell (Carnegie Mellon University); Tae Jin Park (NVIDIA); He Huang (Nvidia); Christoph Boeddeker (Paderborn University); Xuankai Chang (Carnegie Mellon University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Paola Garcia (Johns Hopkins University); Shinji Watanabe (Carnegie Mellon University)
9:35-9:40 STCON System for the CHiME-8 Challenge [paper] [slides] [poster] Anton Mitrofanov (STC-innovations Ltd); Tatiana Prisyach (STCON LLC); Tatiana Timofeeva (STC); Sergey Novoselov (ITMO University); Maxim Korenevsky (Speech Technology Center); Yuri Khokhlov (Speech Technology Center); Artem Akulov (STC Innovations); Aleksandr Anikin (Speech Technology Center); Roman Khalili (STC); Iurii Lezhenin (Speech Technology Center); Aleksandr Melnikov (STC); Dmitriy Miroshnichenko (STC); Nikita Mamaev (ITMO University); Ilya Odegov (STC); Olga Rudnitskaya (STC); Aleksei Romanenko (Speech Technology Center)
9:40-9:45 NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge [paper] [slides] [poster] Naoyuki Kamo (NTT); Naohiro Tawara (NTT); Atsushi Ando (NTT Corporation); Takatomo Kano (NTT Corporation); Hiroshi Sato (NTT); Rintaro Ikeshita (NTT); Takafumi Moriya (NTT); Shota Horiguchi (NTT Corporation); Kohei Matsuura (NTT); Atsunori Ogawa (NTT Corporation); Alexis Plaquet (IRIT); Takanori Ashihara (NTT Corp.); Tsubasa Ochiai (NTT); Masato Mimura (NTT corporation); Marc Delcroix (NTT); Tomohiro Nakatani (NTT Communication Science Laboratories); Taichi Asami (NTT); Shoko Araki (NTT Corporation)

9:45-10:20    Task 2 oral presentations

Time Title Authors
9:45-10:10 NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription [paper] [slides] [poster] Alon Vinnikov (Microsoft); Amir Ivry (Microsoft); Aviv Hurvitz (Microsoft); Igor Abramovski (Microsoft); Sharon Koubi (Microsoft); Ilya Gurvich (Microsoft); Shai Pe’er (Microsoft); Xiong Xiao (Microsoft); Benjamin Martinez Elizalde (Microsoft); Naoyuki Kanda (Microsoft); Xiaofei Wang (Microsoft); Shalev Shaer (Microsoft); Stav Yagev (Microsoft); Yossi Asher (Microsoft); Sunit Sivasankaran (Microsoft); Yifan Gong (Microsoft); Min Tang (Microsoft); Huaming Wang (Microsoft); Eyal Krupka (Microsoft)
10:10-10:15 The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge [paper] Shutong Niu (University of Science and Technology of China); Ruoyu Wang (University of Science and Technology of China); Jun Du (University of Science and Technology of China); Gaobin Yang (University of Science and Technology of China); Yanhui Tu (iFlytek); Siyuan Wu (iFlytek Research); Shuangqing Qian (iFlytek); Huaxin Wu (iFlytek Research); Haitao Xu (iFlytek Research); Xueyang Zhang (iFlytek Research); Guolong Zhong (iFlytek Research); Xindi Yu (iFlytek Research); Jieru Chen (iFlytek Research); Mengzhi Wang (iFlytek Research); Di Cai (iFlytek Research); Tian Gao (iFlytek Research); Genshun Wan (iFlytek Research); Feng Ma (iFlytek Research); Jia Pan (iFlytek Research); Jianqing Gao (iFLYTEK)
10:15-10:20 BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge [paper] [slides] [poster] Alexander Polok (Brno University of Technology); Dominik Klement (Brno University of Technology); Jiangyu Han (Brno University of Technology); Šimon Sedláček (Brno University of Technology); Bolaji Yusuf (Bogazici University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Lukáš Burget (Brno University of Technology)

10:20-10:55    Task 3 oral presentations

Time Title Authors
10:20-10:45 The CHiME-8 MMCSG Challenge: Multi-modal conversations in smart glasses [paper] [slides] [poster] Kateřina Žmolíková (Meta AI); Simone Merello (Meta AI); Kaustubh Kalgaonkar (Meta AI); Ju Lin (Meta); Niko Moritz (Meta); Pingchuan Ma (Meta AI); Ming Sun (Meta); Honglie Chen (Meta); Antoine Saliou (Meta AI); Stavros Petridis (Meta AI); Christian Fuegen (Meta); Michael Mandel (Meta)
10:45-10:50 The NPU-TEA System Report for the CHiME-8 MMCSG Challenge [paper] Kaixun Huang (NWPU); Wei Rao (Tencent); Yue Li (Northwestern Polytechnical University, China); Hongji Wang (Tencent); Yannan Wang (Tencent); Shen Huang (Tencent Research); Lei Xie (NWPU)
10:50-10:55 The SEUEE System for the CHiME-8 MMCSG Challenge [paper] [slides] [poster] Cong Pang (Southeast University); Feifei Xiong (Alibaba Group); Ye Ni (Southeast University); Lin Zhou (Southeast University); Jinwei Feng (Alibaba Group)

11:30-12:30    Poster Session 1

Poster ID Title Authors Track
1 The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization [paper] [slides] [poster] Samuele Cornell (Carnegie Mellon University); Tae Jin Park (NVIDIA); He Huang (Nvidia); Christoph Boeddeker (Paderborn University); Xuankai Chang (Carnegie Mellon University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Paola Garcia (Johns Hopkins University); Shinji Watanabe (Carnegie Mellon University) Task 1 DASR (Overview)
2 NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription [paper] [slides] [poster] Alon Vinnikov (Microsoft); Amir Ivry (Microsoft); Aviv Hurvitz (Microsoft); Igor Abramovski (Microsoft); Sharon Koubi (Microsoft); Ilya Gurvich (Microsoft); Shai Pe’er (Microsoft); Xiong Xiao (Microsoft); Benjamin Martinez Elizalde (Microsoft); Naoyuki Kanda (Microsoft); Xiaofei Wang (Microsoft); Shalev Shaer (Microsoft); Stav Yagev (Microsoft); Yossi Asher (Microsoft); Sunit Sivasankaran (Microsoft); Yifan Gong (Microsoft); Min Tang (Microsoft); Huaming Wang (Microsoft); Eyal Krupka (Microsoft) Task 2 NOTSOFAR (Overview)
3 The CHiME-8 MMCSG Challenge: Multi-modal conversations in smart glasses [paper] [slides] [poster] Kateřina Žmolíková (Meta AI); Simone Merello (Meta AI); Kaustubh Kalgaonkar (Meta AI); Ju Lin (Meta); Niko Moritz (Meta); Pingchuan Ma (Meta AI); Ming Sun (Meta); Honglie Chen (Meta); Antoine Saliou (Meta AI); Stavros Petridis (Meta AI); Christian Fuegen (Meta); Michael Mandel (Meta) Task 3 MMCSG (Overview)
4 STCON System for the CHiME-8 Challenge [paper] [slides] [poster] Anton Mitrofanov (STC-innovations Ltd); Tatiana Prisyach (STCON LLC); Tatiana Timofeeva (STC); Sergey Novoselov (ITMO University); Maxim Korenevsky (Speech Technology Center); Yuri Khokhlov (Speech Technology Center); Artem Akulov (STC Innovations); Aleksandr Anikin (Speech Technology Center); Roman Khalili (STC); Iurii Lezhenin (Speech Technology Center); Aleksandr Melnikov (STC); Dmitriy Miroshnichenko (STC); Nikita Mamaev (ITMO University); Ilya Odegov (STC); Olga Rudnitskaya (STC); Aleksei Romanenko (Speech Technology Center) Task 1 DASR
5 BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge [paper] [slides] [poster] Alexander Polok (Brno University of Technology); Dominik Klement (Brno University of Technology); Jiangyu Han (Brno University of Technology); Šimon Sedláček (Brno University of Technology); Bolaji Yusuf (Bogazici University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Lukáš Burget (Brno University of Technology) Task 2 NOTSOFAR
6 ToTaTo System Descriptions for NOTSOFAR1 Challenge [paper] Joonas Kalda (Tallinn University of Technology); Tanel Alumae (Tallinn University of Technology); Séverin Baroudi (LIS); Martin Lebourdais (IRIT/CNRS); Hervé Bredin (CNRS); Ricard Marxer (Université de Toulon, Aix Marseille Univ, CNRS, LIS, Toulon) Task 2 NOTSOFAR
7 System Description of NJU-AALab’s submission for The CHiME-8 NOTSOFAR-1 Challenge [paper] [poster] Qinwen Hu (Nanjing University); Tianchi Sun (Nanjing University); Xinan Chen (Nanjing University); Xiaobin Rong (Nanjing University); Jing Lu (Nanjing University) Task 2 NOTSOFAR
8 The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge [paper] Shutong Niu (University of Science and Technology of China); Ruoyu Wang (University of Science and Technology of China); Jun Du (University of Science and Technology of China); Gaobin Yang (University of Science and Technology of China); Yanhui Tu (iFlytek); Siyuan Wu (iFlytek Research); Shuangqing Qian (iFlytek); Huaxin Wu (iFlytek Research); Haitao Xu (iFlytek Research); Xueyang Zhang (iFlytek Research); Guolong Zhong (iFlytek Research); Xindi Yu (iFlytek Research); Jieru Chen (iFlytek Research); Mengzhi Wang (iFlytek Research); Di Cai (iFlytek Research); Tian Gao (iFlytek Research); Genshun Wan (iFlytek Research); Feng Ma (iFlytek Research); Jia Pan (iFlytek Research); Jianqing Gao (iFLYTEK) Task 2 NOTSOFAR
9 The NPU-TEA System Report for the CHiME-8 MMCSG Challenge [paper] Kaixun Huang (NWPU); Wei Rao (Tencent); Yue Li (Northwestern Polytechnical University, China); Hongji Wang (Tencent); Yannan Wang (Tencent); Shen Huang (Tencent Research); Lei Xie (NWPU) Task 3 MMCSG
10 The USTC-NERCSLIP Systems for the CHiME-8 MMCSG Challenge [paper] [poster] Ya Jiang (University of Science and Technology of China); Jun Du (University of Science and Technology of China); Qing Wang (University of Science and Technology of China); Hongbo Lan (University of Science and Technology of China); Shutong Niu (University of Science and Technology of China) Task 3 MMCSG
11 The Fosafer System for the CHiME-8 MMCSG Challenge [paper] Shangkun Huang (Beijing Fosafer Information Technology Co., Ltd.) Task 3 MMCSG
12 The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection [paper] [poster] Gabriel Bibbó (University of Surrey); Thomas Deacon (University of Surrey); Arshdeep Singh (University of Surrey); Mark D. Plumbley (University of Surrey) General

14:00-15:00    Keynote

  • Hung-yi Lee, Professor of the Department of Electrical Engineering and the Department of Computer Science & Information Engineering of National Taiwan University

    Teaching Foundation Models New Skills: Insights and Experiences [abstract] [slides]


    Bio: Hung-yi Lee is a professor in the Department of Electrical Engineering at National Taiwan University (NTU), with a joint appointment in the university's Department of Computer Science & Information Engineering. His recent research focuses on developing technology that reduces the need for annotated data in speech processing (including voice conversion and speech recognition) and natural language processing (including abstractive summarization and question answering). He received the Salesforce Research Deep Learning Grant in 2019, the AWS ML Research Award in 2020, the Outstanding Young Engineer Award from The Chinese Institute of Electrical Engineering in 2018, the Young Scholar Innovation Award from the Foundation for the Advancement of Outstanding Scholarship in 2019, the Ta-You Wu Memorial Award from the Ministry of Science and Technology of Taiwan in 2019, and the 59th Ten Outstanding Young Person Award in Science and Technology Research & Development of Taiwan.

15:30-16:30    Poster Session 2

Poster ID Title Authors Track
1 The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization [paper] [slides] [poster] Samuele Cornell (Carnegie Mellon University); Tae Jin Park (NVIDIA); He Huang (Nvidia); Christoph Boeddeker (Paderborn University); Xuankai Chang (Carnegie Mellon University); Matthew Maciejewski (Johns Hopkins University); Matthew S Wiesner (Johns Hopkins University); Paola Garcia (Johns Hopkins University); Shinji Watanabe (Carnegie Mellon University) Task 1 DASR (Overview)
2 NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription [paper] [slides] [poster] Alon Vinnikov (Microsoft); Amir Ivry (Microsoft); Aviv Hurvitz (Microsoft); Igor Abramovski (Microsoft); Sharon Koubi (Microsoft); Ilya Gurvich (Microsoft); Shai Pe’er (Microsoft); Xiong Xiao (Microsoft); Benjamin Martinez Elizalde (Microsoft); Naoyuki Kanda (Microsoft); Xiaofei Wang (Microsoft); Shalev Shaer (Microsoft); Stav Yagev (Microsoft); Yossi Asher (Microsoft); Sunit Sivasankaran (Microsoft); Yifan Gong (Microsoft); Min Tang (Microsoft); Huaming Wang (Microsoft); Eyal Krupka (Microsoft) Task 2 NOTSOFAR (Overview)
3 The CHiME-8 MMCSG Challenge: Multi-modal conversations in smart glasses [paper] [slides] [poster] Kateřina Žmolíková (Meta AI); Simone Merello (Meta AI); Kaustubh Kalgaonkar (Meta AI); Ju Lin (Meta); Niko Moritz (Meta); Pingchuan Ma (Meta AI); Ming Sun (Meta); Honglie Chen (Meta); Antoine Saliou (Meta AI); Stavros Petridis (Meta AI); Christian Fuegen (Meta); Michael Mandel (Meta) Task 3 MMCSG (Overview)
13 NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge [paper] [slides] [poster] Naoyuki Kamo (NTT); Naohiro Tawara (NTT); Atsushi Ando (NTT Corporation); Takatomo Kano (NTT Corporation); Hiroshi Sato (NTT); Rintaro Ikeshita (NTT); Takafumi Moriya (NTT); Shota Horiguchi (NTT Corporation); Kohei Matsuura (NTT); Atsunori Ogawa (NTT Corporation); Alexis Plaquet (IRIT); Takanori Ashihara (NTT Corp.); Tsubasa Ochiai (NTT); Masato Mimura (NTT corporation); Marc Delcroix (NTT); Tomohiro Nakatani (NTT Communication Science Laboratories); Taichi Asami (NTT); Shoko Araki (NTT Corporation) Task 1 DASR
14 UWB-NOTSOFAR: A System for Distant Meeting Transcription with a Single Device [abstract] Jan Lehečka (University of West Bohemia); Zbyněk Zajíc (University of West Bohemia); Marie Kunešová (University of West Bohemia) Task 2 NOTSOFAR
15 The Fano Labs System for the CHiME-8 NOTSOFAR-1 Challenge Single-channel Track [paper] Samuel J Broughton (Fano Labs); Lahiru T Samarakoon (Fano Labs, Hong Kong); Harrison Zhu (Imperial College London) Task 2 NOTSOFAR
16 The NPU-TEA System for the CHiME-8 NOTSOFAR-1 Challenge [paper] [poster] Kaixun Huang (NWPU); Yue Li (Northwestern Polytechnical University, China); Ziqian Wang (Northwestern Polytechnical University); Hongji Wang (None); Wei Rao (Tencent); Zhaokai Sun (NWPU); Zhiyuan Tang (Tencent); Shen Huang (Tencent Research); Yannan Wang (Tencent); Tao Yu (Tencent); Lei Xie (NWPU); Shi-dong Shang (Tencent) Task 2 NOTSOFAR
17 The NAIST System for the CHiME-8 NOTSOFAR-1 Task [paper] [poster] Yuta Hirano (Nara Institute of Science and Technology); Mau Dinh Nguyen (Japan Advanced Institute of Science and Technology); Kakeru Azuma (Nara Institute of Science and Technology); Jan Meyer Saragih (Nara Institute of Science and Technology); Sakriani Sakti (Nara Institute of Science and Technology / Japan Advanced Institute of Science and Technology) Task 2 NOTSOFAR
18 The SEUEE System for the CHiME-8 MMCSG Challenge [paper] [slides] [poster] Cong Pang (Southeast University); Feifei Xiong (Alibaba Group); Ye Ni (Southeast University); Lin Zhou (Southeast University); Jinwei Feng (Alibaba Group) Task 3 MMCSG
