Technical Program

SLP-P15: Distant Speech Recognition

Session Type: Poster
Time: Thursday, May 16, 15:30 - 17:30
Location: Poster Area A, Ground Floor
Session Chair: Tomohiro Nakatani, NTT Corporation
 
SLP-P15.1: SPATIAL AND CHANNEL ATTENTION BASED CONVOLUTIONAL NEURAL NETWORKS FOR MODELING NOISY SPEECH
         Sirui Xu; The Ohio State University
         Eric Fosler-Lussier; The Ohio State University
 
SLP-P15.2: ACOUSTIC MODELING FOR DISTANT MULTI-TALKER SPEECH RECOGNITION WITH SINGLE- AND MULTI-CHANNEL BRANCHES
         Naoyuki Kanda; Hitachi Ltd.
         Yusuke Fujita; Hitachi Ltd.
         Shota Horiguchi; Hitachi Ltd.
         Rintaro Ikeshita; Hitachi Ltd.
         Kenji Nagamatsu; Hitachi Ltd.
         Shinji Watanabe; Johns Hopkins University
 
SLP-P15.3: MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION
         Kenichi Kumatani; Amazon
         Wu Minhua; Amazon
         Shiva Sundaram; Amazon
         Nikko Ström; Amazon
         Björn Hoffmeister; Amazon
 
SLP-P15.4: FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION
         Wu Minhua; Amazon
         Kenichi Kumatani; Amazon
         Shiva Sundaram; Amazon
         Nikko Ström; Amazon
         Björn Hoffmeister; Amazon
 
SLP-P15.5: ON REDUCING THE EFFECT OF SPEAKER OVERLAP FOR CHIME-5
         Catalin Zorila; Toshiba Cambridge Research Laboratory
         Rama Doddipatla; Toshiba Cambridge Research Laboratory
 
SLP-P15.6: A TWO-STAGE SINGLE-CHANNEL SPEAKER-DEPENDENT SPEECH SEPARATION APPROACH FOR CHIME-5 CHALLENGE
         Lei Sun; University of Science and Technology of China
         Jun Du; University of Science and Technology of China
         Tian Gao; University of Science and Technology of China
         Yi Fang; iFlytek Company
         Feng Ma; iFlytek Company
         Jia Pan; iFlytek Company
         Chin-Hui Lee; Georgia Institute of Technology
 
SLP-P15.7: JOINT OPTIMIZATION OF NEURAL NETWORK-BASED WPE DEREVERBERATION AND ACOUSTIC MODEL FOR ROBUST ONLINE ASR
         Jahn Heymann; Paderborn University
         Lukas Drude; Paderborn University
         Reinhold Häb-Umbach; Paderborn University
         Keisuke Kinoshita; NTT Communication Science Laboratories
         Tomohiro Nakatani; NTT Communication Science Laboratories
 
SLP-P15.8: INVESTIGATION INTO JOINT OPTIMIZATION OF SINGLE CHANNEL SPEECH ENHANCEMENT AND ACOUSTIC MODELING FOR ROBUST ASR
         Tobias Menne; RWTH Aachen University
         Ralf Schlüter; RWTH Aachen University
         Hermann Ney; RWTH Aachen University
 
SLP-P15.9: ACOUSTIC MODELING FOR OVERLAPPING SPEECH RECOGNITION: JHU CHIME-5 CHALLENGE SYSTEM
         Vimal Manohar; Johns Hopkins University
         Szu-Jui Chen; Johns Hopkins University
         Zhiqi Wang; Johns Hopkins University
         Yusuke Fujita; Hitachi Ltd.
         Shinji Watanabe; Johns Hopkins University
         Sanjeev Khudanpur; Johns Hopkins University
 
SLP-P15.10: LESSONS FROM BUILDING ACOUSTIC MODELS WITH A MILLION HOURS OF SPEECH
         Sree Hari Krishnan Parthasarathi; Amazon
         Nikko Ström; Amazon