AASP-P14: Acoustic Event Detection and Speech Enhancement |
| Session Type: Poster |
| Time: Friday, May 17, 08:30 - 10:30 |
| Location: Poster Area D, Ground Floor |
| Session Chair: Sven Nordholm, Curtin University
|
| |
| AASP-P14.1: SCENE-DEPENDENT ANOMALOUS ACOUSTIC-EVENT DETECTION BASED ON CONDITIONAL WAVENET AND I-VECTOR |
| Tatsuya Komatsu; NEC Corporation |
| Tomoki Hayashi; Nagoya University |
| Reishi Kondo; NEC Corporation |
| Tomoki Toda; Nagoya University |
| Kazuya Takeda; Nagoya University |
| |
| AASP-P14.2: TEACHER-STUDENT TRAINING FOR ACOUSTIC EVENT DETECTION USING AUDIOSET |
| Ruibo Shi; Emotech Labs |
| Raymond W. M. Ng; Emotech Labs |
| Pawel Swietojanski; The University of New South Wales |
| |
| AASP-P14.3: ACTIVE LEARNING FOR EFFICIENT AUDIO ANNOTATION AND CLASSIFICATION WITH A LARGE AMOUNT OF UNLABELED DATA |
| Yu Wang; New York University |
| Ana Elisa Mendez Mendez; New York University |
| Mark Cartwright; New York University |
| Juan Pablo Bello; New York University |
| |
| AASP-P14.4: POLYPHONIC SOUND EVENT DETECTION USING CONVOLUTIONAL BIDIRECTIONAL LSTM AND SYNTHETIC DATA-BASED TRANSFER LEARNING |
| Seokwon Jung; Humelo Inc. / Korea Advanced Institute of Science and Technology (KAIST) |
| Jungbae Park; Humelo Inc. / Korea Advanced Institute of Science and Technology (KAIST) |
| Sangwan Lee; Korea Advanced Institute of Science and Technology / Humelo Inc. |
| |
| AASP-P14.5: A MULTI-SPIKE APPROACH FOR ROBUST SOUND RECOGNITION |
| Qiang Yu; Tianjin University |
| Yanli Yao; Tianjin University |
| Longbiao Wang; Tianjin University |
| Huajin Tang; Sichuan University |
| Jianwu Dang; Tianjin University |
| |
| AASP-P14.6: CROSS EVALUATION OF SPEECH ENHANCEMENT METHODS UNDER DIFFERENT NOISE CONDITIONS |
| Lara Nahma; Curtin University |
| Pei Chee Yong; Nuheara |
| Hai Huyen Dam; Curtin University |
| Sven Nordholm; Curtin University |
| |
| AASP-P14.7: DIFFERENTIABLE CONSISTENCY CONSTRAINTS FOR IMPROVED DEEP SPEECH ENHANCEMENT |
| Scott Wisdom; Google, Inc. |
| John R. Hershey; Google, Inc. |
| Kevin Wilson; Google, Inc. |
| Jeremy Thorpe; Google, Inc. |
| Michael Chinen; Google, Inc. |
| Brian Patton; Google, Inc. |
| Rif Saurous; Google, Inc. |
| |
| AASP-P14.8: A DEEP GENERATIVE MODEL OF SPEECH COMPLEX SPECTROGRAMS |
| Aditya Arie Nugraha; RIKEN Center for Advanced Intelligence Project |
| Kouhei Sekiguchi; RIKEN Center for Advanced Intelligence Project |
| Kazuyoshi Yoshii; RIKEN Center for Advanced Intelligence Project |
| |
| AASP-P14.9: DNN TRAINING BASED ON CLASSIC GAIN FUNCTION FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION |
| Yanhui Tu; University of Science and Technology of China |
| Jun Du; University of Science and Technology of China |
| Chin-Hui Lee; Georgia Institute of Technology |
| |
| AASP-P14.10: SNIPER: FEW-SHOT LEARNING FOR ANOMALY DETECTION TO MINIMIZE FALSE-NEGATIVE RATE WITH ENSURED TRUE-POSITIVE RATE |
| Yuma Koizumi; NTT Corporation |
| Shin Murata; NTT Corporation |
| Noboru Harada; NTT Corporation |
| Shoichiro Saito; NTT Corporation |
| Hisashi Uematsu; NTT Corporation |
| |