SLP-L1: Speech LLM: Training & Generation
Tue, 5 May, 14:00 - 16:00
Location: Room 114
Session Type: Oral
Session Co-Chairs: George Saon, IBM and Yanmin Qian, Shanghai Jiao Tong University
Track: Speech and Language Processing [SL]
Tue, 5 May, 14:00 - 14:20
SLP-L1.1: CROSS-MODAL KNOWLEDGE DISTILLATION FOR SPEECH LARGE LANGUAGE MODELS
Tue, 5 May, 14:40 - 15:00
SLP-L1.3: GELINA: UNIFIED SPEECH AND GESTURE SYNTHESIS VIA INTERLEAVED TOKEN PREDICTION
Tue, 5 May, 15:20 - 15:40
SLP-L1.5: GROUP RELATIVE POLICY OPTIMIZATION FOR TEXT-TO-SPEECH WITH LARGE LANGUAGE MODELS
Tue, 5 May, 15:40 - 16:00