Sledovat
Ke-Han Lu
Název
Citace
Citace
Rok
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech
C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
322024
Non-autoregressive asr modeling using pre-trained language models for chinese speech recognition
FH Yu, KY Chen, KH Lu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1474-1482, 2022
282022
A context-aware knowledge transferring strategy for CTC-based ASR
KH Lu, KY Chen
2022 IEEE Spoken Language Technology Workshop (SLT), 60-67, 2023
162023
Investigating zero-shot generalizability on mandarin-english code-switched asr and speech-to-text translation of recent foundation models with self-supervision and weak supervision
CK Yang, KP Huang, KH Lu, CY Kuan, CY Hsiao, H Lee
2024 IEEE International Conference on Acoustics, Speech, and Signal …, 2024
102024
Desta: Enhancing speech language models through descriptive speech-text alignment
KH Lu, Z Chen, SW Fu, H Huang, B Ginsburg, YCF Wang, H Lee
INTERSPEECH 2024, 2024
52024
Codec-superb@ slt 2024: A lightweight benchmark for neural audio codec models
H Wu, X Chen, YC Lin, K Chang, J Du, KH Lu, AH Liu, HL Chung, YK Wu, ...
2024 IEEE Spoken Language Technology Workshop (SLT), 570-577, 2024
42024
Dynamic-superb phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks
C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ...
arXiv preprint arXiv:2411.05361, 2024
32024
Hypr: A comprehensive study for ASR hypothesis revising with a reference corpus
YW Wang, KH Lu, KY Chen
INTERSPEECH 2024, 2023
32023
Speech-Copilot: Leveraging Large Language Models for Speech Processing Via Task Decomposition, Modularization, and Program Generation
CY Kuan, CK Yang, WP Huang, KH Lu, H Lee
2024 IEEE Spoken Language Technology Workshop (SLT), 1060-1067, 2024
22024
A transformer-based cross-modal fusion model with adversarial training for vqa challenge 2021
KH Lu, BH Fang, KY Chen
arXiv preprint arXiv:2106.13033, 2021
22021
Building a taiwanese mandarin spoken language model: A first attempt
CK Yang, YK Fu, CA Li, YC Lin, YX Lin, WC Chen, HL Chung, CY Kuan, ...
arXiv preprint arXiv:2411.07111, 2024
12024
Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data
KH Lu, Z Chen, SW Fu, CHH Yang, J Balam, B Ginsburg, YCF Wang, ...
arXiv preprint arXiv:2409.20007, 2024
12024
SpeechCaps: Advancing Instruction-Based Universal Speech Models with Multi-Talker Speaking Style Captioning
C Huang, MH Shih, KH Lu, CY Hsiao, H Lee
arXiv preprint arXiv:2408.13891, 2024
12024
Listen and Speak Fairly: a Study on Semantic Gender Bias in Speech Integrated Large Language Models
YC Lin, TQ Lin, CK Yang, KH Lu, WC Chen, CY Kuan, H Lee
2024 IEEE Spoken Language Technology Workshop (SLT), 439-446, 2024
2024
ntust-nlp-2 at ROCLING-2021 Shared Task: BERT-based semantic analyzer with word-level information
KH Lu, KY Chen
Proceedings of the 33rd Conference on Computational Linguistics and Speech …, 2021
2021
2020 福爾摩沙臺語語音辨識比賽之初步實驗 (A Preliminary Study of Formosa Speech Recognition Challenge 2020–Taiwanese ASR)
FH Yu, KH Lu, YW Wang, WZ Chang, WK Huang, KY Chen
International Journal of Computational Linguistics & Chinese Language …, 2021
2021
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–16