Kaitao Song
Senior Researcher, Microsoft Research
Verified email at microsoft.com
Title
Cited by
Year
Pyramid vision transformer: A versatile backbone for dense prediction without convolutions
W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao
ICCV 2021, 2021
Cited by: 4752 · Year: 2021
Pvt v2: Improved baselines with pyramid vision transformer
W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao
Computational visual media 8 (3), 415-424, 2022
Cited by: 1707 · Year: 2022
Mpnet: Masked and permuted pre-training for language understanding
K Song, X Tan, T Qin, J Lu, TY Liu
NeurIPS 2020, 2020
Cited by: 1252 · Year: 2020
Mass: Masked sequence to sequence pre-training for language generation
K Song, X Tan, T Qin, J Lu, TY Liu
ICML 2019, 2019
Cited by: 1202 · Year: 2019
HuggingGPT: Solving AI tasks with ChatGPT and Its Friends in Huggingface
Y Shen, K Song, X Tan, D Li, W Lu, Y Zhuang
NeurIPS 2023, 2023
Cited by: 1079 · Year: 2023
NaturalSpeech 3: Zero-shot speech synthesis with factorized codec and diffusion models
Z Ju, Y Wang, K Shen, X Tan, D Xin, D Yang, Y Liu, Y Leng, K Song, ...
ICML 2024, 2024
Cited by: 145 · Year: 2024
NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search
J Xu, X Tan, R Luo, K Song, J Li, T Qin, TY Liu
KDD 2021, 2021
Cited by: 86 · Year: 2021
SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint
Z Sheng, K Song, X Tan, Y Ren, W Ye, S Zhang, T Qin
AAAI 2021, 2020
Cited by: 74 · Year: 2020
Bi-modal progressive mask attention for fine-grained recognition
K Song, XS Wei, X Shu, RJ Song, J Lu
IEEE Transactions on Image Processing 29, 7006-7018, 2020
Cited by: 65 · Year: 2020
DiffusionNER: Boundary Diffusion for Named Entity Recognition
Y Shen, K Song, X Tan, D Li, W Lu, Y Zhuang
ACL 2023, 2023
Cited by: 57 · Year: 2023
DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling
L Xue, K Song, D Wu, X Tan, NL Zhang, T Qin, WQ Zhang, TY Liu
ACL 2021, 2021
Cited by: 43 · Year: 2021
Easytool: Enhancing llm-based agents with concise tool instruction
S Yuan, K Song, J Chen, X Tan, Y Shen, R Kan, D Li, D Yang
NAACL 2025, 2024
Cited by: 41 · Year: 2024
Prompttts 2: Describing and generating voices with text prompt
Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ...
ICLR 2024, 2023
Cited by: 41 · Year: 2023
Taskbench: Benchmarking large language models for task automation
Y Shen, K Song, X Tan, W Zhang, K Ren, S Yuan, W Lu, D Li, Y Zhuang
NeurIPS 2024, 2023
Cited by: 36 · Year: 2023
Generating adversarial examples with conditional generative adversarial net
P Yu, K Song, J Lu
2018 24th International conference on pattern recognition (ICPR), 676-681, 2018
Cited by: 33 · Year: 2018
Analyzing and Mitigating Interference in Neural Architecture Search
J Xu, X Tan, K Song, R Luo, Y Leng, T Qin, TY Liu, J Li
ICML 2022, 2021
Cited by: 32 · Year: 2021
Learning domain invariant prompt for vision-language models
C Zhao, Y Wang, X Jiang, Y Shen, K Song, D Li, D Miao
IEEE Transactions on Image Processing 33, 1348-1360, 2024
Cited by: 25 · Year: 2024
Mixed-phoneme bert: Improving bert with mixed phoneme and sup-phoneme representations for text to speech
G Zhang, K Song, X Tan, D Tan, Y Yan, Y Liu, G Wang, W Zhou, T Qin, ...
INTERSPEECH 2022, 2022
Cited by: 25 · Year: 2022
SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Y Leng, X Tan, W Liu, K Song, R Wang, XY Li, T Qin, E Lin, TY Liu
AAAI 2023, 2022
Cited by: 21 · Year: 2022
Learning to teach large language models logical reasoning
M Chen, Y Ma, K Song, Y Cao, Y Zhang, D Li
ACL 2024, 2023
Cited by: 19* · Year: 2023