Zhi Zhong
Title
Cited by
Year
Diffusion-based speech enhancement with joint generative and predictive decoders
H Shi, K Shimada, M Hirano, T Shibuya, Y Koyama, Z Zhong, S Takahashi, ...
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 16, Year: 2024
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification
Z Zhong, M Hirano, K Shimada, K Tateishi, S Takahashi, Y Mitsufuji
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 15, Year: 2023
SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation
K Saito, D Kim, T Shibuya, CH Lai, Z Zhong, Y Takida, Y Mitsufuji
arXiv preprint arXiv:2405.18503, 2024
Cited by: 5, Year: 2024
Extending Audio Masked Autoencoders Toward Audio Restoration
Z Zhong, H Shi, M Hirano, K Shimada, K Tateishi, T Shibuya, S Takahashi, ...
WASPAA 2023 - 2023 IEEE Workshop on Applications of Signal Processing to Audio …, 2023
Cited by: 4, Year: 2023
Assessment of a beamforming implementation developed for surface sound source separation
Z Zhong, M Shakeel, K Itoyama, K Nishida, K Nakadai
2021 IEEE/SICE International Symposium on System Integration (SII), 369-374, 2021
Cited by: 4, Year: 2021
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond
M Comunita, Z Zhong, A Takahashi, S Yang, M Zhao, K Saito, Y Ikemiya, ...
arXiv preprint arXiv:2406.17672, 2024
Cited by: 3, Year: 2024
Design and assessment of a scan-and-sum beamformer for surface sound source separation
Z Zhong, K Itoyama, K Nishida, K Nakadai
2020 IEEE/SICE International Symposium on System Integration (SII), 808-813, 2020
Cited by: 3, Year: 2020
OpenMU: Your Swiss Army Knife for Music Understanding
M Zhao, Z Zhong, Z Mao, S Yang, WH Liao, S Takahashi, H Wakaki, ...
arXiv preprint arXiv:2410.15573, 2024
Cited by: 1, Year: 2024
Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation
S Yang, Z Zhong, M Zhao, S Takahashi, M Ishii, T Shibuya, Y Mitsufuji
arXiv preprint arXiv:2405.14598, 2024
Cited by: 1, Year: 2024
Music Foundation Model as Generic Booster for Music Downstream Tasks
WH Liao, Y Takida, Y Ikemiya, Z Zhong, CH Lai, G Fabbro, K Shimada, ...
arXiv preprint arXiv:2411.01135, 2024
2024
VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression
Y Chae, W Choi, Y Takida, J Koo, Y Ikemiya, Z Zhong, KW Cheuk, ...
arXiv preprint arXiv:2410.06016, 2024
2024
On the Language Encoder of Contrastive Cross-modal Models
M Zhao, J Ono, Z Zhong, CH Lai, Y Takida, N Murata, WH Liao, T Shibuya, ...
arXiv preprint arXiv:2310.13267, 2023
2023
Source separation device, source separation method and program
Kazuhiro Nakadai, Zhi Zhong, Katsutoshi Itoyama
JP Patent 特許第7316614号, 2023
2023
FLEXOUNDIT: Variable-Length Diffusion Transformer for Text-to-Audio Generation
Z Zhong, Y Ikemiya, K Toyama, WH Liao, S Takahashi, Y Mitsufuji