Zhi Zhong

Cited by

	All	Since 2020
Citations	52	52
h-index	4	4
i10-index	2	2

Citations per year: 2021 (1), 2022 (3), 2023 (11), 2024 (30), 2025 (7)

Co-authors

Yuki Mitsufuji, Distinguished Engineer, Sony; Specially Appointed Associate Professor, Tokyo Institute of Technology (verified email at sony.com)
Takashi Shibuya, Sony (verified email at sony.com)
Kazuki Shimada, Sony (verified email at sony.com)
Shusuke Takahashi, Sony Group Corporation (verified email at sony.com)
Hao Shi, Kyoto University (verified email at kyoto-u.ac.jp)
Mengjie Zhao, Sony Group Corporation (verified email at cis.lmu.de)
Yuhta Takida, Sony AI (verified email at sony.com)
Yukara Ikemiya, Sony (verified email at sony.com)
Shiqi Yang, Research Scientist, SB Intuitions, SoftBank (verified email at sbintuitions.co.jp)
Chieh-Hsin Lai (Jesse), Sony AI; Visiting Assistant Professor of Applied Math, National Yang Ming Chiao Tung University (verified email at sony.com)
Koichi Saito, Sony AI (verified email at sony.com)
Katsutoshi Itoyama, Tokyo Institute of Technology (verified email at ra.sc.e.titech.ac.jp)
Kazuhiro Nakadai, Institute of Science Tokyo (verified email at ra.sc.e.titech.ac.jp)
WeiHsiang Liao, Sony Research Inc. (verified email at sony.com)
Tatsuya Kawahara, Professor, School of Informatics, Kyoto University (verified email at i.kyoto-u.ac.jp)
Dongjun Kim, Stanford University (verified email at stanford.edu)
Hiromi Wakaki, Sony Group Corporation (verified email at sony.com)
Marco Comunità, PhD researcher at Queen Mary University of London (verified email at qmul.ac.uk)
Marco A. Martinez-Ramirez, Music technology researcher, Sony AI (verified email at sony.com)
Woosung Choi, SonyAI (verified email at sony.com)

Zhi Zhong

Sony

Verified email at sony.com

Audio Representation Learning, Music Technology, AI-based Contents Creation, Deep Generative Models


Title	Cited by	Year
Diffusion-based speech enhancement with joint generative and predictive decoders. H Shi, K Shimada, M Hirano, T Shibuya, Y Koyama, Z Zhong, S Takahashi, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	16	2024
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification. Z Zhong, M Hirano, K Shimada, K Tateishi, S Takahashi, Y Mitsufuji. ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	15	2023
SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation. K Saito, D Kim, T Shibuya, CH Lai, Z Zhong, Y Takida, Y Mitsufuji. arXiv preprint arXiv:2405.18503, 2024	5	2024
Extending Audio Masked Autoencoders Toward Audio Restoration. Z Zhong, H Shi, M Hirano, K Shimada, K Tateishi, T Shibuya, S Takahashi, ... WASPAA 2023-2023 IEEE Workshop on Applications of Signal Processing to Audio …, 2023	4	2023
Assessment of a beamforming implementation developed for surface sound source separation. Z Zhong, M Shakeel, K Itoyama, K Nishida, K Nakadai. 2021 IEEE/SICE International Symposium on System Integration (SII), 369-374, 2021	4	2021
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond. M Comunita, Z Zhong, A Takahashi, S Yang, M Zhao, K Saito, Y Ikemiya, ... arXiv preprint arXiv:2406.17672, 2024	3	2024
Design and assessment of a scan-and-sum beamformer for surface sound source separation. Z Zhong, K Itoyama, K Nishida, K Nakadai. 2020 IEEE/SICE International Symposium on System Integration (SII), 808-813, 2020	3	2020
OpenMU: Your Swiss Army Knife for Music Understanding. M Zhao, Z Zhong, Z Mao, S Yang, WH Liao, S Takahashi, H Wakaki, ... arXiv preprint arXiv:2410.15573, 2024	1	2024
Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation. S Yang, Z Zhong, M Zhao, S Takahashi, M Ishii, T Shibuya, Y Mitsufuji. arXiv preprint arXiv:2405.14598, 2024	1	2024
Music Foundation Model as Generic Booster for Music Downstream Tasks. WH Liao, Y Takida, Y Ikemiya, Z Zhong, CH Lai, G Fabbro, K Shimada, ... arXiv preprint arXiv:2411.01135, 2024		2024
VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression. Y Chae, W Choi, Y Takida, J Koo, Y Ikemiya, Z Zhong, KW Cheuk, ... arXiv preprint arXiv:2410.06016, 2024		2024
On the Language Encoder of Contrastive Cross-modal Models. M Zhao, J Ono, Z Zhong, CH Lai, Y Takida, N Murata, WH Liao, T Shibuya, ... arXiv preprint arXiv:2310.13267, 2023		2023
Source separation device, source separation method and program. KN Kazuhiro Nakadai, Zhi Zhong, Katsutoshi Itoyama. JP Patent 特許第7316614号, 2023		2023
FLEXOUNDIT: VARIABLE-LENGTH DIFFUSION TRANSFORMER FOR TEXT-TO-AUDIO GENERATION. Z Zhong, Y Ikemiya, K Toyama, WH Liao, S Takahashi, Y Mitsufuji
