Macaw-LLM: Multi-modal language modeling with image, audio, video, and text integration | C Lyu, M Wu, L Wang, X Huang, B Liu, Z Du, S Shi, Z Tu | arXiv preprint arXiv:2306.09093, 2023 | 165 | 2023 |
On the cultural gap in text-to-image generation | B Liu, L Wang, C Lyu, Y Zhang, J Su, S Shi, Z Tu | ECAI 2024, 930-937, 2024 | 12 | 2024 |
Retrieval-augmented multi-modal chain-of-thoughts reasoning for large language models | B Liu, C Lyu, Z Min, Z Wang, J Su, L Wang | arXiv preprint arXiv:2312.01714, 2023 | 12 | 2023 |
Exploring optimal transport-based multi-grained alignments for text-molecule retrieval | Z Min, B Liu, L Zhang, J Song, J Su, S He, X Bo | 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM …, 2024 | 1 | 2024 |