Gpt4point: A unified framework for point-language understanding and generation

Z Qi, Y Fang, Z Sun, X Wu, T Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Multimodal Large Language Models (MLLMs) have excelled in 2D image-text
comprehension and image generation but their understanding of the 3D world is notably …

Fine-grained side information guided dual-prompts for zero-shot skeleton action recognition

Y Chen, J Guo, T He, X Lu, L Wang - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
Skeleton-based zero-shot action recognition aims to recognize unknown human actions
based on the learned priors of the known skeleton-based actions and a semantic descriptor …

Pre-trained Graphformer-based Ranking at Web-scale Search

Y Li, H **ong, L Kong, Z Sun, H Chen, S Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Both Transformer and Graph Neural Networks (GNNs) have been employed in the domain
of learning to rank (LTR). However, these approaches adhere to two distinct yet …

Generative Pre-trained Ranking Model with Over-parameterization at Web-Scale

Y Li, H **ong, L Kong, J Bian, S Wang, G Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
Learning to rank (LTR) is widely employed in web searches to prioritize pertinent webpages
from retrieved content based on input queries. However, traditional LTR models encounter …