Multi-modal siamese network for entity alignment

L Chen, Z Li, T Xu, H Wu, Z Wang, NJ Yuan… - Proceedings of the 28th …, 2022 - dl.acm.org
The booming of multi-modal knowledge graphs (MMKGs) has raised the imperative demand
for multi-modal entity alignment techniques, which facilitate the integration of multiple …

Videollm-mod: Efficient video-language streaming with mixture-of-depths vision computation

S Wu, J Chen, KQ Lin, Q Wang, Y Gao… - Advances in …, 2025 - proceedings.neurips.cc
A well-known dilemma in large vision-language models (eg, GPT-4, LLaVA) is that while
increasing the number of vision tokens generally enhances visual understanding, it also …

Interaction-aware drug package recommendation via policy gradient

Z Zheng, C Wang, T Xu, D Shen, P Qin, X Zhao… - ACM Transactions on …, 2023 - dl.acm.org
Recent years have witnessed the rapid accumulation of massive electronic medical records,
which highly support intelligent medical services such as drug recommendation. However …

Learning the explainable semantic relations via unified graph topic-disentangled neural networks

L Wu, H Zhao, Z Li, Z Huang, Q Liu… - ACM Transactions on …, 2023 - dl.acm.org
Graph Neural Networks (GNNs) such as Graph Convolutional Networks (GCNs) can
effectively learn node representations via aggregating neighbors based on the relation …

Collaboration-Aware Hybrid Learning for Knowledge Development Prediction

L Chen, C Qin, Y Sun, X Song, T Xu, H Zhu… - Proceedings of the ACM …, 2024 - dl.acm.org
In recent years, the rise of online Knowledge Management Systems (KMSs) has significantly
improved work efficiency in enterprises. Knowledge development prediction, as a critical …

Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs

L Chen, P Tong, Z **, Y Sun, J Ye, H **ong - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) have shown remarkable reasoning capabilities on complex
tasks, but they still suffer from out-of-date knowledge, hallucinations, and opaque decision …

Unified QA-aware knowledge graph generation based on multi-modal modeling

P Qin, J Yu, Y Gao, D Xu, Y Chen, S Wu, T Xu… - Proceedings of the 30th …, 2022 - dl.acm.org
Understanding the long duration videos' storyline is often considered a major challenge in
the field of video understanding. To promote research on understanding longer videos in the …

When I Fall in Love: Capturing Video-oriented Social Relationship Evolution via Attentive GNN

P Qin, S Wu, T Xu, Y Hao, F Feng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the booming of streaming media platforms, viewers now get used to watching dramas
and movies via online platforms with more intelligent services. Usually, character …

Communication-efficient federated learning with stagewise training strategy

Y Cheng, S Shen, X Liang, J Liu, J Chen, T Zhang… - Neural Networks, 2023 - Elsevier
The efficiency of communication across workers is a significant factor that affects the
performance of federated learning. Though periodic communication strategy is applied to …

Learning social relationship from videos via pre-trained multimodal transformer

Y Teng, C Song, B Wu - IEEE Signal Processing Letters, 2022 - ieeexplore.ieee.org
As a crucial task for video analysis, social relation recognition from characters provides
intelligent applications with great potential to better understand the behaviors or emotions of …