Multi-modal siamese network for entity alignment
The booming of multi-modal knowledge graphs (MMKGs) has raised the imperative demand
for multi-modal entity alignment techniques, which facilitate the integration of multiple …
for multi-modal entity alignment techniques, which facilitate the integration of multiple …
Videollm-mod: Efficient video-language streaming with mixture-of-depths vision computation
A well-known dilemma in large vision-language models (eg, GPT-4, LLaVA) is that while
increasing the number of vision tokens generally enhances visual understanding, it also …
increasing the number of vision tokens generally enhances visual understanding, it also …
Interaction-aware drug package recommendation via policy gradient
Recent years have witnessed the rapid accumulation of massive electronic medical records,
which highly support intelligent medical services such as drug recommendation. However …
which highly support intelligent medical services such as drug recommendation. However …
Learning the explainable semantic relations via unified graph topic-disentangled neural networks
Graph Neural Networks (GNNs) such as Graph Convolutional Networks (GCNs) can
effectively learn node representations via aggregating neighbors based on the relation …
effectively learn node representations via aggregating neighbors based on the relation …
Collaboration-Aware Hybrid Learning for Knowledge Development Prediction
In recent years, the rise of online Knowledge Management Systems (KMSs) has significantly
improved work efficiency in enterprises. Knowledge development prediction, as a critical …
improved work efficiency in enterprises. Knowledge development prediction, as a critical …
Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs
Large Language Models (LLMs) have shown remarkable reasoning capabilities on complex
tasks, but they still suffer from out-of-date knowledge, hallucinations, and opaque decision …
tasks, but they still suffer from out-of-date knowledge, hallucinations, and opaque decision …
Unified QA-aware knowledge graph generation based on multi-modal modeling
Understanding the long duration videos' storyline is often considered a major challenge in
the field of video understanding. To promote research on understanding longer videos in the …
the field of video understanding. To promote research on understanding longer videos in the …
When I Fall in Love: Capturing Video-oriented Social Relationship Evolution via Attentive GNN
With the booming of streaming media platforms, viewers now get used to watching dramas
and movies via online platforms with more intelligent services. Usually, character …
and movies via online platforms with more intelligent services. Usually, character …
Communication-efficient federated learning with stagewise training strategy
The efficiency of communication across workers is a significant factor that affects the
performance of federated learning. Though periodic communication strategy is applied to …
performance of federated learning. Though periodic communication strategy is applied to …
Learning social relationship from videos via pre-trained multimodal transformer
As a crucial task for video analysis, social relation recognition from characters provides
intelligent applications with great potential to better understand the behaviors or emotions of …
intelligent applications with great potential to better understand the behaviors or emotions of …