Pedm: A multi-task learning model for persona-aware emoji-embedded dialogue generation

S Zhao, H Jiang, H Tao, R Zha, K Zhang, T Xu… - ACM Transactions on …, 2023 - dl.acm.org
As a vivid and linguistic symbol, Emojis have become a prevailing medium interspersed in
text-based communication (eg, social media and chit-chat) to express emotions, attitudes …

When I fall in love: Capturing video-oriented social relationship evolution via attentive GNN

P Qin, S Wu, T Xu, Y Hao, F Feng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the booming of streaming media platforms, viewers now get used to watching dramas
and movies via online platforms with more intelligent services. Usually, character …

Semantic interaction matching network for few-shot knowledge graph completion

P Luo, X Zhu, T Xu, Y Zheng, E Chen - ACM Transactions on the Web, 2024 - dl.acm.org
The prosperity of knowledge graphs, as well as related downstream applications, has raised
the urgent need for knowledge graph completion techniques that fully support knowledge …

[PDF][PDF] Whu-Nercms At Trecvid2023: AdHoc Video Search (AVS) And Deep Video Understanding (DVU) Tasks

J He, R Li, J Guo, H Zhang, M Li, Z Wu, Z Wang, B Du… - TRECVID, 2023 - www-nlpir.nist.gov
The WHU-NERCMS team participated in the Ad-hoc Vedio Search (AVS) and Deep Video
Understanding (DVU) tasks at TRECVID 2023. For AVS task, we chose to utilize embedding …

Deep video understanding with video-language model

R Liu, Y Fang, F Yu, R Tian, T Ren, G Wu - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Pre-trained video-language models (VLMs) have shown superior performance in high-level
video understanding tasks, analyzing multi-modal information, aligning with Deep Video …

A Hierarchical Deep Video Understanding Method with Shot-Based Instance Search and Large Language Model

R Li, J Guo, M Li, Z Wu, C Liang - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Deep video understanding (DVU) is often considered a challenge due to the aim of
interpreting a video with storyline, which is designed to solve two levels of problems …

Multi-modal Entity Alignment via Position-enhanced Multi-label Propagation

W Tang, Y Wang - Proceedings of the 2024 International Conference on …, 2024 - dl.acm.org
Multi-modal Entity Alignment (MMEA) refers to utilizing multiple modalities such as text,
images, videos, etc., to match entities from multiple knowledge graphs. Compared to single …

Knowledge-Enhanced Multi-perspective Incongruity Perception Network for Multimodal Sarcasm Detection

Z Niu, Z **e, T Xu, X Wang, Y Hu, Y Yu… - … on Multimedia and …, 2024 - ieeexplore.ieee.org
Recent years have witnessed the urgent request for multi-modal sarcasm detection in social
media platforms. Though large efforts have been made with significant progress, prior arts …

Automated SPARQL Template for Flexible Question Answering.

D Wardani, A Wijaya, A Wijayanto… - International …, 2024 - search.ebscohost.com
The knowledge bases required the query language SPARQL, which consists of subject,
property, and object. SPARQL is a structured query language and is difficult to understand …

A Deep Understanding Video Q&A System for Film Education in Acting Department

Z Wu, R Li, J Guo, Z Wang… - … Conference on Intelligent …, 2023 - ieeexplore.ieee.org
Recently, advancements in artificial intelligence technology have greatly influenced the field
of education, particularly in the area of intelligent homework assistance. However, current …