Google 학술 검색

M Abdar, M Kollati, S Kuraparthi… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Video captioning (VC) is a fast-moving, cross-disciplinary area of research that comprises
contributions from domains such as computer vision, natural language processing …

저장 인용 20회 인용 관련 학술자료 전체 7개의 버전

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Video captioning with aggregated features based on dual graphs and gated fusion

Y **, B Liu, J Wang - arxiv preprint arxiv:2308.06685, 2023 - arxiv.org

The application of video captioning models aims at translating the content of videos by using
accurate natural language. Due to the complex nature inbetween object interaction in the …

저장 인용 2회 인용 관련 학술자료 전체 2개의 버전 HTML 버전

[Free GPT-4]
[DeepSeek]

[PDF] abes.ac.in

A Study of Multimodal Colearning, Application in Biometrics and Authentication

S Avasthi, T Sanwal, A Prakash… - … Biometric and Machine …, 2023 - Wiley Online Library

Summary “Multimodality” refers to utilizing multiple communication methods to comprehend
our environment better and enhance the user's experience. Using multimodal data, we may …

저장 인용 1회 인용 관련 학술자료 전체 5개의 버전

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Video captioning with stacked attention and semantic hard pull

A review of deep learning for video captioning

Video captioning with aggregated features based on dual graphs and gated fusion

A Study of Multimodal Colearning, Application in Biometrics and Authentication