A comprehensive review of the video-to-text problem

J Perez-Martin, B Bustos, SJF Guimaraes… - Artificial Intelligence …, 2022 - Springer
Research in the Vision and Language area encompasses challenging topics that seek to
connect visual and textual information. When the visual information is related to videos, this …

Argus: Efficient activity detection system for extended video analysis

W Liu, G Kang, PY Huang, X Chang… - Proceedings of the …, 2020 - openaccess.thecvf.com
Abstract: We propose an Efficient Activity Detection System, Argus, for Extended Video
Analysis in the surveillance scenario. For the spatial-temporal event detection in the …

[HTML] A supercut of supercuts: aesthetics, histories, databases

M Tohline - Open Screens, 2021 - openscreensjournal.com
The genealogies of the supercut, which extend well past YouTube compilations, back to the
1920s and beyond, reveal it not as an aesthetic that trickled from avant-garde …

[PDF] FDU Participation in TRECVID 2019 VTT Task.

S Chen, YG Jiang - TRECVID, 2019 - www-nlpir.nist.gov
This notebook paper presents the system design of the FDU team in the TRECVID 2019 [1]
VTT task. Our approach adopts temporal concept prediction as an auxiliary task to assist …

[CITATION] Decoupled Deep Neural Network for Smoke Detection

S Luo, X Zhang, L Zhen, M Wang - 2021 - EasyChair