- Academic Search

AA Nafea, SA Alameri, RR Majeed… - … Journal of Machine …, 2024 - mesopotamian.press

In last years, computer vision has shown important advances, mainly using the application of
supervised machine learning (ML) and deep learning (DL) techniques. The objective of this …

सेव करें उद्धृत करें 36 में हवाला दिया गया मिलते-जुलते लेख सभी 6 वर्शन HTML रूप में देखें

[免费ChatGPT] [DeepSeek可用网址] [PDF] acm.org

Video description: A survey of methods, datasets, and evaluation metrics

N Aafaq, A Mian, W Liu, SZ Gilani, M Shah - ACM Computing Surveys …, 2019 - dl.acm.org

Video description is the automatic generation of natural language sentences that describe
the contents of a given video. It has applications in human-robot interaction, hel** the …

सेव करें उद्धृत करें 256 में हवाला दिया गया मिलते-जुलते लेख सभी 9 वर्शन

[免费ChatGPT] [DeepSeek可用网址] [PDF] thecvf.com

Video recap: Recursive captioning of hour-long videos

MM Islam, N Ho, X Yang, T Nagarajan… - Proceedings of the …, 2024 - openaccess.thecvf.com

Most video captioning models are designed to process short video clips of few seconds and
output text describing low-level visual concepts (eg objects scenes atomic actions). However …

सेव करें उद्धृत करें 31 में हवाला दिया गया मिलते-जुलते लेख सभी 7 वर्शन HTML रूप में देखें

[免费ChatGPT] [DeepSeek可用网址] [PDF] thecvf.com

Howto100m: Learning a text-video embedding by watching hundred million narrated video clips

A Miech, D Zhukov, JB Alayrac… - Proceedings of the …, 2019 - openaccess.thecvf.com

Learning text-video embeddings usually requires a dataset of video clips with manually
provided captions. However, such datasets are expensive and time consuming to create and …

सेव करें उद्धृत करें 1326 में हवाला दिया गया मिलते-जुलते लेख सभी 10 वर्शन HTML रूप में देखें

[免费ChatGPT] [DeepSeek可用网址] [PDF] thecvf.com

Object relational graph with teacher-recommended learning for video captioning

Z Zhang, Y Shi, C Yuan, B Li, P Wang… - Proceedings of the …, 2020 - openaccess.thecvf.com

Taking full advantage of the information from both vision and language is critical for the
video captioning task. Existing models lack adequate visual representation due to the …

सेव करें उद्धृत करें 373 में हवाला दिया गया मिलते-जुलते लेख सभी 8 वर्शन HTML रूप में देखें

[免费ChatGPT] [DeepSeek可用网址] [PDF] thecvf.com

Videos as space-time region graphs

X Wang, A Gupta - Proceedings of the European …, 2018 - openaccess.thecvf.com

How do humans recognize the action" opening a book"? We argue that there are two
important cues: modeling temporal shape dynamics and modeling functional relationships …

सेव करें उद्धृत करें 912 में हवाला दिया गया मिलते-जुलते लेख सभी 11 वर्शन HTML रूप में देखें

[免费ChatGPT] [DeepSeek可用网址] [PDF] thecvf.com

Hierarchical conditional relation networks for video question answering

TM Le, V Le, S Venkatesh… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com

Video question answering (VideoQA) is challenging as it requires modeling capacity to distill
dynamic visual artifacts and distant relations and to associate them with linguistic concepts …

सेव करें उद्धृत करें 322 में हवाला दिया गया मिलते-जुलते लेख सभी 11 वर्शन HTML रूप में देखें

[免费ChatGPT] [DeepSeek可用网址] [PDF] google.com

STAT: Spatial-temporal attention mechanism for video captioning

C Yan, Y Tu, X Wang, Y Zhang, X Hao… - IEEE transactions on …, 2019 - ieeexplore.ieee.org

Video captioning refers to automatic generate natural language sentences, which
summarize the video contents. Inspired by the visual attention mechanism of human beings …

सेव करें उद्धृत करें 407 में हवाला दिया गया मिलते-जुलते लेख सभी 6 वर्शन

[免费ChatGPT] [DeepSeek可用网址] [PDF] ustc.edu.cn

Video question answering via gradually refined attention over appearance and motion

D Xu, Z Zhao, J **ao, F Wu, H Zhang, X He… - Proceedings of the 25th …, 2017 - dl.acm.org

Recently image question answering (ImageQA) has gained lots of attention in the research
community. However, as its natural extension, video question answering (VideoQA) is less …

सेव करें उद्धृत करें 644 में हवाला दिया गया मिलते-जुलते लेख सभी 3 वर्शन

Video captioning with attention-based LSTM and semantic consistency

L Gao, Z Guo, H Zhang, X Xu… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org

Recent progress in using long short-term memory (LSTM) for image captioning has
motivated the exploration of their applications for video captioning. By taking a video as a …

सेव करें उद्धृत करें 689 में हवाला दिया गया मिलते-जुलते लेख सभी 4 वर्शन

अलर्ट बनाएं

उद्धृत करें

बेहतर खोज

मेरी लाइब्रेरी में सेव किया गया

Hierarchical recurrent neural encoder for video representation with application to captioning

A short review on supervised machine learning and deep learning techniques in computer vision

Video description: A survey of methods, datasets, and evaluation metrics

Video recap: Recursive captioning of hour-long videos

Howto100m: Learning a text-video embedding by watching hundred million narrated video clips

Object relational graph with teacher-recommended learning for video captioning

Videos as space-time region graphs

Hierarchical conditional relation networks for video question answering

STAT: Spatial-temporal attention mechanism for video captioning

Video question answering via gradually refined attention over appearance and motion

Video captioning with attention-based LSTM and semantic consistency