Google Scholar

Deep learning in natural language processing: A state-of-the-art survey

J Chai, A Li - … on Machine Learning and Cybernetics (ICMLC), 2019 - ieeexplore.ieee.org

Deep learning raises interests of research community as their overwhelming successes in
information processing such specific tasks as video/speech recognition. In this paper, we …

Cited by 80

[PDF] thecvf.com

Timeception for complex action recognition

N Hussein, E Gavves… - Proceedings of the …, 2019 - openaccess.thecvf.com

This paper focuses on the temporal aspect for recognizing human activities in videos; an
important visual cue that has long been undervalued. We revisit the conventional definition …

Cited by 272 - All 8 versions

[PDF] arxiv.org

Exploiting feature and class relationships in video categorization with regularized deep neural networks

YG Jiang, Z Wu, J Wang, X Xue… - IEEE transactions on …, 2017 - ieeexplore.ieee.org

In this paper, we study the challenging problem of categorizing videos according to
high-level semantics such as the existence of a particular human action or a complex event …

Cited by 458 - All 12 versions

[PDF] aaai.org

Video generation from text

Y Li, M Min, D Shen, D Carlson, L Carin - Proceedings of the AAAI …, 2018 - ojs.aaai.org

Generating videos from text has proven to be a significant challenge for existing generative
models. We tackle this problem by training a conditional generative model to extract both …

Cited by 303 - All 11 versions

[PDF] arxiv.org

Predicting visual features from text for image and video caption retrieval

J Dong, X Li, CGM Snoek - IEEE Transactions on Multimedia, 2018 - ieeexplore.ieee.org

This paper strives to find amidst a set of sentences the one best describing the content of a
given image or video. Different from existing works, which rely on a joint subspace for their …

Cited by 264 - All 9 versions

[PDF] thecvf.com

Multi-shot temporal event localization: a benchmark

X Liu, Y Hu, S Bai, F Ding, X Bai… - Proceedings of the …, 2021 - openaccess.thecvf.com

Current developments in temporal event or action localization usually target actions
captured by a single camera. However, extensive events or actions in the wild may be …

Cited by 110 - All 9 versions

[PDF] thecvf.com

Soccernet: A scalable dataset for action spotting in soccer videos

S Giancola, M Amine, T Dghaily… - Proceedings of the …, 2018 - openaccess.thecvf.com

In this paper, we introduce SoccerNet, a benchmark for action spotting in soccer videos. The
dataset is composed of 500 complete soccer games from six main European leagues …

Cited by 229 - All 13 versions

[PDF] lixirong.net

W2vv++ fully deep learning for ad-hoc video search

X Li, C Xu, G Yang, Z Chen, J Dong - Proceedings of the 27th ACM …, 2019 - dl.acm.org

Ad-hoc video search (AVS) is an important yet challenging problem in multimedia retrieval.
Different from previous concept-based methods, we propose a fully deep learning method …

Cited by 157 - All 6 versions

[PDF] arxiv.org

Hawkes processes for events in social media

MA Rizoiu, Y Lee, S Mishra, L Xie - Frontiers of multimedia research, 2017 - dl.acm.org

This chapter provides an accessible introduction for point processes, and especially Hawkes
processes, for modeling discrete, inter-dependent events over continuous time. We start by …

Cited by 195 - All 5 versions

[PDF] arxiv.org

Omni-sourced webly-supervised learning for video recognition

H Duan, Y Zhao, Y Xiong, W Liu, D Lin - European conference on …, 2020 - Springer

We introduce OmniSource, a novel framework for leveraging web data to train video
recognition models. OmniSource overcomes the barriers between data formats, such as …

Cited by 116 - All 9 versions

Eventnet: A large scale structured concept library for complex event detection in video

Deep learning in natural language processing: A state-of-the-art survey‏