- Academic Search

W Ikezogwo, S Seyfioglu, F Ghezloo… - Advances in neural …, 2023 - proceedings.neurips.cc

Recent accelerations in multi-modal applications have been made possible with the
plethora of image and text data available online. However, the scarcity of analogous data in …

保存引用被引用次数：100 相关文章所有 8 个版本 HTML 版

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Dreamstruct: Understanding slides and user interfaces via synthetic data generation

YH Peng, F Huq, Y Jiang, J Wu, XY Li… - … on Computer Vision, 2024 - Springer

Enabling machines to understand structured visuals like slides and user interfaces is
essential for making them accessible to people with disabilities. However, achieving such …

保存引用被引用次数：4 相关文章所有 16 个版本

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Large-scale video retrieval using image queries

A Araujo, B Girod - IEEE transactions on circuits and systems …, 2017 - ieeexplore.ieee.org

Retrieving videos from large repositories using image queries is important for many
applications, such as brand monitoring or content linking. We introduce a new retrieval …

保存引用被引用次数：103 相关文章所有 6 个版本图书馆搜索

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Automatic prediction of presentation style and student engagement from videos

C Thomas, KAVP Sarma, SS Gajula… - Computers and Education …, 2022 - Elsevier

Presentation style is an important dimension to be considered for delivering lectures or
presentations. It affects the quality of the content delivery as well as the engagement of the …

保存引用被引用次数：24 相关文章所有 4 个版本

Semantic navigation of powerpoint-based lecture video for autonote generation

C Xu, W Jia, R Wang, X He, B Zhao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

With the increasing popularity of open educational resources in the past few decades, more
and more users watch online videos to gain knowledge. However, most educational videos …

保存引用被引用次数：16 相关文章所有 4 个版本

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Neural image representations for multi-image fusion and layer separation

S Nam, MA Brubaker, MS Brown - European conference on computer …, 2022 - Springer

We propose a framework for aligning and fusing multiple images into a single view using
neural image representations (NIRs), also known as implicit or coordinate-based neural …

保存引用被引用次数：18 相关文章所有 6 个版本

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

A generic framework for generation of summarized video clips using transfer learning (SumVClip)

R Mahum, A Irtaza, M Nawaz, T Nazir… - 2021 Mohammad Ali …, 2021 - ieeexplore.ieee.org

Video summarization aims to produce highlights of the original video showing informative
key events. Now a days video content is increasing enormously therefor to store, browse …

保存引用被引用次数：11 相关文章所有 2 个版本

[Free GPT-4]
[DeepSeek]

[PDF] wiley.com Full View

Large‐scale video retrieval via deep local convolutional features

C Zhang, B Hu, Y Suo, Z Zou, Y Ji - Advances in Multimedia, 2020 - Wiley Online Library

In this paper, we study the challenge of image‐to‐video retrieval, which uses the query
image to search relevant frames from a large collection of videos. A novel framework based …

保存引用被引用次数：18 相关文章所有 8 个版本

[Free GPT-4]
[DeepSeek]

[PDF] kit.edu

Wise—slide segmentation in the wild

M Haurilet, A Roitberg, M Martinez… - 2019 International …, 2019 - ieeexplore.ieee.org

We address the task of segmenting presentation slides, where the examined page was
captured as a live photo during lectures. Slides are important document types used as visual …

保存引用被引用次数：16 相关文章所有 4 个版本

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

[PDF][PDF] A deep analysis of image based video searching techniques

S Anayat, A Sikandar, SA Rasheed… - International Journal of …, 2020 - researchgate.net

For many applications like brand monitoring, it's important to search a video from large
database using image as query [1]. Numerous visual search technologies have emerged …

保存引用被引用次数：8 相关文章所有 5 个版本 HTML 版

创建快讯

引用

高级搜索

已保存到“我的图书馆”

Large-scale query-by-image video retrieval using bloom filters

Quilt-1m: One million image-text pairs for histopathology

Dreamstruct: Understanding slides and user interfaces via synthetic data generation

Large-scale video retrieval using image queries

[HTML][HTML] Automatic prediction of presentation style and student engagement from videos

Semantic navigation of powerpoint-based lecture video for autonote generation

Neural image representations for multi-image fusion and layer separation

A generic framework for generation of summarized video clips using transfer learning (SumVClip)

Large‐scale video retrieval via deep local convolutional features

Wise—slide segmentation in the wild

[PDF][PDF] A deep analysis of image based video searching techniques