Quilt-1m: One million image-text pairs for histopathology

W Ikezogwo, S Seyfioglu, F Ghezloo… - Advances in neural …, 2023 - proceedings.neurips.cc
Recent accelerations in multi-modal applications have been made possible with the
plethora of image and text data available online. However, the scarcity of analogous data in …

Dreamstruct: Understanding slides and user interfaces via synthetic data generation

YH Peng, F Huq, Y Jiang, J Wu, XY Li… - … on Computer Vision, 2024 - Springer
Enabling machines to understand structured visuals like slides and user interfaces is
essential for making them accessible to people with disabilities. However, achieving such …

Large-scale video retrieval using image queries

A Araujo, B Girod - IEEE transactions on circuits and systems …, 2017 - ieeexplore.ieee.org
Retrieving videos from large repositories using image queries is important for many
applications, such as brand monitoring or content linking. We introduce a new retrieval …

Automatic prediction of presentation style and student engagement from videos

C Thomas, KAVP Sarma, SS Gajula… - Computers and Education …, 2022 - Elsevier
Presentation style is an important dimension to be considered for delivering lectures or
presentations. It affects the quality of the content delivery as well as the engagement of the …

Semantic navigation of powerpoint-based lecture video for autonote generation

C Xu, W Jia, R Wang, X He, B Zhao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the increasing popularity of open educational resources in the past few decades, more
and more users watch online videos to gain knowledge. However, most educational videos …

Neural image representations for multi-image fusion and layer separation

S Nam, MA Brubaker, MS Brown - European conference on computer …, 2022 - Springer
We propose a framework for aligning and fusing multiple images into a single view using
neural image representations (NIRs), also known as implicit or coordinate-based neural …

A generic framework for generation of summarized video clips using transfer learning (SumVClip)

R Mahum, A Irtaza, M Nawaz, T Nazir… - 2021 Mohammad Ali …, 2021 - ieeexplore.ieee.org
Video summarization aims to produce highlights of the original video showing informative
key events. Nowadays, video content is increasing enormously; therefore, to store, browse …

Large‐scale video retrieval via deep local convolutional features

C Zhang, B Hu, Y Suo, Z Zou, Y Ji - Advances in Multimedia, 2020 - Wiley Online Library
In this paper, we study the challenge of image‐to‐video retrieval, which uses the query
image to search relevant frames from a large collection of videos. A novel framework based …

Wise—slide segmentation in the wild

M Haurilet, A Roitberg, M Martinez… - 2019 International …, 2019 - ieeexplore.ieee.org
We address the task of segmenting presentation slides, where the examined page was
captured as a live photo during lectures. Slides are important document types used as visual …

A deep analysis of image based video searching techniques

S Anayat, A Sikandar, SA Rasheed… - International Journal of …, 2020 - researchgate.net
For many applications like brand monitoring, it's important to search for a video in a large
database using an image as the query [1]. Numerous visual search technologies have emerged …