Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Efficient structure from motion for large-scale UAV images: A review and a comparison of SfM tools

S Jiang, C Jiang, W Jiang - ISPRS Journal of Photogrammetry and Remote …, 2020 - Elsevier
Unmanned aerial vehicle (UAV) images have gained extensive attention in varying fields,
and the Structure from Motion (SfM) technique has become the gold standard for aerial …

[PDF] The dawn of LMMs: Preliminary explorations with GPT-4V(ision)

Z Yang, L Li, K Lin, J Wang, CC Lin… - arXiv preprint arXiv …, 2023 - stableaiprompts.com
Large multimodal models (LMMs) extend large language models (LLMs) with multi-sensory
skills, such as visual understanding, to achieve stronger generic intelligence. In this paper …

[BOOK] Computer vision: algorithms and applications

R Szeliski - 2022 - books.google.com
Humans perceive the three-dimensional structure of the world with apparent ease. However,
despite all of the recent advances in computer vision research, the dream of having a …

Large-scale image retrieval with attentive deep local features

H Noh, A Araujo, J Sim, T Weyand… - Proceedings of the …, 2017 - openaccess.thecvf.com
We propose an attentive local feature descriptor suitable for large-scale image retrieval,
referred to as DELF (DEep Local Feature). The new feature is based on convolutional neural …

DOLG: Single-stage image retrieval with deep orthogonal fusion of local and global features

M Yang, D He, M Fan, B Shi, X Xue… - Proceedings of the …, 2021 - openaccess.thecvf.com
Image Retrieval is a fundamental task of obtaining images similar to the query one from a
database. A common image retrieval practice is to firstly retrieve candidate images via …

What makes Paris look like Paris?

C Doersch, S Singh, A Gupta, J Sivic… - Communications of the …, 2015 - dl.acm.org
Given a large repository of geo-tagged imagery, we seek to automatically find visual
elements, for example windows, balconies, and street signs, that are most distinctive for a …

PlaNet - photo geolocation with convolutional neural networks

T Weyand, I Kostrikov, J Philbin - … , The Netherlands, October 11-14, 2016 …, 2016 - Springer
Is it possible to determine the location of a photo from just its pixels? While the general
problem seems exceptionally difficult, photos often contain cues such as landmarks, weather …

Lifelogging: Personal big data

C Gurrin, AF Smeaton, AR Doherty - Foundations and Trends® …, 2014 - nowpublishers.com
We have recently observed a convergence of technologies to foster the emergence of
lifelogging as a mainstream activity. Computer storage has become significantly cheaper …

Using AI and social media multimodal content for disaster response and management: Opportunities, challenges, and future directions

M Imran, F Ofli, D Caragea, A Torralba - Information Processing & …, 2020 - Elsevier
People increasingly use Social Media (SM) platforms such as Twitter and Facebook
during disasters and emergencies to post situational updates including reports of injured or …