Computer vision for autonomous vehicles: Problems, datasets and state of the art
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …
Efficient structure from motion for large-scale UAV images: A review and a comparison of SfM tools
Unmanned aerial vehicle (UAV) images have gained extensive attention in varying fields,
and the Structure from Motion (SfM) technique has become the gold standard for aerial …
The dawn of LMMs: Preliminary explorations with GPT-4V(ision)
Large multimodal models (LMMs) extend large language models (LLMs) with multi-sensory
skills, such as visual understanding, to achieve stronger generic intelligence. In this paper …
Computer vision: algorithms and applications
R Szeliski - 2022 - books.google.com
Humans perceive the three-dimensional structure of the world with apparent ease. However,
despite all of the recent advances in computer vision research, the dream of having a …
Large-scale image retrieval with attentive deep local features
We propose an attentive local feature descriptor suitable for large-scale image retrieval,
referred to as DELF (DEep Local Feature). The new feature is based on convolutional neural …
DOLG: Single-stage image retrieval with deep orthogonal fusion of local and global features
Image Retrieval is a fundamental task of obtaining images similar to the query one from a
database. A common image retrieval practice is to firstly retrieve candidate images via …
What makes Paris look like Paris?
Given a large repository of geo-tagged imagery, we seek to automatically find visual
elements, for example windows, balconies, and street signs, that are most distinctive for a …
PlaNet: Photo geolocation with convolutional neural networks
Is it possible to determine the location of a photo from just its pixels? While the general
problem seems exceptionally difficult, photos often contain cues such as landmarks, weather …
Lifelogging: Personal big data
We have recently observed a convergence of technologies to foster the emergence of
lifelogging as a mainstream activity. Computer storage has become significantly cheaper …
Using AI and social media multimodal content for disaster response and management: Opportunities, challenges, and future directions
People increasingly use Social Media (SM) platforms such as Twitter and Facebook
during disasters and emergencies to post situational updates including reports of injured or …