NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs
A Neural Radiance Field (NeRF) encodes the specific relation of 3D geometry and
appearance of a scene. We here ask the question whether we can transfer the appearance …
Follow Anything: Open-Set Detection, Tracking, and Following in Real-Time
Tracking and following objects of interest is critical to several robotics use cases, ranging
from industrial automation to logistics and warehousing, to healthcare and security. In this …
Moho: Learning single-view hand-held object reconstruction with multi-view occlusion-aware supervision
Previous works concerning single-view hand-held object reconstruction typically rely on
supervision from 3D ground-truth models which are hard to collect in real world. In contrast …
PartCraft: Crafting Creative Objects by Parts
This paper propels creative control in generative visual AI by allowing users to “select”.
Departing from traditional text or sketch-based methods, we for the first time allow users to …
Unbiased single-cell morphology with self-supervised vision transformers
Accurately quantifying cellular morphology at scale could substantially empower existing
single-cell approaches. However, measuring cell morphology remains an active field of …
MS-DINO: Masked self-supervised distributed learning using vision transformer
Despite promising advancements in deep learning in medical domains, challenges still
remain owing to data scarcity, compounded by privacy concerns and data ownership …
Learning Video Representations without Natural Videos
We show that useful video representations can be learned from synthetic videos and natural
images, without incorporating natural videos in the training. We propose a progression of …
Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models
Our brains represent the ever-changing environment with neurons in a highly dynamic
fashion. The temporal features of visual pixels in dynamic natural scenes are entrapped in …
Categorical Keypoint Positional Embedding for Robust Animal Re-Identification
Y Lin, L Liu, J Shi - arXiv preprint arXiv:2412.00818, 2024 - arxiv.org
Animal re-identification (ReID) has become an indispensable tool in ecological research,
playing a critical role in tracking population dynamics, analyzing behavioral patterns, and …
Interactive Teaching For Fine-Granular Few-Shot Object Recognition Using Vision Transformers
In real-world few-shot image classification tasks the lack of abundant data makes training
and testing very challenging. The classification model must learn the most meaningful …