Why do we click: visual impression-aware news recommendation
There is a soaring interest in the news recommendation research scenario due to the
information overload. To accurately capture users' interests, we propose to model multi …
information overload. To accurately capture users' interests, we propose to model multi …
A roadmap for big model
With the rapid development of deep learning, training Big Models (BMs) for multiple
downstream tasks becomes a popular paradigm. Researchers have achieved various …
downstream tasks becomes a popular paradigm. Researchers have achieved various …
Where did i leave my keys?-episodic-memory-based question answering on egocentric videos
Humans have a remarkable ability to organize, compress and retrieve episodic memories
throughout their daily life. Current AI systems, however, lack comparable capabilities as they …
throughout their daily life. Current AI systems, however, lack comparable capabilities as they …
Grounded Question-Answering in Long Egocentric Videos
Existing approaches to video understanding mainly designed for short videos from a third-
person perspective are limited in their applicability in certain fields such as robotics. In this …
person perspective are limited in their applicability in certain fields such as robotics. In this …
Encode-Store-Retrieve: Enhancing Memory Augmentation through Language-Encoded Egocentric Perception
We depend on our own memory to encode, store, and retrieve our experiences. However,
memory lapses can occur. One promising avenue for achieving memory augmentation is …
memory lapses can occur. One promising avenue for achieving memory augmentation is …
A memory model for question answering from streaming data supported by rehearsal and anticipation of coreference information
Existing question answering methods often assume that the input content (eg, documents or
videos) is always accessible to solve the task. Alternatively, memory networks were …
videos) is always accessible to solve the task. Alternatively, memory networks were …
Taohighlight: Commodity-aware multi-modal video highlight detection in e-commerce
In e-commerce, product related video is important content to introduce product
characteristics and attract consumers. Especially in the recommendation system of e …
characteristics and attract consumers. Especially in the recommendation system of e …
Track-On: Transformer-based Online Point Tracking with Memory
In this paper, we consider the problem of long-term point tracking, which requires consistent
identification of points across multiple frames in a video, despite changes in appearance …
identification of points across multiple frames in a video, despite changes in appearance …
Encode-Store-Retrieve: Augmenting Human Memory through Language-Encoded Egocentric Perception
We depend on our own memory to encode, store, and retrieve our experiences. However,
memory lapses can occur. One promising avenue for achieving memory augmentation is …
memory lapses can occur. One promising avenue for achieving memory augmentation is …
Dual Memory Structure for Memory Augmented Neural Networks for Question-Answering Tasks
Memory is crucial for machine learning tasks on sequential data. From vanilla RNN to LSTM
and memory augmented neural networks, researchers have investigated several types of …
and memory augmented neural networks, researchers have investigated several types of …