Decision-focused learning: Foundations, state of the art, benchmark and future opportunities
Decision-focused learning (DFL) is an emerging paradigm that integrates machine learning
(ML) and constrained optimization to enhance decision quality by training ML models in an …
(ML) and constrained optimization to enhance decision quality by training ML models in an …
Deep learning for image colorization: Current and future prospects
Image colorization, as an essential problem in computer vision (CV), has attracted an
increasing amount of researchers attention in recent years, especially deep learning-based …
increasing amount of researchers attention in recent years, especially deep learning-based …
Self-chained image-language model for video localization and question answering
Recent studies have shown promising results on utilizing large pre-trained image-language
models for video question answering. While these image-language models can efficiently …
models for video question answering. While these image-language models can efficiently …
Dynamic neural networks: A survey
Dynamic neural network is an emerging research topic in deep learning. Compared to static
models which have fixed computational graphs and parameters at the inference stage …
models which have fixed computational graphs and parameters at the inference stage …
Ts2-net: Token shift and selection transformer for text-video retrieval
Text-Video retrieval is a task of great practical value and has received increasing attention,
among which learning spatial-temporal video representation is one of the research hotspots …
among which learning spatial-temporal video representation is one of the research hotspots …
Vision transformer with attentive pooling for robust facial expression recognition
Facial Expression Recognition (FER) in the wild is an extremely challenging task. Recently,
some Vision Transformers (ViT) have been explored for FER, but most of them perform …
some Vision Transformers (ViT) have been explored for FER, but most of them perform …
Efficient video transformers with spatial-temporal token selection
Video transformers have achieved impressive results on major video recognition
benchmarks, which however suffer from high computational cost. In this paper, we present …
benchmarks, which however suffer from high computational cost. In this paper, we present …
Kvq: Kwai video quality assessment for short-form videos
Short-form UGC video platforms like Kwai and TikTok have been an emerging and
irreplaceable mainstream media form thriving on user-friendly engagement and …
irreplaceable mainstream media form thriving on user-friendly engagement and …
[HTML][HTML] Deep learning in computational dermatopathology of melanoma: A technical systematic literature review
Deep learning (DL) has become one of the major approaches in computational
dermatopathology, evidenced by a significant increase in this topic in the current literature …
dermatopathology, evidenced by a significant increase in this topic in the current literature …
Differentiable zooming for multiple instance learning on whole-slide images
Abstract Multiple Instance Learning (MIL) methods have become increasingly popular for
classifying gigapixel-sized Whole-Slide Images (WSIs) in digital pathology. Most MIL …
classifying gigapixel-sized Whole-Slide Images (WSIs) in digital pathology. Most MIL …