Decision-focused learning: Foundations, state of the art, benchmark and future opportunities

J Mandi, J Kotary, S Berden, M Mulamba… - Journal of Artificial …, 2024 - jair.org
Decision-focused learning (DFL) is an emerging paradigm that integrates machine learning
(ML) and constrained optimization to enhance decision quality by training ML models in an …

Deep learning for image colorization: Current and future prospects

S Huang, X Jin, Q Jiang, L Liu - Engineering Applications of Artificial …, 2022 - Elsevier
Image colorization, as an essential problem in computer vision (CV), has attracted an
increasing amount of researchers' attention in recent years, especially deep learning-based …

Self-chained image-language model for video localization and question answering

S Yu, J Cho, P Yadav, M Bansal - Advances in Neural …, 2024 - proceedings.neurips.cc
Recent studies have shown promising results on utilizing large pre-trained image-language
models for video question answering. While these image-language models can efficiently …

Dynamic neural networks: A survey

Y Han, G Huang, S Song, L Yang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Dynamic neural network is an emerging research topic in deep learning. Compared to static
models which have fixed computational graphs and parameters at the inference stage …

Ts2-net: Token shift and selection transformer for text-video retrieval

Y Liu, P Xiong, L Xu, S Cao, Q Jin - European conference on computer …, 2022 - Springer
Text-Video retrieval is a task of great practical value and has received increasing attention,
among which learning spatial-temporal video representation is one of the research hotspots …

Vision transformer with attentive pooling for robust facial expression recognition

F Xue, Q Wang, Z Tan, Z Ma… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Facial Expression Recognition (FER) in the wild is an extremely challenging task. Recently,
some Vision Transformers (ViT) have been explored for FER, but most of them perform …

Efficient video transformers with spatial-temporal token selection

J Wang, X Yang, H Li, L Liu, Z Wu, YG Jiang - European Conference on …, 2022 - Springer
Video transformers have achieved impressive results on major video recognition
benchmarks, which however suffer from high computational cost. In this paper, we present …

Kvq: Kwai video quality assessment for short-form videos

Y Lu, X Li, Y Pei, K Yuan, Q Xie, Y Qu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Short-form UGC video platforms like Kwai and TikTok have been an emerging and
irreplaceable mainstream media form thriving on user-friendly engagement and …

Deep learning in computational dermatopathology of melanoma: A technical systematic literature review

D Sauter, G Lodde, F Nensa, D Schadendorf… - Computers in biology …, 2023 - Elsevier
Deep learning (DL) has become one of the major approaches in computational
dermatopathology, evidenced by a significant increase in this topic in the current literature …

Differentiable zooming for multiple instance learning on whole-slide images

K Thandiackal, B Chen, P Pati, G Jaume… - … on Computer Vision, 2022 - Springer
Multiple Instance Learning (MIL) methods have become increasingly popular for
classifying gigapixel-sized Whole-Slide Images (WSIs) in digital pathology. Most MIL …