- Academic Search

Decision-focused learning: Foundations, state of the art, benchmark and future opportunities

J Mandi, J Kotary, S Berden, M Mulamba… - Journal of Artificial …, 2024 - jair.org

Decision-focused learning (DFL) is an emerging paradigm that integrates machine learning
(ML) and constrained optimization to enhance decision quality by training ML models in an …

Cited by: 54 (3 versions)

Deep learning for image colorization: Current and future prospects

S Huang, X Jin, Q Jiang, L Liu - Engineering Applications of Artificial …, 2022 - Elsevier

Image colorization, as an essential problem in computer vision (CV), has attracted an
increasing amount of researchers' attention in recent years, especially deep learning-based …

Cited by: 67 (3 versions)

[PDF] neurips.cc

Self-chained image-language model for video localization and question answering

S Yu, J Cho, P Yadav, M Bansal - Advances in Neural …, 2024 - proceedings.neurips.cc

Recent studies have shown promising results on utilizing large pre-trained image-language
models for video question answering. While these image-language models can efficiently …

Cited by: 148 (7 versions)

[PDF] arxiv.org

Dynamic neural networks: A survey

Y Han, G Huang, S Song, L Yang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Dynamic neural network is an emerging research topic in deep learning. Compared to static
models which have fixed computational graphs and parameters at the inference stage …

Cited by: 787 (7 versions)

[PDF] arxiv.org

Ts2-net: Token shift and selection transformer for text-video retrieval

Y Liu, P Xiong, L Xu, S Cao, Q Jin - European conference on computer …, 2022 - Springer

Text-Video retrieval is a task of great practical value and has received increasing attention,
among which learning spatial-temporal video representation is one of the research hotspots …

Cited by: 141 (5 versions)

[PDF] arxiv.org

Vision transformer with attentive pooling for robust facial expression recognition

F Xue, Q Wang, Z Tan, Z Ma… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Facial Expression Recognition (FER) in the wild is an extremely challenging task. Recently,
some Vision Transformers (ViT) have been explored for FER, but most of them perform …

Cited by: 98 (5 versions)

[PDF] arxiv.org

Efficient video transformers with spatial-temporal token selection

J Wang, X Yang, H Li, L Liu, Z Wu, YG Jiang - European Conference on …, 2022 - Springer

Video transformers have achieved impressive results on major video recognition
benchmarks, which however suffer from high computational cost. In this paper, we present …

Cited by: 80 (5 versions)

[PDF] thecvf.com

Kvq: Kwai video quality assessment for short-form videos

Y Lu, X Li, Y Pei, K Yuan, Q Xie, Y Qu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Short-form UGC video platforms like Kwai and TikTok have been an emerging and
irreplaceable mainstream media form thriving on user-friendly engagement and …

Cited by: 11

[HTML] sciencedirect.com

[HTML] Deep learning in computational dermatopathology of melanoma: A technical systematic literature review

D Sauter, G Lodde, F Nensa, D Schadendorf… - Computers in biology …, 2023 - Elsevier

Deep learning (DL) has become one of the major approaches in computational
dermatopathology, evidenced by a significant increase in this topic in the current literature …

Cited by: 16 (4 versions)

[PDF] arxiv.org

Differentiable zooming for multiple instance learning on whole-slide images

K Thandiackal, B Chen, P Pati, G Jaume… - … on Computer Vision, 2022 - Springer

Multiple Instance Learning (MIL) methods have become increasingly popular for
classifying gigapixel-sized Whole-Slide Images (WSIs) in digital pathology. Most MIL …

Cited by: 44 (9 versions)
