MogaNet: Multi-order gated aggregation network
By contextualizing the kernel as global as possible, Modern ConvNets have shown great
potential in computer vision tasks. However, recent progress on multi-order game …
5D Seismic data interpolation by continuous representation
How to represent a seismic wavefield? Traditionally, while seismic wavefields are
conceptualized continuously, acquisition geometries capture seismic data discretely using 2 …
Surface-VQMAE: Vector-quantized masked auto-encoders on molecular surfaces
Molecular surfaces imply fingerprints of interaction patterns between proteins. However, non-
equivalent efforts have been paid to incorporating the abundant protein surface information …
SemiReward: A general reward model for semi-supervised learning
Semi-supervised learning (SSL) has witnessed great progress with various improvements in
the self-training framework with pseudo labeling. The main challenge is how to distinguish …
VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling
Similar to natural language models, pre-trained genome language models are proposed to
capture the underlying intricacies within genomes with unsupervised sequence modeling …
Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning
Large language models equipped with retrieval-augmented generation (RAG) represent a
burgeoning field aimed at enhancing answering capabilities by leveraging external …
Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation
Recent developments underscore the potential of textual information in enhancing learning
models for a deeper understanding of medical visual semantics. However, language-guided …
Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation
Medical image segmentation remains a formidable challenge due to the label scarcity. Pre-
training Vision Transformer (ViT) through masked image modeling (MIM) on large-scale …
Interpretable and Generalizable Spatiotemporal Predictive Learning with Disentangled Consistency
In recent years, significant strides have been made in the field of spatiotemporal predictive
learning, a discipline that focuses on accurately forecasting future sequences based on …
Hybrid Self-Supervised and Semi-Supervised Framework for Robust Spatio-Temporal Action Detection
T Yan - 2024 IEEE 7th International Conference on Automation …, 2024 - ieeexplore.ieee.org
This paper presents a novel Hybrid Self-Supervised and Semi-Supervised Framework for
Robust Spatio-Temporal Action Detection, which integrates the advantages of self …