Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Multimodal learning with transformers: A survey
Transformer is a promising neural network learner, and has achieved great success in
various machine learning tasks. Thanks to the recent prevalence of multimodal applications …
various machine learning tasks. Thanks to the recent prevalence of multimodal applications …
How to reuse and compose knowledge for a lifetime of tasks: A survey on continual learning and functional composition
A major goal of artificial intelligence (AI) is to create an agent capable of acquiring a general
understanding of the world. Such an agent would require the ability to continually …
understanding of the world. Such an agent would require the ability to continually …
Dynamically transformed instance normalization network for generalizable person re-identification
Existing person re-identification methods often suffer significant performance degradation on
unseen domains, which fuels interest in domain generalizable person re-identification (DG …
unseen domains, which fuels interest in domain generalizable person re-identification (DG …
Interpretability for reliable, efficient, and self-cognitive DNNs: From theories to applications
In recent years, remarkable achievements have been made in artificial intelligence tasks
and applications based on deep neural networks (DNNs), especially in the fields of vision …
and applications based on deep neural networks (DNNs), especially in the fields of vision …
CX-ToM: Counterfactual explanations with theory-of-mind for enhancing human trust in image recognition models
We propose CX-ToM, short for counterfactual explanations with theory-of-mind, a new
explainable AI (XAI) framework for explaining decisions made by a deep convolutional …
explainable AI (XAI) framework for explaining decisions made by a deep convolutional …
Knowledge-augmented deep learning and its applications: A survey
Deep learning models, though having achieved great success in many different fields over
the past years, are usually data-hungry, fail to perform well on unseen samples, and lack …
the past years, are usually data-hungry, fail to perform well on unseen samples, and lack …
Reconstructing action-conditioned human-object interactions using commonsense knowledge priors
We present a method for inferring diverse 3D models of human-object interactions from
images. Reasoning about how humans interact with objects in complex scenes from a single …
images. Reasoning about how humans interact with objects in complex scenes from a single …
Eqa-mx: Embodied question answering using multimodal expression
Humans predominantly use verbal utterances and nonverbal gestures (eg, eye gaze and
pointing gestures) in their natural interactions. For instance, pointing gestures and verbal …
pointing gestures) in their natural interactions. For instance, pointing gestures and verbal …
Compositional Substitutivity of Visual Reasoning for Visual Question Answering
Compositional generalization has received much attention in vision-and-language and
visual reasoning recently. Substitutivity, the capability to generalize to novel compositions …
visual reasoning recently. Substitutivity, the capability to generalize to novel compositions …
Patron: perspective-aware multitask model for referring expression grounding using embodied multimodal cues
Humans naturally use referring expressions with verbal utterances and nonverbal gestures
to refer to objects and events. As these referring expressions can be interpreted differently …
to refer to objects and events. As these referring expressions can be interpreted differently …