Weakly supervised object localization and detection: A survey
As an emerging and challenging problem in the computer vision community, weakly
supervised object localization and detection plays an important role for developing new …
Advances in deep concealed scene understanding
Concealed scene understanding (CSU) is a hot computer vision topic aiming to perceive
objects exhibiting camouflage. The current boom in terms of techniques and applications …
AnyDoor: Zero-shot object-level image customization
This work presents AnyDoor, a diffusion-based image generator with the power to teleport
target objects to new scenes at user-specified locations with desired shapes. Instead of …
Tracking anything with decoupled video segmentation
Training data for video segmentation are expensive to annotate. This impedes extensions of
end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary …
Segment anything is not always perfect: An investigation of SAM on different real-world applications
Recently, Meta AI Research approaches a general, promptable segment anything
model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B) …
XMem: Long-term video object segmentation with an Atkinson-Shiffrin memory model
We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …
MVImgNet: A large-scale dataset of multi-view images
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of "learning from large-scale data" in computer vision …
Visual attention network
While originally designed for natural language processing tasks, the self-attention
mechanism has recently taken various computer vision areas by storm. However, the 2D …
Putting the object back into video object segmentation
We present Cutie, a video object segmentation (VOS) network with object-level memory
reading which puts the object representation from memory back into the video object …
VisionLLM v2: An end-to-end generalist multimodal large language model for hundreds of vision-language tasks
We present VisionLLM v2, an end-to-end generalist multimodal large model (MLLM) that
unifies visual perception, understanding, and generation within a single framework. Unlike …