Μελετητής Google

W Gu, S Bai, L Kong - Image and Vision Computing, 2022 - Elsevier

Image instance segmentation involves labeling pixels of images with classes and instances,
which is one of the pivotal technologies in many domains, such as natural scenes …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 167 Σχετικά άρθρα Όλες οι 2 εκδοχές

[Free GPT-4]

[PDF] thecvf.com

Generalized decoding for pixel, image, and language

X Zou, ZY Dou, J Yang, Z Gan, L Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present X-Decoder, a generalized decoding model that can predict pixel-level
segmentation and language tokens seamlessly. X-Decoder takes as input two types of …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 251 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] nowpublishers.com

Multimodal foundation models: From specialists to general-purpose assistants

C Li, Z Gan, Z Yang, J Yang, L Li… - … and Trends® in …, 2024 - nowpublishers.com

Neural compression is the application of neural networks and other machine learning
methods to data compression. Recent advances in statistical machine learning have opened …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 214 Σχετικά άρθρα Όλες οι 6 εκδοχές Αναζήτηση βιβλιοθήκης Προβολή ως HTML

[Free GPT-4]

[PDF] ecva.net

Petr: Position embedding transformation for multi-view 3d object detection

Y Liu, T Wang, X Zhang, J Sun - European Conference on Computer …, 2022 - Springer

In this paper, we develop position embedding transformation (PETR) for multi-view 3D
object detection. PETR encodes the position information of 3D coordinates into image …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 591 Σχετικά άρθρα Όλες οι 6 εκδοχές

[Free GPT-4]

[PDF] ieee.org

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 120 Σχετικά άρθρα Όλες οι 3 εκδοχές

[Free GPT-4]

[PDF] thecvf.com

Detrs with hybrid matching

D Jia, Y Yuan, H He, X Wu, H Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com

One-to-one set matching is a key design for DETR to establish its end-to-end capability, so
that object detection does not require a hand-crafted NMS (non-maximum suppression) to …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 220 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] mdpi.com

A survey of visual transformers

Y Liu, Y Zhang, Y Wang, F Hou, J Yuan… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Transformer, an attention-based encoder–decoder model, has already revolutionized the
field of natural language processing (NLP). Inspired by such significant achievements, some …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 453 Σχετικά άρθρα Όλες οι 22 εκδοχές

[Free GPT-4]

[PDF] baai.ac.cn

A survey on vision transformer

K Han, Y Wang, H Chen, X Chen, J Guo… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 2660 Σχετικά άρθρα Όλες οι 7 εκδοχές

[Free GPT-4]

[PDF] neurips.cc

Rank-DETR for high quality object detection

Y Pu, W Liang, Y Hao, Y Yuan… - Advances in …, 2024 - proceedings.neurips.cc

Modern detection transformers (DETRs) use a set of object queries to predict a list of
bounding boxes, sort them by their classification confidence scores, and select the top …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 57 Σχετικά άρθρα Όλες οι 5 εκδοχές Προβολή ως HTML

[Free GPT-4]

[PDF] arxiv.org

A survey on visual transformer

K Han, Y Wang, H Chen, X Chen, J Guo, Z Liu… - arxiv preprint arxiv …, 2020 - arxiv.org

Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 390 Σχετικά άρθρα Όλες οι 3 εκδοχές Προβολή ως HTML

Δημιουργία ειδοποίησης

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

Solq: Segmenting objects by learning queries

A review on 2D instance segmentation based on deep neural networks

Generalized decoding for pixel, image, and language

Multimodal foundation models: From specialists to general-purpose assistants

Petr: Position embedding transformation for multi-view 3d object detection

Transformer-based visual segmentation: A survey

Detrs with hybrid matching

A survey of visual transformers

A survey on vision transformer

Rank-DETR for high quality object detection

A survey on visual transformer