Attention in natural language processing

A Galassi, M Lippi, P Torroni - IEEE transactions on neural …, 2020 - ieeexplore.ieee.org
Attention is an increasingly popular mechanism used in a wide range of neural
architectures. The mechanism itself has been realized in a variety of formats. However …
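As a point of reference for the variety of formats the survey covers, here is a minimal NumPy sketch of scaled dot-product attention, the form most variants build on; the function and shapes are illustrative, not drawn from the survey itself.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)      # (n_queries, n_keys) compatibility scores
    weights = softmax(scores, axis=-1)   # each query's weights sum to 1
    return weights @ V                   # weighted average of the values

rng = np.random.default_rng(0)
Q = rng.normal(size=(2, 4))              # 2 queries of dimension 4
K = rng.normal(size=(3, 4))              # 3 keys
V = rng.normal(size=(3, 4))              # 3 matching values
print(attention(Q, K, V).shape)          # (2, 4): one output per query
```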

Energy and policy considerations for modern deep learning research

E Strubell, A Ganesh, A McCallum - … of the AAAI conference on artificial …, 2020 - ojs.aaai.org
The field of artificial intelligence has experienced a dramatic methodological shift towards
large neural networks trained on plentiful data. This shift has been fueled by recent …

What does BERT look at? An analysis of BERT's attention

K Clark, U Khandelwal, O Levy, CD Manning - arXiv preprint arXiv …, 2019 - arxiv.org
Large pre-trained neural networks such as BERT have had great recent success in NLP,
motivating a growing body of research investigating what aspects of language they are able …
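This kind of inspection can be approximated with the Hugging Face transformers library (a stand-in here, not the authors' original code): request the per-head attention maps from a pre-trained BERT and look at where each token attends.

```python
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

inputs = tokenizer("The cat sat on the mat.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs, output_attentions=True)

# outputs.attentions is a tuple with one tensor per layer,
# each of shape (batch, num_heads, seq_len, seq_len).
layer, head = 0, 0
attn = outputs.attentions[layer][0, head]
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
for i, tok in enumerate(tokens):
    j = attn[i].argmax().item()
    print(f"{tok:>8} attends most to {tokens[j]}")
```

Even a small probe like this surfaces the kinds of patterns the paper reports, such as heads that attend heavily to delimiter tokens like [SEP] or to a token's immediate neighbors.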

Multimodal transformer for unaligned multimodal language sequences

YHH Tsai, S Bai, PP Liang, JZ Kolter… - Proceedings of the …, 2019 - pmc.ncbi.nlm.nih.gov
Human language is often multimodal, comprising a mixture of natural language,
facial gestures, and acoustic behaviors. However, two major challenges in modeling such …
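At the heart of the Multimodal Transformer (MulT) is crossmodal attention: one modality's features act as queries while another modality supplies the keys and values, so the two sequences need not be time-aligned. A minimal sketch using PyTorch's built-in attention module (illustrative dimensions, not the authors' implementation):

```python
import torch
import torch.nn as nn

d_model, n_heads = 32, 4
cross_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

text  = torch.randn(1, 10, d_model)   # 10 text tokens
audio = torch.randn(1, 25, d_model)   # 25 acoustic frames, unaligned with the text

# Text queries attend over the audio sequence: the output keeps the text
# sequence's length but mixes in audio information.
fused, weights = cross_attn(query=text, key=audio, value=audio)
print(fused.shape, weights.shape)  # torch.Size([1, 10, 32]) torch.Size([1, 10, 25])
```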

Are sixteen heads really better than one?

P Michel, O Levy, G Neubig - Advances in neural …, 2019 - proceedings.neurips.cc
Multi-headed attention is a driving force behind recent state-of-the-art NLP models. By
applying multiple attention mechanisms in parallel, it can express sophisticated functions …
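The paper's central experiment can be sketched as a head-ablation test: zero out one head of a multi-head attention layer and measure how much the output changes. The implementation below is illustrative, not the authors' code; all weights are random stand-ins.

```python
import torch

def multi_head_attention(x, W_qkv, W_o, n_heads, head_mask=None):
    """x: (seq, d_model); head_mask: (n_heads,) of 0/1 entries to ablate heads."""
    seq, d_model = x.shape
    d_head = d_model // n_heads
    q, k, v = (x @ W_qkv).chunk(3, dim=-1)              # each (seq, d_model)
    split = lambda t: t.view(seq, n_heads, d_head).transpose(0, 1)
    q, k, v = split(q), split(k), split(v)              # (n_heads, seq, d_head)
    scores = (q @ k.transpose(-2, -1)) / d_head ** 0.5  # (n_heads, seq, seq)
    out = scores.softmax(dim=-1) @ v                    # (n_heads, seq, d_head)
    if head_mask is not None:                           # zero out ablated heads
        out = out * head_mask.view(n_heads, 1, 1)
    out = out.transpose(0, 1).reshape(seq, d_model)     # concatenate heads
    return out @ W_o

torch.manual_seed(0)
d_model, n_heads, seq = 16, 4, 5
x = torch.randn(seq, d_model)
W_qkv = torch.randn(d_model, 3 * d_model)
W_o = torch.randn(d_model, d_model)

full = multi_head_attention(x, W_qkv, W_o, n_heads)
mask = torch.ones(n_heads)
mask[0] = 0.0                                           # ablate head 0
ablated = multi_head_attention(x, W_qkv, W_o, n_heads, head_mask=mask)
print((full - ablated).abs().mean())                    # impact of removing one head
```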

Attention, please! A survey of neural attention models in deep learning

A de Santana Correia, EL Colombini - Artificial Intelligence Review, 2022 - Springer
In humans, attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …

Natural language processing advancements by deep learning: A survey

A Torfi, RA Shirvani, Y Keneshloo, N Tavaf… - arXiv preprint arXiv …, 2020 - arxiv.org
Natural Language Processing (NLP) empowers intelligent machines by improving their
understanding of human language for linguistic-based human-computer …

What do you learn from context? Probing for sentence structure in contextualized word representations

I Tenney, P Xia, B Chen, A Wang, A Poliak… - arXiv preprint arXiv …, 2019 - arxiv.org
Contextualized representation models such as ELMo (Peters et al., 2018a) and BERT
(Devlin et al., 2018) have recently achieved state-of-the-art results on a diverse array of …
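The probing setup the paper uses can be sketched in a few lines: a small linear classifier is trained on frozen contextual representations to test what structure they linearly encode. Here synthetic vectors stand in for ELMo/BERT features, and the label set is an illustrative placeholder.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n_tokens, dim = 200, 64
# Pretend these came from a frozen encoder; the probe never updates them.
features = rng.normal(size=(n_tokens, dim))
labels = rng.integers(0, 3, size=n_tokens)       # e.g. NOUN / VERB / OTHER

probe = LogisticRegression(max_iter=1000).fit(features[:150], labels[:150])
print("probe accuracy:", probe.score(features[150:], labels[150:]))
# High held-out accuracy would suggest the representation linearly encodes
# the label; with random features, accuracy stays near chance.
```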

Simple BERT models for relation extraction and semantic role labeling

P Shi, J Lin - arXiv preprint arXiv:1904.05255, 2019 - arxiv.org
We present simple BERT-based models for relation extraction and semantic role labeling. In
recent years, state-of-the-art performance has been achieved using neural models by …
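The "simple BERT" recipe, with labeling tasks cast as token classification, amounts to a pre-trained encoder plus a single classification layer. A sketch using Hugging Face transformers as a stand-in for the authors' implementation:

```python
import torch
from transformers import BertForTokenClassification, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
# num_labels is a placeholder; the classification head is freshly initialized
# here and would be fine-tuned on labeled SRL or relation data in practice.
model = BertForTokenClassification.from_pretrained("bert-base-uncased", num_labels=5)
model.eval()

inputs = tokenizer("The cat chased the mouse.", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits          # (1, seq_len, num_labels)
print(logits.argmax(-1))                     # predicted tag id per wordpiece
```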

HIBERT: Document-level pre-training of hierarchical bidirectional transformers for document summarization

X Zhang, F Wei, M Zhou - arXiv preprint arXiv:1905.06566, 2019 - arxiv.org
Neural extractive summarization models usually employ a hierarchical encoder for
document encoding and are trained using sentence-level labels, which are created …
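The hierarchical setup the paper builds on can be sketched as follows: a sentence-level encoder pools word vectors into sentence vectors, a document-level encoder contextualizes them, and a classifier produces one keep/drop score per sentence. Layer sizes and names below are illustrative, not HIBERT's configuration.

```python
import torch
import torch.nn as nn

class HierarchicalExtractor(nn.Module):
    def __init__(self, vocab=1000, d=32):
        super().__init__()
        self.emb = nn.Embedding(vocab, d)
        self.sent_enc = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d, nhead=4, batch_first=True), 1)
        self.doc_enc = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d, nhead=4, batch_first=True), 1)
        self.clf = nn.Linear(d, 1)

    def forward(self, doc):                  # doc: (n_sents, n_words) token ids
        words = self.sent_enc(self.emb(doc)) # (n_sents, n_words, d)
        sents = words.mean(dim=1)            # pool words -> sentence vectors
        sents = self.doc_enc(sents.unsqueeze(0)).squeeze(0)
        return self.clf(sents).squeeze(-1)   # one extraction score per sentence

doc = torch.randint(0, 1000, (5, 12))        # 5 sentences, 12 tokens each
scores = HierarchicalExtractor()(doc)
print(scores.shape)                          # torch.Size([5])
```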