Analysis methods in neural language processing: A survey

Y Belinkov, J Glass - … of the Association for Computational Linguistics, 2019 - direct.mit.edu
The field of natural language processing has seen impressive progress in recent years, with
neural network models replacing many of the traditional systems. A plethora of new models …

Pretraining with artificial language: Studying transferable knowledge in language models

R Ri, Y Tsuruoka - arXiv preprint arXiv:2203.10326, 2022 - arxiv.org
We investigate what kind of structural knowledge learned in neural network encoders is
transferable to processing natural language. We design artificial languages with structural …
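
For a concrete sense of what an artificial language with structural properties can look like, here is a minimal sketch assuming a toy nesting-dependency grammar; the symbol pairs, depth limit, and recursion probability are illustrative choices for this sketch, not details taken from the paper.

import random

# Toy generator for an artificial language with nested (bracket-like)
# dependencies, in the spirit of pretraining corpora built from synthetic
# grammars. The symbol pairs, depth limit, and recursion probability are
# arbitrary choices made for this sketch.

PAIRS = [("a", "A"), ("b", "B"), ("c", "C")]   # matched opener/closer pairs

def generate(max_depth=4, p_recurse=0.6):
    """Recursively generate one nested-dependency string as a token list."""
    if max_depth == 0 or random.random() > p_recurse:
        return []
    opener, closer = random.choice(PAIRS)
    inner = generate(max_depth - 1, p_recurse)
    # Each opener is closed by its matching closer, creating a nested dependency.
    return [opener] + inner + [closer]

if __name__ == "__main__":
    random.seed(0)
    for _ in range(5):
        print(" ".join(generate()) or "<empty>")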

What do end-to-end speech models learn about speaker, language and channel information? A layer-wise and neuron-level analysis

SA Chowdhury, N Durrani, A Ali - Computer Speech & Language, 2024 - Elsevier
Deep neural networks are inherently opaque and challenging to interpret. Unlike hand-
crafted feature-based models, we struggle to comprehend the concepts learned and how …

On evaluating the generalization of LSTM models in formal languages

M Suzgun, Y Belinkov, SM Shieber - arXiv preprint arXiv:1811.01001, 2018 - arxiv.org
Recurrent Neural Networks (RNNs) are theoretically Turing-complete and have established
themselves as a dominant model for language processing. Yet, there still remains an …
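
As a rough illustration of this kind of formal-language evaluation, the following sketch trains a small LSTM as a next-symbol predictor on the counter language a^n b^n and then tests it on longer, unseen lengths; the architecture, hyperparameters, and length splits are assumptions made for illustration, not the authors' setup.

import torch
import torch.nn as nn

# Minimal sketch (not the authors' exact setup): train a small LSTM as a
# next-symbol predictor on the counter language a^n b^n and check whether it
# generalizes to longer strings than seen in training.
# Vocabulary: 0 = 'a', 1 = 'b', 2 = end-of-string marker.

def make_example(n):
    s = [0] * n + [1] * n + [2]                       # a^n b^n followed by EOS
    return torch.tensor(s[:-1]), torch.tensor(s[1:])  # inputs, next-symbol targets

class CharLSTM(nn.Module):
    def __init__(self, vocab=3, hidden=16):
        super().__init__()
        self.emb = nn.Embedding(vocab, hidden)
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)
        self.out = nn.Linear(hidden, vocab)

    def forward(self, x):
        h, _ = self.lstm(self.emb(x).unsqueeze(0))
        return self.out(h).squeeze(0)

model = CharLSTM()
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()

# Train on short strings only (n = 1..10).
for epoch in range(200):
    for n in range(1, 11):
        x, y = make_example(n)
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()

# Evaluate on longer, unseen lengths. Inside the a-block the next symbol is
# genuinely ambiguous ('a' or 'b'), so per-token accuracy below 1.0 is
# expected; what matters is whether the b-block and the EOS are predicted
# correctly, i.e. whether the model has learned to count.
with torch.no_grad():
    for n in (5, 20, 50):
        x, y = make_example(n)
        pred = model(x).argmax(dim=-1)
        acc = (pred == y).float().mean().item()
        print(f"n={n}: per-token accuracy = {acc:.2f}")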

LSTMs compose (and learn) bottom-up

N Saphra, A Lopez - arXiv preprint arXiv:2010.04650, 2020 - arxiv.org
Recent work in NLP shows that LSTM language models capture hierarchical structure in
language data. In contrast to existing work, we consider the learning process that …

Diversity as a by-product: Goal-oriented language generation leads to linguistic variation

S Schüz, T Han, S Zarrieß - … of the 22nd Annual Meeting of the …, 2021 - aclanthology.org
The ability to vary language use is necessary for speakers to achieve their
conversational goals, for instance when referring to objects in visual environments. We …

Automatically Extracting Challenge Sets for Non-local Phenomena in Neural Machine Translation

L Choshen, O Abend - arXiv preprint arXiv:1909.06814, 2019 - arxiv.org
We show that the state-of-the-art Transformer Machine Translation (MT) model is not biased
towards monotonic reordering (unlike previous recurrent neural network models), but that …

Analogical inference from distributional structure: What recurrent neural networks can tell us about word learning

PA Huebner, JA Willits - Machine Learning with Applications, 2023 - Elsevier
One proposal that can explain the remarkable pace of word learning in young children is
that they leverage the language-internal distributional similarity of familiar and novel words …
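
A generic illustration of what language-internal distributional similarity means, assuming simple window-based co-occurrence counts and cosine similarity; the toy corpus and the novel word "dax" are invented here, and the paper itself approaches the question with recurrent neural network language models.

import numpy as np
from collections import defaultdict

# Generic illustration of "language-internal distributional similarity":
# build window-based co-occurrence vectors from a toy corpus and compare a
# novel word to familiar ones by cosine similarity. The corpus and the novel
# word "dax" are invented for this sketch.

corpus = [
    "the dog chased the ball",
    "the cat chased the ball",
    "the dax chased the ball",   # "dax" appears in a familiar noun context
    "the dog ate the food",
    "the cat ate the food",
]

# Count co-occurrences within a +/-2 word window.
vocab = sorted({w for sent in corpus for w in sent.split()})
index = {w: i for i, w in enumerate(vocab)}
counts = defaultdict(lambda: np.zeros(len(vocab)))
for sent in corpus:
    words = sent.split()
    for i, w in enumerate(words):
        for j in range(max(0, i - 2), min(len(words), i + 3)):
            if j != i:
                counts[w][index[words[j]]] += 1

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# A novel word that occurs in the same contexts as familiar nouns ends up
# distributionally closer to those nouns than to the verb.
for w in ("dog", "cat", "ball", "chased"):
    print(f"sim(dax, {w}) = {cosine(counts['dax'], counts[w]):.2f}")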

Language models learn POS first

N Saphra, A Lopez - … and Interpreting Neural Networks for NLP, 2018 - research.ed.ac.uk
A glut of recent research shows that language models capture linguistic structure. Linzen et
al. (2016) found that LSTM-based language models may encode syntactic information …
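
For context, one common way to test whether hidden states encode part-of-speech information is a diagnostic probe; the sketch below shows only that generic recipe (a linear probe over LSTM states), not the authors' specific analysis, and its untrained LSTM and hand-tagged toy corpus are stand-ins.

import numpy as np
import torch
import torch.nn as nn
from sklearn.linear_model import LogisticRegression

# Generic diagnostic-probing sketch, not the authors' specific analysis:
# extract per-token hidden states from an LSTM and fit a linear probe to
# predict POS tags from them. The LSTM here is untrained and the corpus is a
# tiny hand-tagged toy; in a real study the states would come from a trained
# language model and the tags from an annotated corpus.

sentences = [
    ("the dog runs", ["DET", "NOUN", "VERB"]),
    ("a cat sleeps", ["DET", "NOUN", "VERB"]),
    ("the bird sings", ["DET", "NOUN", "VERB"]),
]
vocab = {w: i for i, w in enumerate(sorted({w for s, _ in sentences for w in s.split()}))}
tagset = {t: i for i, t in enumerate(sorted({t for _, tags in sentences for t in tags}))}

emb = nn.Embedding(len(vocab), 32)
lstm = nn.LSTM(32, 32, batch_first=True)

features, labels = [], []
with torch.no_grad():
    for sent, tags in sentences:
        ids = torch.tensor([[vocab[w] for w in sent.split()]])
        hidden, _ = lstm(emb(ids))                    # shape (1, seq_len, 32)
        features.append(hidden.squeeze(0).numpy())
        labels.extend(tagset[t] for t in tags)

X, y = np.concatenate(features), np.array(labels)

# If POS is linearly decodable from the hidden states, the probe's accuracy
# beats the majority-class baseline (here both are computed on the same toy
# data, so this is only a shape-level illustration of the recipe).
probe = LogisticRegression(max_iter=1000).fit(X, y)
print("probe accuracy:    ", probe.score(X, y))
print("majority baseline: ", np.bincount(y).max() / len(y))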

How LSTM encodes syntax: Exploring context vectors and semi-quantization on natural text

C Shibata, K Uchiumi, D Mochihashi - arXiv preprint arXiv:2010.00363, 2020 - arxiv.org
The Long Short-Term Memory recurrent neural network (LSTM) is widely used and known to
capture informative long-term syntactic dependencies. However, how such information is …
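
One way questions about context vectors can be made concrete is to measure how close their activations come to discrete values; the sketch below shows such a measurement on an untrained LSTM with random inputs, purely as a self-contained stand-in rather than the paper's method.

import torch
import torch.nn as nn

# Illustrative sketch only: one way to look for "semi-quantization" in LSTM
# context vectors is to measure how often activations sit near discrete
# values (here, within 0.1 of -1, 0, or +1). The paper analyzes trained
# models on natural text; an untrained LSTM over random inputs is used below
# purely as a self-contained stand-in.

torch.manual_seed(0)
lstm = nn.LSTM(input_size=8, hidden_size=64, batch_first=True)
x = torch.randn(1, 200, 8)               # stand-in for an embedded token sequence

with torch.no_grad():
    hidden_states, (h_n, c_n) = lstm(x)  # hidden states lie in (-1, 1)
    squashed_cell = torch.tanh(c_n)      # squash the final cell state into (-1, 1)
    acts = torch.cat([hidden_states.flatten(), squashed_cell.flatten()])

near_discrete = ((acts - acts.round()).abs() < 0.1).float().mean().item()
print(f"fraction of activations within 0.1 of a discrete value: {near_discrete:.2f}")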