Knowledge distillation: A survey

J Gou, B Yu, SJ Maybank, D Tao - International Journal of Computer Vision, 2021 - Springer
In recent years, deep neural networks have been successful in both industry and academia,
especially for computer vision tasks. The great success of deep learning is mainly due to its …
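Most of the entries below build on the same basic objective that this survey reviews. As a point of reference only, here is a minimal sketch of the standard soft-target distillation loss (hard cross-entropy plus a temperature-scaled KL term, in the style popularized by Hinton et al.); the PyTorch code, the function name kd_loss, and the default temperature and alpha values are illustrative assumptions, not taken from the survey.

    import torch.nn.functional as F

    def kd_loss(student_logits, teacher_logits, labels, temperature=2.0, alpha=0.5):
        # Hard-label term: ordinary cross-entropy against the gold labels.
        ce = F.cross_entropy(student_logits, labels)
        # Soft-label term: KL divergence between the temperature-softened
        # teacher and student distributions, scaled by T^2 so the gradient
        # magnitude stays comparable across temperatures.
        t = temperature
        soft_teacher = F.softmax(teacher_logits / t, dim=-1)
        log_soft_student = F.log_softmax(student_logits / t, dim=-1)
        kl = F.kl_div(log_soft_student, soft_teacher, reduction="batchmean") * (t * t)
        return alpha * ce + (1.0 - alpha) * kl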

Sequence-level knowledge distillation

Y Kim, AM Rush - arXiv preprint arXiv:1606.07947, 2016 - arxiv.org
Neural machine translation (NMT) offers a novel alternative formulation of translation that is
potentially simpler than statistical approaches. However, to reach competitive performance …
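Since the snippet cuts off before the method, it may help to state what "sequence-level" means here, in the usual notation (teacher distribution q, student p; summarized from the paper, with the beam-search approximation it describes). Word-level KD matches per-token distributions, while sequence-level KD trains the student on the teacher's own decoded output:

    \mathcal{L}_{\text{word-KD}} = -\sum_{t}\sum_{k \in \mathcal{V}} q(y_t = k \mid x, y_{<t}) \, \log p(y_t = k \mid x, y_{<t})

    \mathcal{L}_{\text{seq-KD}} \approx -\log p(\hat{y} \mid x), \qquad \hat{y} \approx \arg\max_{y} q(y \mid x) \ \text{(teacher beam-search output)}

In practice this amounts to re-decoding the training set with the teacher and training the student on those outputs with the ordinary sequence negative log-likelihood.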

BAM! Born-again multi-task networks for natural language understanding

K Clark, MT Luong, U Khandelwal, CD Manning… - arXiv preprint arXiv …, 2019 - arxiv.org
It can be challenging to train multi-task neural networks that outperform or even match their
single-task counterparts. To help address this, we propose using knowledge distillation …

Massively multilingual transfer for NER

A Rahimi, Y Li, T Cohn - arXiv preprint arXiv:1902.00193, 2019 - arxiv.org
In cross-lingual transfer, NLP models over one or more source languages are applied to a
low-resource target language. While most prior work has used a single source model or a …

Head-driven phrase structure grammar parsing on Penn Treebank

J Zhou, H Zhao - arXiv preprint arXiv:1907.02684, 2019 - arxiv.org
Head-driven phrase structure grammar (HPSG) enjoys a uniform formalism representing rich
contextual syntactic and even semantic meanings. This paper makes the first attempt to …

Graph-based dependency parsing with graph neural networks

T Ji, Y Wu, M Lan - Proceedings of the 57th Annual Meeting of the …, 2019 - aclanthology.org
We investigate the problem of efficiently incorporating high-order features into neural graph-
based dependency parsing. Instead of explicitly extracting high-order features from …

Rethinking self-attention: Towards interpretability in neural parsing

K Mrini, F Dernoncourt, Q Tran, T Bui, W Chang… - arXiv preprint arXiv …, 2019 - arxiv.org
Attention mechanisms have improved the performance of NLP tasks while allowing models
to remain explainable. Self-attention is currently widely used; however, interpretability is …

Model compression with two-stage multi-teacher knowledge distillation for web question answering system

Z Yang, L Shou, M Gong, W Lin, D Jiang - Proceedings of the 13th …, 2020 - dl.acm.org
Deep pre-training and fine-tuning models (such as BERT and OpenAI GPT) have
demonstrated excellent results in question answering areas. However, due to the sheer …
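The two-stage pipeline itself is not visible in the snippet, so the following is only a generic illustration of the multi-teacher ingredient of such a setup: average the teachers' temperature-softened distributions into a single soft target and reuse the same KL term as in the sketch above. The function name, the uniform teacher weighting, and the defaults are assumptions, not the authors' method.

    import torch
    import torch.nn.functional as F

    def multi_teacher_kd_loss(student_logits, teacher_logits_list, labels,
                              temperature=2.0, alpha=0.5):
        t = temperature
        # Uniformly average the teachers' softened output distributions.
        soft_target = torch.stack(
            [F.softmax(logits / t, dim=-1) for logits in teacher_logits_list]
        ).mean(dim=0)
        log_soft_student = F.log_softmax(student_logits / t, dim=-1)
        kl = F.kl_div(log_soft_student, soft_target, reduction="batchmean") * (t * t)
        ce = F.cross_entropy(student_logits, labels)
        return alpha * ce + (1.0 - alpha) * kl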

What do recurrent neural network grammars learn about syntax?

A Kuncoro, M Ballesteros, L Kong, C Dyer… - arXiv preprint arXiv …, 2016 - arxiv.org
Recurrent neural network grammars (RNNG) are a recently proposed probabilistic
generative modeling family for natural language. They show state-of-the-art language …

Deep multitask learning for semantic dependency parsing

H Peng, S Thomson, NA Smith - arXiv preprint arXiv:1704.06855, 2017 - arxiv.org
We present a deep neural architecture that parses sentences into three semantic
dependency graph formalisms. By using efficient, nearly arc-factored inference and a …