AutoML: A survey of the state-of-the-art

X He, K Zhao, X Chu - Knowledge-Based Systems, 2021 - Elsevier
Deep learning (DL) techniques have obtained remarkable achievements on various tasks,
such as image recognition, object detection, and language modeling. However, building a …

[PDF][PDF] Language models are unsupervised multitask learners

A Radford, J Wu, R Child, D Luan… - OpenAI …, 2019 - insightcivic.s3.us-east-1.amazonaws …
Natural language processing tasks, such as question answering, machine translation,
reading comprehension, and summarization, are typically approached with supervised …

[HTML][HTML] Augmenting organizational decision-making with deep learning algorithms: Principles, promises, and challenges

YR Shrestha, V Krishna, G von Krogh - Journal of Business Research, 2021 - Elsevier
The current expansion of theory and research on artificial intelligence in management and
organization studies has revitalized the theory and research on decision-making in …

Cost-efficient large language model serving for multi-turn conversations with CachedAttention

B Gao, Z He, P Sharma, Q Kang, D Jevdjic… - 2024 USENIX Annual …, 2024 - usenix.org
Interacting with humans through multi-turn conversations is a fundamental feature of large
language models (LLMs). However, existing LLM serving engines executing multi-turn …

Big code != big vocabulary: Open-vocabulary models for source code

RM Karampatsis, H Babii, R Robbes, C Sutton… - Proceedings of the …, 2020 - dl.acm.org
Statistical language modeling techniques have successfully been applied to large source
code corpora, yielding a variety of new software development tools, such as tools for code …

BPE-dropout: Simple and effective subword regularization

I Provilkov, D Emelianenko, E Voita - arXiv preprint arXiv:1910.13267, 2019 - arxiv.org
Subword segmentation is widely used to address the open vocabulary problem in machine
translation. The dominant approach to subword segmentation is Byte Pair Encoding (BPE) …

Charformer: Fast character transformers via gradient-based subword tokenization

Y Tay, VQ Tran, S Ruder, J Gupta, HW Chung… - arXiv preprint arXiv …, 2021 - arxiv.org
State-of-the-art models in natural language processing rely on separate rigid subword
tokenization algorithms, which limit their generalization ability and adaptation to new …

Barack's wife Hillary: Using knowledge-graphs for fact-aware language modeling

RL Logan IV, NF Liu, ME Peters, M Gardner… - arXiv preprint arXiv …, 2019 - arxiv.org
Modeling human language requires the ability to not only generate fluent text but also
encode factual knowledge. However, traditional language models are only capable of …

Representation degeneration problem in training natural language generation models

J Gao, D He, X Tan, T Qin, L Wang, TY Liu - arXiv preprint arXiv …, 2019 - arxiv.org
We study an interesting problem in training neural network-based models for natural
language generation tasks, which we call the representation degeneration problem …

Event knowledge in large language models: the gap between the impossible and the unlikely

C Kauf, AA Ivanova, G Rambelli, E Chersoni… - Cognitive …, 2023 - Wiley Online Library
Word co-occurrence patterns in language corpora contain a surprising amount of
conceptual knowledge. Large language models (LLMs), trained to predict words in context …