AMMUS: A survey of transformer-based pretrained models in natural language processing

KS Kalyan, A Rajasekharan, S Sangeetha - arXiv preprint arXiv …, 2021 - arxiv.org
Transformer-based pretrained language models (T-PTLMs) have achieved great success in
almost every NLP task. The evolution of these models started with GPT and BERT. These …

Pre-trained language models for text generation: A survey

J Li, T Tang, WX Zhao, JY Nie, JR Wen - ACM Computing Surveys, 2024 - dl.acm.org
Text Generation aims to produce plausible and readable text in human language from input
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …

A multitask, multilingual, multimodal evaluation of ChatGPT on reasoning, hallucination, and interactivity

Y Bang, S Cahyawijaya, N Lee, W Dai, D Su… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper proposes a framework for quantitatively evaluating interactive LLMs such as
ChatGPT using publicly available data sets. We carry out an extensive technical evaluation …

NusaCrowd: Open source initiative for Indonesian NLP resources

S Cahyawijaya, H Lovenia, AF Aji… - Findings of the …, 2023 - aclanthology.org
We present NusaCrowd, a collaborative initiative to collect and unify existing resources for
Indonesian languages, including opening access to previously non-public resources …

Language models are few-shot multilingual learners

GI Winata, A Madotto, Z Lin, R Liu, J Yosinski… - arXiv preprint arXiv …, 2021 - arxiv.org
General-purpose language models have demonstrated impressive capabilities, performing
on par with state-of-the-art approaches on a range of downstream natural language …

End-to-end transformer-based models in textual-based NLP

A Rahali, MA Akhloufi - AI, 2023 - mdpi.com
Transformer architectures are highly expressive because they use self-attention
mechanisms to encode long-range dependencies in the input sequences. In this paper, we …

A systematic review of transformer-based pre-trained language models through self-supervised learning

E Kotei, R Thirunavukarasu - Information, 2023 - mdpi.com
Transfer learning is a technique utilized in deep learning applications to transmit learned
inference to a different target domain. The approach is mainly to solve the problem of a few …

One country, 700+ languages: NLP challenges for underrepresented languages and dialects in Indonesia

AF Aji, GI Winata, F Koto, S Cahyawijaya… - arXiv preprint arXiv …, 2022 - arxiv.org
NLP research is impeded by a lack of resources and awareness of the challenges presented
by underrepresented languages and dialects. Focusing on the languages spoken in …

Negative object presence evaluation (NOPE) to measure object hallucination in vision-language models

H Lovenia, W Dai, S Cahyawijaya, Z Ji… - arXiv preprint arXiv …, 2023 - arxiv.org
Object hallucination poses a significant challenge in vision-language (VL) models, often
leading to the generation of nonsensical or unfaithful responses with non-existent objects …

A few thousand translations go a long way! Leveraging pre-trained models for African news translation

DI Adelani, JO Alabi, A Fan, J Kreutzer, X Shen… - arXiv preprint arXiv …, 2022 - arxiv.org
Recent advances in the pre-training of language models leverage large-scale datasets to
create multilingual models. However, low-resource languages are mostly left out in these …