Google Scholar

Pre-trained language models for text generation: A survey

J Li, T Tang, WX Zhao, JY Nie, JR Wen - ACM Computing Surveys, 2024 - dl.acm.org

Text Generation aims to produce plausible and readable text in human language from input
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …

Cited by 430

[PDF] arxiv.org

NusaCrowd: Open source initiative for Indonesian NLP resources

S Cahyawijaya, H Lovenia, AF Aji, GI Winata… - arxiv preprint arxiv …, 2022 - arxiv.org

We present NusaCrowd, a collaborative initiative to collect and unify existing resources for
Indonesian languages, including opening access to previously non-public resources …

Cited by 1109

Towards robust automated math problem solving: a survey of statistical and deep learning approaches

A Saraf, P Kamat, S Gite, S Kumar, K Kotecha - Evolutionary Intelligence, 2024 - Springer

Automated mathematical problem-solving represents a unique intersection of natural
language processing (NLP) and mathematical reasoning, posing significant challenges in …

Cited by 1

[PDF] arxiv.org

Naamapadam: A large-scale named entity annotated data for Indic languages

A Mhaske, H Kedia, S Doddapaneni… - arxiv preprint arxiv …, 2022 - arxiv.org

We present Naamapadam, the largest publicly available Named Entity Recognition (NER)
dataset for the 11 major Indian languages from two language families. The dataset contains …

Cited by 22

[PDF] arxiv.org

Airavata: Introducing Hindi instruction-tuned LLM

J Gala, T Jayakumar, JA Husain, MSUR Khan… - arxiv preprint arxiv …, 2024 - arxiv.org

We announce the initial release of "Airavata," an instruction-tuned LLM for Hindi. Airavata
was created by fine-tuning OpenHathi with diverse, instruction-tuning Hindi datasets to make …

Cited by 13

[PDF] arxiv.org

mEdIT: Multilingual text editing via instruction tuning

V Raheja, D Alikaniotis, V Kulkarni, B Alhafni… - arxiv preprint arxiv …, 2024 - arxiv.org

We introduce mEdIT, a multi-lingual extension to CoEdIT--the recent state-of-the-art text
editing models for writing assistance. mEdIT models are trained by fine-tuning multi-lingual …

Cited by 4

[PDF] aclanthology.org

Dolphin: A challenging and diverse benchmark for Arabic NLG

A Elmadany, A El-Shangiti… - Findings of the …, 2023 - aclanthology.org

We present Dolphin, a novel benchmark that addresses the need for a natural language
generation (NLG) evaluation framework dedicated to the wide collection of Arabic …

Cited by 9

[PDF] arxiv.org

PMIndiaSum: Multilingual and cross-lingual headline summarization for languages in India

A Urlana, P Chen, Z Zhao, SB Cohen… - arxiv preprint arxiv …, 2023 - arxiv.org

This paper introduces PMIndiaSum, a multilingual and massively parallel summarization
corpus focused on languages in India. Our corpus provides a training and testing ground for …

Cited by 8

[PDF] arxiv.org

Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages

R Aralikatte, Z Cheng, S Doddapaneni… - arxiv preprint arxiv …, 2023 - arxiv.org

We present Vārta, a large-scale multilingual dataset for headline generation in Indic
languages. This dataset includes 41.8 million news articles in 14 different Indic languages …

Cited by 8

[PDF] arxiv.org

Building pre-train LLM dataset for the Indic languages: a case study on Hindi

S Parida, S Panwar, K Lata, S Mishra… - arxiv preprint arxiv …, 2024 - arxiv.org

Large language models (LLMs) demonstrated transformative capabilities in many
applications that require automatically generating responses based on human instruction …

Cited by 4

IndicNLG benchmark: Multilingual datasets for diverse NLG tasks in Indic languages

Pre-trained language models for text generation: A survey