- Academic Search

Scaling up influence functions

A Schioppa, P Zablotskaia, D Vilar… - Proceedings of the AAAI …, 2022 - ojs.aaai.org

We address efficient calculation of influence functions for tracking predictions back to the
training data. We propose and analyze a new approach to speeding up the inverse Hessian …

Cited by 110

Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM's Translation Capability

E Briakou, C Cherry, G Foster - arXiv preprint arXiv:2305.10266, 2023 - arxiv.org

Large, multilingual language models exhibit surprisingly good zero- or few-shot machine
translation capabilities, despite having never seen the intentionally-included translation …

Cited by 53

AboutMe: Using self-descriptions in webpages to document the effects of English pretraining data filters

L Lucy, S Gururangan, L Soldaini, E Strubell… - arXiv preprint arXiv …, 2024 - arxiv.org

Large language models' (LLMs) abilities are drawn from their pretraining data, and model
development begins with data curation. However, decisions around what data is retained or …

Cited by 12

[Book] The Routledge handbook of language contact

E Adamou, Y Matras - 2021 - api.taylorfrancis.com

The Routledge Handbook of Language Contact provides an overview of the state of the art
of current research in contact linguistics. Presenting contact linguistics as an established …

Cited by 41

Towards end-to-end multilingual question answering

E Loginova, S Varanasi, G Neumann - Information Systems Frontiers, 2021 - Springer

Multilingual question answering (MLQA) is a critical part of an accessible natural language
interface. However, current solutions demonstrate performance far below that of …

Cited by 39

Training data augmentation for code-mixed translation

A Gupta, A Vavre, S Sarawagi - … of the 2021 Conference of the …, 2021 - aclanthology.org

Abstract Machine translation of user-generated code-mixed inputs to English is of crucial
importance in applications like web search and targeted advertising. We address the …

Cited by 26

Subword-level language identification for intra-word code-switching

M Mager, Ö Çetinoğlu, K Kann - arXiv preprint arXiv:1904.01989, 2019 - arxiv.org

Language identification for code-switching (CS), the phenomenon of alternating between
two or more languages in conversations, has traditionally been approached under the …

Cited by 35

Language identification of intra-word code-switching for Arabic–English

C Sabty, I Mesabah, Ö Çetinoğlu, S Abdennadher - Array, 2021 - Elsevier

Multilingual speakers tend to mix different languages in text and speech, a phenomenon
referred to by linguists as “code-switching” (CS). Also, speakers switch between morphemes …

Cited by 17

Entity-switched datasets: An approach to auditing the in-domain robustness of named entity recognition models

O Agarwal, Y Yang, BC Wallace, A Nenkova - arXiv preprint arXiv …, 2020 - arxiv.org

Named entity recognition systems perform well on standard datasets comprising English
news. But given the paucity of data, it is difficult to draw conclusions about the robustness of …

Cited by 27

Two languages, one treebank: building a Turkish–German code-switching treebank and its challenges

Ö Çetinoğlu, Ç Çöltekin - Language Resources and Evaluation, 2023 - Springer

This paper presents the SAGT Turkish–German code-switching treebank, and observations
and annotation challenges we encountered during its development. The treebank consists …

Cited by 10

A fast, compact, accurate model for language identification of codemixed text

Scaling up influence functions
