Scaling up influence functions
We address efficient calculation of influence functions for tracking predictions back to the
training data. We propose and analyze a new approach to speeding up the inverse Hessian …
Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM's Translation Capability
Large, multilingual language models exhibit surprisingly good zero- or few-shot machine
translation capabilities, despite having never seen the intentionally-included translation …
AboutMe: Using self-descriptions in webpages to document the effects of English pretraining data filters
Large language models' (LLMs) abilities are drawn from their pretraining data, and model
development begins with data curation. However, decisions around what data is retained or …
[BOOK][B] The Routledge handbook of language contact
The Routledge Handbook of Language Contact provides an overview of the state of the art
of current research in contact linguistics. Presenting contact linguistics as an established …
Towards end-to-end multilingual question answering
Multilingual question answering (MLQA) is a critical part of an accessible natural language
interface. However, current solutions demonstrate performance far below that of …
Training data augmentation for code-mixed translation
Machine translation of user-generated code-mixed inputs to English is of crucial
importance in applications like web search and targeted advertising. We address the …
Subword-level language identification for intra-word code-switching
Language identification for code-switching (CS), the phenomenon of alternating between
two or more languages in conversations, has traditionally been approached under the …
[HTML][HTML] Language identification of intra-word code-switching for Arabic–English
Multilingual speakers tend to mix different languages in text and speech; a phenomenon
referred to by linguists as “code-switching” (CS). Also, speakers switch between morphemes …
Entity-switched datasets: An approach to auditing the in-domain robustness of named entity recognition models
Named entity recognition systems perform well on standard datasets comprising English
news. But given the paucity of data, it is difficult to draw conclusions about the robustness of …
Two languages, one treebank: building a Turkish–German code-switching treebank and its challenges
This paper presents the SAGT Turkish–German code-switching treebank, and observations
and annotation challenges we encountered during its development. The treebank consists …