Machine learning methods for stylometry

J Savoy - Cham: Springer, 2020 - Springer
With the recent progress made in network and computing technology, the ubiquity of data,
and textual repositories freely available, the scientific practice evolves towards a more data …

Cross-language information retrieval

P Galuščáková, DW Oard, S Nair - arxiv preprint arxiv:2111.05988, 2021 - arxiv.org
Two key assumptions shape the usual view of ranked retrieval:(1) that the searcher can
choose words for their query that might appear in the documents that they wish to see, and …

CLIRMatrix: A massively large collection of bilingual and multilingual datasets for Cross-Lingual Information Retrieval

S Sun, K Duh - Proceedings of the 2020 Conference on Empirical …, 2020 - aclanthology.org
We present CLIRMatrix, a massively large collection of bilingual and multilingual datasets
for Cross-Lingual Information Retrieval extracted automatically from Wikipedia. CLIRMatrix …

Cross-lingual information retrieval with BERT

Z Jiang, A El-Jaroudi, W Hartmann, D Karakos… - arxiv preprint arxiv …, 2020 - arxiv.org
Multiple neural language models have been developed recently, eg, BERT and XLNet, and
achieved impressive results in various NLP tasks including sentence classification, question …

Training effective neural CLIR by bridging the translation gap

H Bonab, SM Sarwar, J Allan - … of the 43rd International ACM SIGIR …, 2020 - dl.acm.org
We introduce Smart Shuffling, a cross-lingual embedding (CLE) method that draws from
statistical word alignment approaches to leverage dictionaries, producing dense …

Mind the gap: Cross-lingual information retrieval with hierarchical knowledge enhancement

F Zhang, Z Zhang, X Ao, D Gao, F Zhuang… - Proceedings of the …, 2022 - ojs.aaai.org
Abstract Cross-Lingual Information Retrieval (CLIR) aims to rank the documents written in a
language different from the user's query. The intrinsic gap between different languages is an …

Efficiency-Effectiveness Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval

E Yang, S Nair, D Lawrie, J Mayfield, DW Oard… - arxiv preprint arxiv …, 2024 - arxiv.org
Probabilistic Structured Queries (PSQ) is a cross-language information retrieval (CLIR)
method that uses translation probabilities statistically derived from aligned corpora. PSQ is a …

Physical Reservoir Computing Using van der Waals Ferroelectrics for Acoustic Keyword Spotting

Y Cao, Z Zhang, BW Qin, W Sang, H Li, T Wang… - ACS …, 2024 - ACS Publications
Acoustic keyword spotting (KWS) plays a pivotal role in the voice-activated systems of
artificial intelligence (AI), allowing for hands-free interactions between humans and smart …

[PDF][PDF] Learning a Sparse Representation Model for Neural CLIR.

S Nair, E Yang, DJ Lawrie, J Mayfield, DW Oard - DESIRES, 2022 - user.eng.umd.edu
In monolingual retrieval, sparse representations learned atop BERT-style models offer a
complementary approach to the unsupervised BM25 model. Inspired by this line of work, we …

Weakly supervised attentional model for low resource ad-hoc cross-lingual information retrieval

L Zhao, R Zbib, Z Jiang, D Karakos… - Proceedings of the 2nd …, 2019 - aclanthology.org
We propose a weakly supervised neural model for Ad-hoc Cross-lingual Information
Retrieval (CLIR) from low-resource languages. Low resource languages often lack …