Machine learning methods for stylometry
J Savoy - Cham: Springer, 2020 - Springer
With the recent progress made in network and computing technology, the ubiquity of data,
and textual repositories freely available, the scientific practice evolves towards a more data …
and textual repositories freely available, the scientific practice evolves towards a more data …
Cross-language information retrieval
Two key assumptions shape the usual view of ranked retrieval:(1) that the searcher can
choose words for their query that might appear in the documents that they wish to see, and …
choose words for their query that might appear in the documents that they wish to see, and …
CLIRMatrix: A massively large collection of bilingual and multilingual datasets for Cross-Lingual Information Retrieval
We present CLIRMatrix, a massively large collection of bilingual and multilingual datasets
for Cross-Lingual Information Retrieval extracted automatically from Wikipedia. CLIRMatrix …
for Cross-Lingual Information Retrieval extracted automatically from Wikipedia. CLIRMatrix …
Cross-lingual information retrieval with BERT
Multiple neural language models have been developed recently, eg, BERT and XLNet, and
achieved impressive results in various NLP tasks including sentence classification, question …
achieved impressive results in various NLP tasks including sentence classification, question …
Training effective neural CLIR by bridging the translation gap
We introduce Smart Shuffling, a cross-lingual embedding (CLE) method that draws from
statistical word alignment approaches to leverage dictionaries, producing dense …
statistical word alignment approaches to leverage dictionaries, producing dense …
Mind the gap: Cross-lingual information retrieval with hierarchical knowledge enhancement
Abstract Cross-Lingual Information Retrieval (CLIR) aims to rank the documents written in a
language different from the user's query. The intrinsic gap between different languages is an …
language different from the user's query. The intrinsic gap between different languages is an …
Efficiency-Effectiveness Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval
Probabilistic Structured Queries (PSQ) is a cross-language information retrieval (CLIR)
method that uses translation probabilities statistically derived from aligned corpora. PSQ is a …
method that uses translation probabilities statistically derived from aligned corpora. PSQ is a …
Physical Reservoir Computing Using van der Waals Ferroelectrics for Acoustic Keyword Spotting
Y Cao, Z Zhang, BW Qin, W Sang, H Li, T Wang… - ACS …, 2024 - ACS Publications
Acoustic keyword spotting (KWS) plays a pivotal role in the voice-activated systems of
artificial intelligence (AI), allowing for hands-free interactions between humans and smart …
artificial intelligence (AI), allowing for hands-free interactions between humans and smart …
[PDF][PDF] Learning a Sparse Representation Model for Neural CLIR.
In monolingual retrieval, sparse representations learned atop BERT-style models offer a
complementary approach to the unsupervised BM25 model. Inspired by this line of work, we …
complementary approach to the unsupervised BM25 model. Inspired by this line of work, we …
Weakly supervised attentional model for low resource ad-hoc cross-lingual information retrieval
We propose a weakly supervised neural model for Ad-hoc Cross-lingual Information
Retrieval (CLIR) from low-resource languages. Low resource languages often lack …
Retrieval (CLIR) from low-resource languages. Low resource languages often lack …