Overview of the CLEF-2023 LongEval lab on longitudinal evaluation of model performance

R Alkhalifa, I Bilal, H Borkakoty… - … Conference of the Cross …, 2023 - Springer
We describe the first edition of the LongEval CLEF 2023 shared task. This lab evaluates the
temporal persistence of Information Retrieval (IR) systems and Text Classifiers. Task 1 …

On Fuhr's guideline for IR evaluation

T Sakai - ACM SIGIR Forum, 2021 - dl.acm.org
In the December 2017 issue of SIGIR Forum, Fuhr presented ten "Thou Shalt Not"s (i.e.,
warnings against bad practices) for IR experimenters. While his article provides a lot of good …

Deep neural network regression for normalized digital surface model generation with Sentinel-2 imagery

K Müller, R Leppich, C Geiß, V Borst… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
In recent history, normalized digital surface models (nDSMs) have been constantly gaining
importance as a means to solve large-scale geographic problems. High-resolution surface …

Feature Attention Distillation Defense for Backdoor Attack in Artificial Neural Network-Based Electricity Theft Detection

S Li, W Meng, C Liu, S He - IEEE Internet of Things Journal, 2024 - ieeexplore.ieee.org
Artificial neural networks (ANNs) have been widely used for tasks like electricity theft
detection (ETD) in smart meters. However, due to the subtle mechanisms and inherent …

Modelling the interrelationships between potential risk factors and childhood Co-morbidity of Malaria, Anaemia, and stunting in children less than five years in Burundi

RT Gaston, S Ramroop, F Habyarimana - Heliyon, 2024 - cell.com
Background: Anaemia, malaria, and stunting remain health problems, especially in children
younger than five years, and those conditions are linked to morbidity and mortality. The main …

Detecting significant differences between information retrieval systems via generalized linear models

G Faggioli, N Ferro, N Fuhr - Proceedings of the 31st ACM international …, 2022 - dl.acm.org
Being able to compare Information Retrieval (IR) systems correctly is pivotal to improving
their quality. Among the most popular tools for statistical significance testing, we list t-test …

Towards the evaluation of information retrieval systems on evolving datasets with pivot systems

GN González-Sáez, P Mulhem, L Goeuriot - Experimental IR Meets …, 2021 - Springer
Evaluation of information retrieval systems follows the Cranfield paradigm, where
the evaluation of several IR systems relies on a common evaluation environment (test …

Towards meaningful statements in IR evaluation: Mapping evaluation measures to interval scales

M Ferrante, N Ferro, N Fuhr - IEEE Access, 2021 - ieeexplore.ieee.org
Information Retrieval (IR) is a discipline deeply rooted in evaluation since its inception.
Indeed, experimentally measuring and statistically validating the performance of IR systems …

Exploratory visualization tool for the continuous evaluation of information retrieval systems

G González-Sáez, P Galuscáková, R Deveaud… - Proceedings of the 46th …, 2023 - dl.acm.org
This paper introduces a novel visualization tool that facilitates the exploratory analysis of
continuous evaluation for information retrieval systems. We base our analysis on score …

Fairness-based evaluation of conversational search: A pilot study

T Sakai - Proceedings of EVIA, 2023 - repository.nii.ac.jp
NTCIR-17 introduced the FairWeb-1 task, which evaluated web page rankings
in terms of both relevance and group fairness. The present study shows how their evaluation …