Overview of Touché 2021: argument retrieval

A Bondarenko, L Gienapp, M Fröbe, M Beloucif… - Experimental IR Meets …, 2021 - Springer
This paper is a condensed report on the second year of the Touché shared task on
argument retrieval held at CLEF 2021. With the goal to provide a collaborative platform for …

Overview of Touché 2022: argument retrieval

A Bondarenko, M Fröbe, J Kiesel, S Syed… - … Conference of the Cross …, 2022 - Springer
This paper is a condensed report on the third year of the Touché lab on argument retrieval
held at CLEF 2022. With the goal to foster and support the development of technologies for …

FuzzyDedup: Secure fuzzy deduplication for cloud storage

T Jiang, X Yuan, Y Chen, K Cheng… - … on Dependable and …, 2022 - ieeexplore.ieee.org
Data deduplication is of critical importance to reduce the storage cost for clients and to
relieve the unnecessary storage pressure for cloud servers. While various techniques have …

BLEND: a fast, memory-efficient and accurate mechanism to find fuzzy seed matches in genome analysis

C Firtina, J Park, M Alser, JS Kim… - NAR Genomics and …, 2023 - academic.oup.com
Generating the hash values of short subsequences, called seeds, enables quickly
identifying similarities between genomic sequences by matching seeds with a single lookup …

Bootstrapped nDCG Estimation in the Presence of Unjudged Documents

M Fröbe, L Gienapp, M Potthast, M Hagen - European Conference on …, 2023 - Springer
Retrieval studies often reuse TREC collections after the corresponding tracks have passed.
Yet, a fair evaluation of new systems that retrieve documents outside the original judgment …

SimLESS: A Secure Deduplication System over Similar Data in Cloud Media Sharing

M Song, Z Hua, Y Zheng, T Xiang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
With the growing popularity of cloud computing, sharing media data through the cloud has
become a common practice. Due to high information redundancy, media data take up a …

Chuweb21D: A Deduped English Document Collection for Web Search Tasks

Z Chu, T Sakai, Q Ai, Y Liu - … of the Annual International ACM SIGIR …, 2023 - dl.acm.org
As a traditional information retrieval task, ad hoc web search has long been an important
part of IR research and evaluation tracks (e.g., TREC, NTCIR and CLEF). A crawled, large …

Noise-reduction for automatically transferred relevance judgments

M Fröbe, C Akiki, M Potthast, M Hagen - International Conference of the …, 2022 - Springer
The TREC Deep Learning tracks used MS MARCO Version 1 as their official
training data until 2020 and switched to Version 2 in 2021. For Version 2, all previously …

HDeFC: Hierarchical Secure Fuzzy Deduplication Based on Fog Computing

Z Tang, S Zeng, S Han, J Liu… - IEEE Internet of Things …, 2024 - ieeexplore.ieee.org
Secure deduplication not only optimizes cloud storage but also prevents data leakage.
However, traditional schemes are with high computation and communication costs to deal …

Analyzing the Web: Are Top Websites Lists a Good Choice for Research?

T Alby, R Jäschke - International Conference on Theory and Practice of …, 2022 - Springer
The web has been a subject of research since its beginning, but it is difficult if not impossible
to analyze the whole web, even if a database of all URLs would be freely accessible …