- Academic Search

N Fuhr - Acm sigir forum, 2018 - dl.acm.org

This paper points out some mistakes that can be frequently found in IR publications: MRR
and ERR violate basic requirements for a metric, MAP is based on unrealistic assumptions …

Simpan Kutip Dirujuk 175 kali Artikel terkait 4 versi

[Free GPT-4]

[PDF] sigir.org

On Fuhr's guideline for IR evaluation

T Sakai - ACM SIGIR Forum, 2021 - dl.acm.org

In the December 2017 issue of SIGIR Forum, Fuhr presented ten" Thou Shalt Not" s (ie,
warnings against bad practices) for IR experimenters. While his article provides a lot of good …

Simpan Kutip Dirujuk 51 kali Artikel terkait 4 versi

[Free GPT-4]

[PDF] google.com

Topic difficulty: Collection and query formulation effects

JS Culpepper, G Faggioli, N Ferro… - ACM Transactions on …, 2021 - dl.acm.org

Several recent studies have explored the interaction effects between topics, systems,
corpora, and components when measuring retrieval effectiveness. However, all of these …

Simpan Kutip Dirujuk 31 kali Artikel terkait 5 versi

[Free GPT-4]

[PDF] nii.ac.jp

[PDF][PDF] Overview of the NTCIR-12 Short Text Conversation Task.

L Shang, T Sakai, Z Lu, H Li, R Higashinaka, Y Miyao - NTCIR, 2016 - research.nii.ac.jp

We give an overview of the NII Testbeds and Community for Information access Research
(NTCIR)-13 Short Text Conversation (STC) task, which was a core task of NTCIR-13. At …

Simpan Kutip Dirujuk 92 kali Artikel terkait 5 versi Versi HTML

Which Diversity Evaluation Measures Are" Good"?

T Sakai, Z Zeng - Proceedings of the 42nd international ACM SIGIR …, 2019 - dl.acm.org

This study evaluates 30 IR evaluation measures or their instances, of which nine are for
adhoc IR and 21 are for diversified IR, primarily from the viewpoint of whether their …

Simpan Kutip Dirujuk 50 kali Artikel terkait 5 versi

[Free GPT-4]

[PDF] unipd.it

[PDF][PDF] Overview of the NTCIR-15 we want web with CENTRE (WWW-3) task

T Sakai, S Tao, Z Zeng, Y Zheng, J Mao, Z Chu… - Proceedings of NTCIR …, 2020 - dei.unipd.it

This is an overview of the NTCIR-15 We Want Web with CENTRE (WWW-3) task. The task
features the Chinese subtask (adhoc web search) and the English subtask (adhoc web …

Simpan Kutip Dirujuk 35 kali Artikel terkait 10 versi Versi HTML

Retrieval evaluation measures that agree with users' SERP preferences: Traditional, preference-based, and diversity measures

T Sakai, Z Zeng - ACM Transactions on Information Systems (TOIS), 2020 - dl.acm.org

We examine the “goodness” of ranked retrieval evaluation measures in terms of how well
they align with users' Search Engine Result Page (SERP) preferences for web search. The …

Simpan Kutip Dirujuk 28 kali Artikel terkait 4 versi

[Free GPT-4]

[PDF] psu.edu

Statistical reform in information retrieval?

T Sakai - ACM SIGIR Forum, 2014 - dl.acm.org

IR revolves around evaluation. Therefore, IR researchers should employ sound evaluation
practices. Nowadays many of us know that statistical significance testing is not enough, but …

Simpan Kutip Dirujuk 93 kali Artikel terkait 6 versi

[Free GPT-4]

[PDF] nsf.gov

A reference-dependent model for web search evaluation: Understanding and measuring the experience of boundedly rational users

N Chen, J Liu, T Sakai - Proceedings of the ACM web conference 2023, 2023 - dl.acm.org

Previous researches demonstrate that users' actions in search interaction are associated
with relative gains and losses to reference points, known as the reference dependence …

Simpan Kutip Dirujuk 15 kali Artikel terkait 5 versi

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] User behavior modeling for web search evaluation

F Zhang, Y Liu, J Mao, M Zhang, S Ma - AI Open, 2020 - Elsevier

Search engines are widely used in our daily life. Batch evaluation of the performance of
search systems to their users has always been an essential issue in the field of information …

Simpan Kutip Dirujuk 15 kali Artikel terkait

Buat notifikasi

Kutip

Penelusuran lanjutan

Disimpan ke Koleksi saya

Metrics, statistics, tests

Some common mistakes in IR evaluation, and how they can be avoided

On Fuhr's guideline for IR evaluation

Topic difficulty: Collection and query formulation effects

[PDF][PDF] Overview of the NTCIR-12 Short Text Conversation Task.

Which Diversity Evaluation Measures Are" Good"?

[PDF][PDF] Overview of the NTCIR-15 we want web with CENTRE (WWW-3) task

Retrieval evaluation measures that agree with users' SERP preferences: Traditional, preference-based, and diversity measures

Statistical reform in information retrieval?

A reference-dependent model for web search evaluation: Understanding and measuring the experience of boundedly rational users

[HTML][HTML] User behavior modeling for web search evaluation