Google 학술 검색

J Chen, H Dong, X Wang, F Feng, M Wang… - ACM Transactions on …, 2023 - dl.acm.org

While recent years have witnessed a rapid growth of research papers on recommender
system (RS), most of the papers focus on inventing machine learning models to better fit …

저장 인용 936회 인용 관련 학술자료 전체 6개의 버전

[Free GPT-4]

[PDF] acm.org

DRN: A deep reinforcement learning framework for news recommendation

G Zheng, F Zhang, Z Zheng, Y **ang, NJ Yuan… - Proceedings of the …, 2018 - dl.acm.org

In this paper, we propose a novel Deep Reinforcement Learning framework for news
recommendation. Online personalized news recommendation is a highly challenging …

저장 인용 888회 인용 관련 학술자료 전체 6개의 버전

[Free GPT-4]

[PDF] acm.org

Unbiased learning-to-rank with biased feedback

T Joachims, A Swaminathan, T Schnabel - Proceedings of the tenth …, 2017 - dl.acm.org

Implicit feedback (eg, clicks, dwell times, etc.) is an abundant source of data in human-
interactive systems. While implicit feedback has many advantages (eg, it is inexpensive to …

저장 인용 609회 인용 관련 학술자료 전체 13개의 버전

[Free GPT-4]

[PDF] jmlr.org

[PDF][PDF] Batch learning from logged bandit feedback through counterfactual risk minimization

A Swaminathan, T Joachims - The Journal of Machine Learning Research, 2015 - jmlr.org

We develop a learning principle and an efficient algorithm for batch learning from logged
bandit feedback. This learning setting is ubiquitous in online systems (eg, ad placement …

[Free GPT-4]

[PDF] psu.edu

[책][B] Click models for web search

A Chuklin, I Markov, M De Rijke - 2022 - books.google.com

With the rapid growth of web search in recent years the problem of modeling its users has
started to attract more and more attention of the information retrieval community. This has …

저장 인용 495회 인용 관련 학술자료 전체 6개의 버전 도서관 검색

[Free GPT-4]

[PDF] mlr.press

Counterfactual risk minimization: Learning from logged bandit feedback

A Swaminathan, T Joachims - International Conference on …, 2015 - proceedings.mlr.press

We develop a learning principle and an efficient algorithm for batch learning from logged
bandit feedback. This learning setting is ubiquitous in online systems (eg, ad placement …

[Free GPT-4]

[PDF] psu.edu

Counterfactual estimation and optimization of click metrics in search engines: A case study

L Li, S Chen, J Kleban, A Gupta - … of the 24th International Conference on …, 2015 - dl.acm.org

Optimizing an interactive system against a predefined online metric is particularly
challenging, especially when the metric is computed from user feedback such as clicks and …

저장 인용 258회 인용 관련 학술자료 전체 4개의 버전

[Free GPT-4]

[PDF] nowpublishers.com

Efficient and effective tree-based and neural learning to rank

S Bruch, C Lucchese, FM Nardini - Foundations and Trends® …, 2023 - nowpublishers.com

As information retrieval researchers, we not only develop algorithmic solutions to hard
problems, but we also insist on a proper, multifaceted evaluation of ideas. The literature on …

[Free GPT-4]

[PDF] arxiv.org

Correcting for selection bias in learning-to-rank systems

Z Ovaisi, R Ahsan, Y Zhang, K Vasilaky… - Proceedings of The Web …, 2020 - dl.acm.org

Click data collected by modern recommendation systems are an important source of
observational data that can be utilized to train learning-to-rank (LTR) systems. However …

저장 인용 121회 인용 관련 학술자료 전체 8개의 버전

[Free GPT-4]

[PDF] nowpublishers.com

A survey of query auto completion in information retrieval

F Cai, M De Rijke - Foundations and Trends® in Information …, 2016 - nowpublishers.com

In information retrieval, query auto completion (QAC), also known as typeahead [**ao et al.,
2013, Cai et al., 2014b] and auto-complete suggestion [Jain and Mishne, 2010], refers to the …

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Reusing historical interaction data for faster online learning to rank for IR

Bias and debias in recommender system: A survey and future directions

DRN: A deep reinforcement learning framework for news recommendation

Unbiased learning-to-rank with biased feedback

[PDF][PDF] Batch learning from logged bandit feedback through counterfactual risk minimization

[책][B] Click models for web search

Counterfactual risk minimization: Learning from logged bandit feedback

Counterfactual estimation and optimization of click metrics in search engines: A case study

Efficient and effective tree-based and neural learning to rank

Correcting for selection bias in learning-to-rank systems

A survey of query auto completion in information retrieval