A tale of two efficient value iteration algorithms for solving linear mdps with large action space

Z Xu, Z Song, A Shrivastava - International Conference on …, 2023 - proceedings.mlr.press
Abstract Markov Decision Process (MDP) with large action space naturally occurs in many
applications such as language processing, information retrieval, and recommendation …

Rejection sampling for weighted jaccard similarity revisited

X Li, P Li - Proceedings of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
Efficiently computing the weighted Jaccard similarity has become an active research topic in
machine learning and theory. For sparse data, the standard technique is based on the …

Fast neural ranking on bipartite graph indices

S Tan, W Zhao, P Li - Proceedings of the VLDB Endowment, 2021 - dl.acm.org
Neural network based ranking has been widely adopted owing to its powerful capacity in
modeling complex relationships (eg, users and items, questions and answers). Online …

GCWSNet: Generalized consistent weighted sampling for scalable and accurate training of neural networks

P Li, W Zhao - Proceedings of the 31st ACM International Conference …, 2022 - dl.acm.org
We propose using" powered generalized min-max''(pGMM) hashed (linearized) via the"
generalized consistent weighted sampling''(GCWS) for training (deep) neural networks …

Constrained approximate similarity search on proximity graph

W Zhao, S Tan, P Li - arxiv preprint arxiv:2210.14958, 2022 - arxiv.org
Search engines and recommendation systems are built to efficiently display relevant
information from those massive amounts of candidates. Typically a three-stage mechanism …

EGM: enhanced graph-based model for large-scale video advertisement search

T Yu, J Liu, Y Yang, Y Li, H Fei, P Li - Proceedings of the 28th ACM …, 2022 - dl.acm.org
Video advertisements may grasp customers' attention instantly and are often adored by
advertisers. Since the corpus is vast, achieving an efficient query-to-video search can be …

Proximity graph maintenance for fast online nearest neighbor search

Z Xu, W Zhao, S Tan, Z Zhou, P Li - arxiv preprint arxiv:2206.10839, 2022 - arxiv.org
Approximate Nearest Neighbor (ANN) search is a fundamental technique for (eg,) the
deployment of recommender systems. Recent studies bring proximity graph-based methods …

Building K-Anonymous User Cohorts with Consecutive Consistent Weighted Sampling (CCWS)

X Zheng, W Zhao, X Li, P Li - Proceedings of the 46th International ACM …, 2023 - dl.acm.org
To retrieve personalized campaigns and creatives while protecting user privacy, digital
advertising is shifting from member-based identity to cohort-based identity. Under such …

Weighted Diversified Sampling for Efficient Data-Driven Single-Cell Gene-Gene Interaction Discovery

Y Wu, Y Yang, Z Liu, Z Li, K Pahwa, R Li… - arxiv preprint arxiv …, 2024 - arxiv.org
Gene-gene interactions play a crucial role in the manifestation of complex human diseases.
Uncovering significant gene-gene interactions is a challenging task. Here, we present an …

Pb-Hash: Partitioned b-bit Hashing

P Li, W Zhao - Proceedings of the 2024 ACM SIGIR International …, 2024 - dl.acm.org
Many hashing algorithms including minwise hashing (MinHash), one permutation hashing
(OPH), and consistent weighted sampling (CWS) generate integers of B bits. With k hashes …