Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Metric index: An efficient and scalable solution for precise and approximate similarity search
Metric space is a universal and versatile model of similarity that can be applied in various
areas of information retrieval. However, a general, efficient, and scalable solution for metric …
areas of information retrieval. However, a general, efficient, and scalable solution for metric …
Fuzzy data operations
A Anderson - US Patent 8,484,215, 2013 - Google Patents
US PATENT DOCUMENTS 5,179,643 A 1/1993 Homma et al. 5,388,259 A 2f1995
Fleischman et al. 5,832,182 A 11/1998 Zhang et al. 6,026,398 A 2/2000 Brown et al …
Fleischman et al. 5,832,182 A 11/1998 Zhang et al. 6,026,398 A 2/2000 Brown et al …
Data clustering based on candidate queries
A Anderson, K Trojan - US Patent 9,361,355, 2016 - Google Patents
6,581,058 B1 6/2003 Fayyad et al. JP HO9-044518 2, 1997 6,658,626 B1 12/2003 Aiken JP
10275159 10, 1998 7,043,476 B2 5/2006 Robson JP 11184884 7, 1999 7,246,128 B2 …
10275159 10, 1998 7,043,476 B2 5/2006 Robson JP 11184884 7, 1999 7,246,128 B2 …
Data clustering, segmentation, and parallelization
A Anderson - US Patent 10,503,755, 2019 - Google Patents
(57) ABSTRACT A first set of original records is processed by a first processing entity to
generate a second set of records that includes the original records and one or more copies …
generate a second set of records that includes the original records and one or more copies …
Use of permutation prefixes for efficient and scalable approximate similarity search
A Esuli - Information Processing & Management, 2012 - Elsevier
We present the Permutation Prefix Index (this work is a revised and extended version of
Esuli (2009b), presented at the 2009 LSDS-IR Workshop, held in Boston)(PP-Index), an …
Esuli (2009b), presented at the 2009 LSDS-IR Workshop, held in Boston)(PP-Index), an …
Data clustering based on variant token networks
A Anderson - US Patent 9,037,589, 2015 - Google Patents
6,493,709 B1 12/2002 Aiken JP HO9-044518 2, 1997 6,581,058 B1 6/2003 Fayyad et al. JP
10275159 10, 1998 6,658,626 B1 12/2003 Aiken JP 11184884 7, 1999 7,043,476 B2 …
10275159 10, 1998 6,658,626 B1 12/2003 Aiken JP 11184884 7, 1999 7,043,476 B2 …
Ptolemaic access methods: Challenging the reign of the metric space model
Metric indexing is the state of the art in general distance-based retrieval. Relying on the
triangular inequality, metric indexes achieve significant online speed-up beyond a linear …
triangular inequality, metric indexes achieve significant online speed-up beyond a linear …
Fuzzy data operations
A Anderson - US Patent 9,607,103, 2017 - Google Patents
7.283, 999 B1 10/2007 Ramesh et al. JP HO9-044518 2, 1997 7.287, 019 B2 10/2007
Kapoor et al. JP 10275159 10, 1998 7,392.247 B2 6, 2008 Chen et al. JP H11-003342 1 …
Kapoor et al. JP 10275159 10, 1998 7,392.247 B2 6, 2008 Chen et al. JP H11-003342 1 …
Similarity caching in large-scale image retrieval
Feature-rich data, such as audio-video recordings, digital images, and results of scientific
experiments, nowadays constitute the largest fraction of the massive data sets produced …
experiments, nowadays constitute the largest fraction of the massive data sets produced …
Pivot-based approximate k-NN similarity joins for big high-dimensional data
Given an appropriate similarity model, the k-nearest neighbor similarity join represents a
useful yet costly operator for data mining, data analysis and data exploration applications …
useful yet costly operator for data mining, data analysis and data exploration applications …