Recent advances of large-scale linear classification
Linear classification is a useful tool in machine learning and data mining. For some data in a
rich dimensional space, the performance (ie, testing accuracy) of linear classifiers has …
rich dimensional space, the performance (ie, testing accuracy) of linear classifiers has …
Hashing techniques: A survey and taxonomy
With the rapid development of information storage and networking technologies, quintillion
bytes of data are generated every day from social networks, business transactions, sensors …
bytes of data are generated every day from social networks, business transactions, sensors …
Binary grasshopper optimisation algorithm approaches for feature selection problems
Feature Selection (FS) is a challenging machine learning-related task that aims at reducing
the number of features by removing irrelevant, redundant and noisy data while maintaining …
the number of features by removing irrelevant, redundant and noisy data while maintaining …
Learning to hash for indexing big data—A survey
The explosive growth in Big Data has attracted much attention in designing efficient indexing
and search methods recently. In many critical applications such as large-scale search and …
and search methods recently. In many critical applications such as large-scale search and …
Discrete graph hashing
Hashing has emerged as a popular technique for fast nearest neighbor search in gigantic
databases. In particular, learning based hashing has received considerable attention due to …
databases. In particular, learning based hashing has received considerable attention due to …
One loss for quantization: Deep hashing with discrete wasserstein distributional matching
Image hashing is a principled approximate nearest neighbor approach to find similar items
to a query in a large collection of images. Hashing aims to learn a binary-output function that …
to a query in a large collection of images. Hashing aims to learn a binary-output function that …
Distributed hierarchical gpu parameter server for massive scale deep learning ads systems
Neural networks of ads systems usually take input from multiple resources, eg query-ad
relevance, ad features and user portraits. These inputs are encoded into one-hot or multi-hot …
relevance, ad features and user portraits. These inputs are encoded into one-hot or multi-hot …
Density sensitive hashing
Nearest neighbor search is a fundamental problem in various research fields like machine
learning, data mining and pattern recognition. Recently, hashing-based approaches, for …
learning, data mining and pattern recognition. Recently, hashing-based approaches, for …
AIBox: CTR prediction model training on a single node
As one of the major search engines in the world, Baidu's Sponsored Search has long
adopted the use of deep neural network (DNN) models for Ads click-through rate (CTR) …
adopted the use of deep neural network (DNN) models for Ads click-through rate (CTR) …
A novel Siamese deep hashing model for histopathology image retrieval
Content-based histopathology image retrieval can be a useful technique for help in
diagnosing various diseases. The process of retrieving images is often time-consuming and …
diagnosing various diseases. The process of retrieving images is often time-consuming and …