Recent advances of large-scale linear classification

GX Yuan, CH Ho, CJ Lin - Proceedings of the IEEE, 2012 - ieeexplore.ieee.org
Linear classification is a useful tool in machine learning and data mining. For some data in a
rich dimensional space, the performance (ie, testing accuracy) of linear classifiers has …

Hashing techniques: A survey and taxonomy

L Chi, X Zhu - ACM Computing Surveys (Csur), 2017 - dl.acm.org
With the rapid development of information storage and networking technologies, quintillion
bytes of data are generated every day from social networks, business transactions, sensors …

Binary grasshopper optimisation algorithm approaches for feature selection problems

M Mafarja, I Aljarah, H Faris, AI Hammouri… - Expert Systems with …, 2019 - Elsevier
Feature Selection (FS) is a challenging machine learning-related task that aims at reducing
the number of features by removing irrelevant, redundant and noisy data while maintaining …

Learning to hash for indexing big data—A survey

J Wang, W Liu, S Kumar, SF Chang - Proceedings of the IEEE, 2015 - ieeexplore.ieee.org
The explosive growth in Big Data has attracted much attention in designing efficient indexing
and search methods recently. In many critical applications such as large-scale search and …

Discrete graph hashing

W Liu, C Mu, S Kumar… - Advances in neural …, 2014 - proceedings.neurips.cc
Hashing has emerged as a popular technique for fast nearest neighbor search in gigantic
databases. In particular, learning based hashing has received considerable attention due to …

One loss for quantization: Deep hashing with discrete wasserstein distributional matching

KD Doan, P Yang, P Li - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Image hashing is a principled approximate nearest neighbor approach to find similar items
to a query in a large collection of images. Hashing aims to learn a binary-output function that …

Distributed hierarchical gpu parameter server for massive scale deep learning ads systems

W Zhao, D **e, R Jia, Y Qian, R Ding… - … of Machine Learning …, 2020 - proceedings.mlsys.org
Neural networks of ads systems usually take input from multiple resources, eg query-ad
relevance, ad features and user portraits. These inputs are encoded into one-hot or multi-hot …

Density sensitive hashing

Z **, C Li, Y Lin, D Cai - IEEE transactions on cybernetics, 2013 - ieeexplore.ieee.org
Nearest neighbor search is a fundamental problem in various research fields like machine
learning, data mining and pattern recognition. Recently, hashing-based approaches, for …

AIBox: CTR prediction model training on a single node

W Zhao, J Zhang, D **e, Y Qian, R Jia, P Li - Proceedings of the 28th …, 2019 - dl.acm.org
As one of the major search engines in the world, Baidu's Sponsored Search has long
adopted the use of deep neural network (DNN) models for Ads click-through rate (CTR) …

A novel Siamese deep hashing model for histopathology image retrieval

SM Alizadeh, MS Helfroush, H Müller - Expert Systems with Applications, 2023 - Elsevier
Content-based histopathology image retrieval can be a useful technique for help in
diagnosing various diseases. The process of retrieving images is often time-consuming and …