Distributed submodular maximization: Identifying representative elements in massive data
Many large-scale machine learning problems (such as clustering, non-parametric learning,
kernel machines, etc.) require selecting, out of a massive data set, a manageable …
kernel machines, etc.) require selecting, out of a massive data set, a manageable …
Fast greedy algorithms in mapreduce and streaming
Greedy algorithms are practitioners' best friends—they are intuitive, are simple to implement,
and often lead to very good solutions. However, implementing greedy algorithms in a …
and often lead to very good solutions. However, implementing greedy algorithms in a …
Composable core-sets for diversity and coverage maximization
In this paper we consider efficient construction of" composable core-sets" for basic diversity
and coverage maximization problems. A core-set for a point-set in a metric space is a subset …
and coverage maximization problems. A core-set for a point-set in a metric space is a subset …
Randomized composable core-sets for distributed submodular maximization
An effective technique for solving optimization problems over massive data sets is to
partition the data into smaller pieces, solve the problem on each piece and compute a …
partition the data into smaller pieces, solve the problem on each piece and compute a …
[PDF][PDF] Distributed submodular maximization
Many large-scale machine learning problems–clustering, non-parametric learning, kernel
machines, etc.–require selecting a small yet representative subset from a large dataset …
machines, etc.–require selecting a small yet representative subset from a large dataset …
SCARAB: Scaling reachability computation on large graphs
Most of the existing reachability indices perform well on small-to medium-size graphs, but
reach a scalability bottleneck around one million vertices/edges. As graphs become …
reach a scalability bottleneck around one million vertices/edges. As graphs become …
Harmonia: Balancing compute and memory power in high-performance gpus
In this paper, we address the problem of efficiently managing the relative power demands of
a high-performance GPU and its memory subsystem. We develop a management approach …
a high-performance GPU and its memory subsystem. We develop a management approach …
Better streaming algorithms for the maximum coverage problem
We study the classic NP-Hard problem of finding the maximum k-set coverage in the data
stream model: given a set system of m sets that are subsets of a universe 1,…, n {1,...,n\}, find …
stream model: given a set system of m sets that are subsets of a universe 1,…, n {1,...,n\}, find …
Improved local search for the minimum weight dominating set problem in massive graphs by using a deep optimization mechanism
The minimum weight dominating set (MWDS) problem is an important generalization of the
minimum dominating set problem with various applications. In this work, we develop an …
minimum dominating set problem with various applications. In this work, we develop an …
Submodular optimization over sliding windows
Maximizing submodular functions under cardinality constraints lies at the core of numerous
data mining and machine learning applications, including data diversification, data …
data mining and machine learning applications, including data diversification, data …