Fusionnet: 3d object classification using multiple data representations
High-quality 3D object recognition is an important component of many vision and robotics
systems. We tackle the object recognition problem using two data representations, to …
systems. We tackle the object recognition problem using two data representations, to …
An inherently nonnegative latent factor model for high-dimensional and sparse matrices from industrial applications
High-dimensional and sparse (HiDS) matrices are commonly encountered in many big-data-
related and industrial applications like recommender systems. When acquiring useful …
related and industrial applications like recommender systems. When acquiring useful …
Algorithms of unconstrained non-negative latent factor analysis for recommender systems
Non-negativity is vital for a latent factor (LF)-based model to preserve the important feature
of a high-dimensional and sparse (HiDS) matrix in recommender systems, ie, none of its …
of a high-dimensional and sparse (HiDS) matrix in recommender systems, ie, none of its …
Sliceline: Fast, linear-algebra-based slice finding for ml model debugging
S Sagadeeva, M Boehm - … of the 2021 international conference on …, 2021 - dl.acm.org
Slice finding---a recent work on debugging machine learning (ML) models---aims to find the
top-K data slices (eg, conjunctions of predicates such as gender female and degree PhD) …
top-K data slices (eg, conjunctions of predicates such as gender female and degree PhD) …
Accelerating big data applications using lightweight virtualization framework on enterprise cloud
Hypervisor-based virtualization technology has been successfully used to deploy high-
performance and scalable infrastructure for Hadoop, and now Spark applications. Container …
performance and scalable infrastructure for Hadoop, and now Spark applications. Container …
[BUKU][B] Data management in machine learning systems
Large-scale data analytics using machine learning (ML) underpins many modern data-
driven applications. ML systems provide means of specifying and executing these ML …
driven applications. ML systems provide means of specifying and executing these ML …
Compressed linear algebra for large-scale machine learning
Large-scale machine learning (ML) algorithms are often iterative, using repeated read-only
data access and I/O-bound matrix-vector multiplications to converge to an optimal model. It …
data access and I/O-bound matrix-vector multiplications to converge to an optimal model. It …
A survey on geographically distributed big-data processing using MapReduce
S Dolev, P Florissi, E Gudes… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Hadoop and Spark are widely used distributed processing frameworks for large-scale data
processing in an efficient and fault-tolerant manner on private or public clouds. These big …
processing in an efficient and fault-tolerant manner on private or public clouds. These big …
A survey of big data analytics for smart forestry
Accurate and reliable forestry data can be obtained by means of continuous monitoring of
forests using advanced technologies, which provides a major opportunity for the …
forests using advanced technologies, which provides a major opportunity for the …
Bridging the gap between HPC and big data frameworks
Apache Spark is a popular framework for data analytics with attractive features such as fault
tolerance and interoperability with the Hadoop ecosystem. Unfortunately, many analytics …
tolerance and interoperability with the Hadoop ecosystem. Unfortunately, many analytics …