Fusionnet: 3d object classification using multiple data representations

V Hegde, R Zadeh - arxiv preprint arxiv:1607.05695, 2016 - arxiv.org
High-quality 3D object recognition is an important component of many vision and robotics
systems. We tackle the object recognition problem using two data representations, to …

An inherently nonnegative latent factor model for high-dimensional and sparse matrices from industrial applications

X Luo, MC Zhou, S Li, MS Shang - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
High-dimensional and sparse (HiDS) matrices are commonly encountered in many big-data-
related and industrial applications like recommender systems. When acquiring useful …

Algorithms of unconstrained non-negative latent factor analysis for recommender systems

X Luo, M Zhou, S Li, D Wu, Z Liu… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Non-negativity is vital for a latent factor (LF)-based model to preserve the important feature
of a high-dimensional and sparse (HiDS) matrix in recommender systems, ie, none of its …

Sliceline: Fast, linear-algebra-based slice finding for ml model debugging

S Sagadeeva, M Boehm - … of the 2021 international conference on …, 2021 - dl.acm.org
Slice finding---a recent work on debugging machine learning (ML) models---aims to find the
top-K data slices (eg, conjunctions of predicates such as gender female and degree PhD) …

Accelerating big data applications using lightweight virtualization framework on enterprise cloud

J Bhimani, Z Yang, M Leeser… - 2017 IEEE High …, 2017 - ieeexplore.ieee.org
Hypervisor-based virtualization technology has been successfully used to deploy high-
performance and scalable infrastructure for Hadoop, and now Spark applications. Container …

[BUKU][B] Data management in machine learning systems

M Boehm, A Kumar, J Yang - 2022 - books.google.com
Large-scale data analytics using machine learning (ML) underpins many modern data-
driven applications. ML systems provide means of specifying and executing these ML …

Compressed linear algebra for large-scale machine learning

A Elgohary, M Boehm, PJ Haas, FR Reiss… - Proceedings of the …, 2016 - dl.acm.org
Large-scale machine learning (ML) algorithms are often iterative, using repeated read-only
data access and I/O-bound matrix-vector multiplications to converge to an optimal model. It …

A survey on geographically distributed big-data processing using MapReduce

S Dolev, P Florissi, E Gudes… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Hadoop and Spark are widely used distributed processing frameworks for large-scale data
processing in an efficient and fault-tolerant manner on private or public clouds. These big …

A survey of big data analytics for smart forestry

W Zou, W **g, G Chen, Y Lu, H Song - Ieee Access, 2019 - ieeexplore.ieee.org
Accurate and reliable forestry data can be obtained by means of continuous monitoring of
forests using advanced technologies, which provides a major opportunity for the …

Bridging the gap between HPC and big data frameworks

M Anderson, S Smith, N Sundaram, M Capotă… - Proceedings of the …, 2017 - dl.acm.org
Apache Spark is a popular framework for data analytics with attractive features such as fault
tolerance and interoperability with the Hadoop ecosystem. Unfortunately, many analytics …