More is better: recent progress in multi-omics data integration methods

S Huang, K Chaudhary, LX Garmire - Frontiers in genetics, 2017 - frontiersin.org
Multi-omics data integration is one of the major challenges in the era of precision medicine.
Considerable work has been done with the advent of high-throughput studies, which have …

Quality of information in mobile crowdsensing: Survey and research challenges

F Restuccia, N Ghosh, S Bhattacharjee… - ACM Transactions on …, 2017 - dl.acm.org
Smartphones have become the most pervasive devices in people's lives and are clearly
transforming the way we live and perceive technology. Today's smartphones benefit from …

[HTML][HTML] Snorkel: Rapid training data creation with weak supervision

A Ratner, SH Bach, H Ehrenberg, J Fries… - Proceedings of the …, 2017 - ncbi.nlm.nih.gov
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the …

A survey of heterogeneous information network analysis

C Shi, Y Li, J Zhang, Y Sun… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Most real systems consist of a large number of interacting, multi-typed components, while
most contemporary researches model them as homogeneous information networks, without …

Snorkel: rapid training data creation with weak supervision

A Ratner, SH Bach, H Ehrenberg, J Fries, S Wu, C Ré - The VLDB Journal, 2020 - Springer
Labeling training data is increasingly the largest bottleneck in deploying machine learning
systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the …

Unsupervised fake news detection on social media: A generative approach

S Yang, K Shu, S Wang, R Gu, F Wu, H Liu - Proceedings of the AAAI …, 2019 - aaai.org
Social media has become one of the main channels for people to access and consume
news, due to the rapidness and low cost of news dissemination on it. However, such …

Big data integration

XL Dong, D Srivastava - 2013 IEEE 29th international …, 2013 - ieeexplore.ieee.org
The Big Data era is upon us: data is being generated, collected and analyzed at an
unprecedented scale, and data-driven decision making is swee** through all aspects of …

Data and information quality

C Batini, M Scannapieco - Cham, Switzerland: Springer International …, 2016 - Springer
This book is the result of a study path that started in 2006, when the two authors of this book
published the book Data Quality: Concepts, Methodologies and Techniques. After 8 years …

Mining heterogeneous information networks: a structural analysis approach

Y Sun, J Han - ACM SIGKDD explorations newsletter, 2013 - dl.acm.org
Most objects and data in the real world are of multiple types, interconnected, forming
complex, heterogeneous but often semi-structured information networks. However, most …

A survey on truth discovery

Y Li, J Gao, C Meng, Q Li, L Su, B Zhao… - ACM Sigkdd …, 2016 - dl.acm.org
Thanks to information explosion, data for the objects of interest can be collected from
increasingly more sources. However, for the same object, there usually exist conflicts among …