Big data systems: A software engineering perspective
A Davoudian, M Liu - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Big Data Systems (BDSs) are an emerging class of scalable software technologies whereby
massive amounts of heterogeneous data are gathered from multiple sources, managed …
massive amounts of heterogeneous data are gathered from multiple sources, managed …
[PDF][PDF] The Data Civilizer System.
In many organizations, it is often challenging for users to find relevant data for specific tasks,
since the data is usually scattered across the enterprise and often inconsistent. In fact, data …
since the data is usually scattered across the enterprise and often inconsistent. In fact, data …
Sok: Cryptographically protected database search
Protected database search systems cryptographically isolate the roles of reading from,
writing to, and administering the database. This separation limits unnecessary administrator …
writing to, and administering the database. This separation limits unnecessary administrator …
A systematic overview of data federation systems
Data federation addresses the problem of uniformly accessing multiple, possibly
heterogeneous data sources, by map** them into a unified schema, such as an RDF …
heterogeneous data sources, by map** them into a unified schema, such as an RDF …
Enabling query processing across heterogeneous data models: A survey
Modern applications often need to manage and analyze widely diverse datasets that span
multiple data models [1],[2],[3],[4],[5]. Warehousing the data through Extract-Transform-Load …
multiple data models [1],[2],[3],[4],[5]. Warehousing the data through Extract-Transform-Load …
[PDF][PDF] Data Ingestion for the Connected World.
In this paper, we argue that in many “Big Data” applications, getting data into the system
correctly and at scale via traditional ETL (Extract, Transform, and Load) processes is a …
correctly and at scale via traditional ETL (Extract, Transform, and Load) processes is a …
The BigDAWG polystore system and architecture
V Gadepally, P Chen, J Duggan… - 2016 IEEE High …, 2016 - ieeexplore.ieee.org
Organizations are often faced with the challenge of providing data management solutions for
large, heterogenous datasets that may have different underlying data and programming …
large, heterogenous datasets that may have different underlying data and programming …
BatchDB: Efficient isolated execution of hybrid OLTP+ OLAP workloads for interactive applications
In this paper we present BatchDB, an in-memory database engine designed for hybrid OLTP
and OLAP workloads. BatchDB achieves good performance, provides a high level of data …
and OLAP workloads. BatchDB achieves good performance, provides a high level of data …
A survey of state management in big data processing systems
The concept of state and its applications vary widely across big data processing systems.
This is evident in both the research literature and existing systems, such as Apache Flink …
This is evident in both the research literature and existing systems, such as Apache Flink …
Big data in cloud computing review and opportunities
M Muniswamaiah, T Agerwala, C Tappert - arxiv preprint arxiv …, 2019 - arxiv.org
Big Data is used in decision making process to gain useful insights hidden in the data for
business and engineering. At the same time it presents challenges in processing, cloud …
business and engineering. At the same time it presents challenges in processing, cloud …