On data lake architectures and metadata management
Over the past two decades, we have witnessed an exponential increase of data production
in the world. So-called big data generally come from transactional systems, and even more …
in the world. So-called big data generally come from transactional systems, and even more …
An adaptable big data value chain framework for end-to-end big data monetization
Today, almost all active organizations manage a large amount of data from their business
operations with partners, customers, and even competitors. They rely on Data Value Chain …
operations with partners, customers, and even competitors. They rely on Data Value Chain …
[HTML][HTML] Data lakes: A survey of concepts and architectures
This paper presents a comprehensive literature review on the evolution of data-lake
technology, with a particular focus on data-lake architectures. By systematically examining …
technology, with a particular focus on data-lake architectures. By systematically examining …
Concept drift adaptation techniques in distributed environment for real-world data streams
Real-world data streams pose a unique challenge to the implementation of machine
learning (ML) models and data analysis. A notable problem that has been introduced by the …
learning (ML) models and data analysis. A notable problem that has been introduced by the …
Data lakes: A survey of functions and systems
Data lakes are becoming increasingly prevalent for Big Data management and data
analytics. In contrast to traditional 'schema-on-write'approaches such as data warehouses …
analytics. In contrast to traditional 'schema-on-write'approaches such as data warehouses …
Spatial big data architecture: from data warehouses and data lakes to the Lakehouse
The construction of systems supporting spatial data has experienced great enthusiasm in
the past, due to the richness of this type of data and their semantics, which can be used in …
the past, due to the richness of this type of data and their semantics, which can be used in …
[PDF][PDF] Kafka-based architecture in building data lakes for real-time data streams
K Peddireddy - International Journal of Computer Applications, 2023 - academia.edu
The purpose of this paper is to investigate how Kafka can be used to construct data lakes for
real-time data processing. Kafka has gained widespread popularity as a data ingestion and …
real-time data processing. Kafka has gained widespread popularity as a data ingestion and …
Data integration for digital twins in the built environment based on federated data models
Improving the efficiency of operations is a major challenge in facility management given the
limitations of outsourcing individual building functions to third-party companies. The status of …
limitations of outsourcing individual building functions to third-party companies. The status of …
[HTML][HTML] A novel Edge architecture and solution for detecting concept drift in smart environments
The proliferation of the Internet of Things (IoT), artificial intelligence (AI), the adoption of 5G,
and progress towards 6G technology have led to the accumulation of massive amounts of …
and progress towards 6G technology have led to the accumulation of massive amounts of …
Data lake architecture for storing and transforming web server access log files
Web server access log files are text files containing important data about server activities,
client requests addressed to a server, server responses, etc. Large-scale analysis of these …
client requests addressed to a server, server responses, etc. Large-scale analysis of these …