On data lake architectures and metadata management

P Sawadogo, J Darmont - Journal of Intelligent Information Systems, 2021 - Springer
Over the past two decades, we have witnessed an exponential increase of data production
in the world. So-called big data generally come from transactional systems, and even more …

An adaptable big data value chain framework for end-to-end big data monetization

AZ Faroukhi, I El Alaoui, Y Gahi, A Amine - Big Data and Cognitive …, 2020 - mdpi.com
Today, almost all active organizations manage a large amount of data from their business
operations with partners, customers, and even competitors. They rely on Data Value Chain …

Data lakes: A survey of concepts and architectures

S Azzabi, Z Alfughi, A Ouda - Computers, 2024 - mdpi.com
This paper presents a comprehensive literature review on the evolution of data-lake
technology, with a particular focus on data-lake architectures. By systematically examining …

Concept drift adaptation techniques in distributed environment for real-world data streams

H Mehmood, P Kostakos, M Cortes… - Smart Cities, 2021 - mdpi.com
Real-world data streams pose a unique challenge to the implementation of machine
learning (ML) models and data analysis. A notable problem that has been introduced by the …

Data lakes: A survey of functions and systems

R Hai, C Koutras, C Quix… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Data lakes are becoming increasingly prevalent for Big Data management and data
analytics. In contrast to traditional 'schema-on-write' approaches such as data warehouses …

Spatial big data architecture: from data warehouses and data lakes to the Lakehouse

SA Errami, H Hajji, KA El Kadi, H Badir - Journal of Parallel and Distributed …, 2023 - Elsevier
The construction of systems supporting spatial data has experienced great enthusiasm in
the past, due to the richness of this type of data and their semantics, which can be used in …

Kafka-based architecture in building data lakes for real-time data streams

K Peddireddy - International Journal of Computer Applications, 2023 - academia.edu
The purpose of this paper is to investigate how Kafka can be used to construct data lakes for
real-time data processing. Kafka has gained widespread popularity as a data ingestion and …

Data integration for digital twins in the built environment based on federated data models

J Merino, X Xie, N Moretti, JY Chang… - Proceedings of the …, 2023 - icevirtuallibrary.com
Improving the efficiency of operations is a major challenge in facility management given the
limitations of outsourcing individual building functions to third-party companies. The status of …

A novel Edge architecture and solution for detecting concept drift in smart environments

H Mehmood, A Khalid, P Kostakos, E Gilman… - Future Generation …, 2024 - Elsevier
The proliferation of the Internet of Things (IoT), artificial intelligence (AI), the adoption of 5G,
and progress towards 6G technology have led to the accumulation of massive amounts of …

Data lake architecture for storing and transforming web server access log files

E Zagan, M Danubianu - IEEE Access, 2023 - ieeexplore.ieee.org
Web server access log files are text files containing important data about server activities,
client requests addressed to a server, server responses, etc. Large-scale analysis of these …