The state of the art of metadata managements in large-scale distributed file systems—scalability, performance and availability

H Dai, Y Wang, KB Kent, L Zeng… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
File system metadata is the data in charge of maintaining namespace, permission semantics
and location of file data blocks. Operations on the metadata can account for up to 80% of …

HVAC: Removing I/O bottleneck for large-scale deep learning applications

A Khan, AK Paul, C Zimmer, S Oral… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Scientific communities are increasingly adopting deep learning (DL) models in their
applications to accelerate scientific discovery processes. However, with rapid growth in the …

Matrix profile-based approach to industrial sensor data analysis inside RDBMS

M Zymbler, E Ivanova - Mathematics, 2021 - mdpi.com
Currently, big sensor data arise in a wide spectrum of Industry 4.0, Internet of Things, and
Smart City applications. In such subject domains, sensors tend to have a high frequency and …

MOSIQS: Persistent memory object storage with metadata indexing and querying for scientific computing

A Khan, H Sim, SS Vazhkudai, Y Kim - IEEE Access, 2021 - ieeexplore.ieee.org
Scientific applications often require high-bandwidth shared storage to perform joint
simulations and collaborative data analytics. Shared memory pools provide a chance to …

Persistent memory object storage and indexing for scientific computing

A Khan, H Sim, SS Vazhkudai, J Ma… - 2020 IEEE/ACM …, 2020 - ieeexplore.ieee.org
This paper presents Mosiqs, a persistent memory object storage framework with metadata
indexing and querying for scientific computing. We design Mosiqs based on the key idea …

A Data-Aware Remote Procedure Call Method for Big Data Systems

J Wang, Y Yang, J Zhang, X Yu, O Alfarraj… - Comput. Syst. Sci …, 2020 - academia.edu
In recent years, big data has been one of the hottest development directions in the
information field. With the development of artificial intelligence technology, mobile smart …

Efficient distributed association management method of data, model, and knowledge for digital twin railway

Y Guo, Q Zhu, Y Ding, Y Li, H Wu, Y He… - … Journal of Digital …, 2024 - Taylor & Francis
Digital twin railway is a pivotal foundation for the intelligent construction and maintenance of
railway engineering projects within extensive open spaces. Its essence is the integrated …

SCANNS: Towards scalable and concurrent data indexing and searching in high-end computing system

AI Orhean, A Giannakou… - 2022 22nd IEEE …, 2022 - ieeexplore.ieee.org
Increasing data volumes, particularly in science and engineering, has resulted in the
widespread adoption of parallel and distributed file systems for data storage and access …

IDIOMS: Index-powered distributed object-centric metadata search for scientific data management

W Zhang, H Tang, S Byna - 2024 IEEE 24th International …, 2024 - ieeexplore.ieee.org
Affix-oriented metadata search is one of the essential fuzzy search capabilities that allow
users to find data of interest in their voluminous data set with incomplete query conditions …

SCIPIS: Scalable and concurrent persistent indexing and search in high-end computing systems

AI Orhean, A Giannakou, L Ramakrishnan… - Journal of Parallel and …, 2024 - Elsevier
While it is now routine to search for data on a personal computer or discover data online,
there is no such equivalent method for discovering data on large parallel and distributed file …