BlobSeer: Next-generation data management for large scale infrastructures

B Nicolae, G Antoniu, L Bougé, D Moise… - Journal of Parallel and …, 2011 - Elsevier
As data volumes increase at a high speed in more and more application fields of science,
engineering, information services, etc., the challenges posed by data-intensive computing …

BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications

B Nicolae, D Moise, G Antoniu… - … on Parallel & …, 2010 - ieeexplore.ieee.org
Hadoop is a software framework supporting the Map-Reduce programming model. It relies
on the Hadoop Distributed File System (HDFS) as its primary storage system. The efficiency …

High throughput data-compression for cloud storage

B Nicolae - International Conference on Data Management in Grid …, 2010 - Springer
As data volumes processed by large-scale distributed data-intensive applications grow at
high-speed, an increasing I/O pressure is put on the underlying storage service, which is …

Towards mapreduce for desktop grid computing

B Tang, M Moca, S Chevalier, H He… - … Conference on P2P …, 2010 - ieeexplore.ieee.org
MapReduce is an emerging programming model for data-intense application proposed by
Google, which has attracted a lot of attention recently. MapReduce borrows from functional …

Metadata traces and workload models for evaluating big storage systems

CL Abad, H Luu, N Roberts, K Lee, Y Lu… - 2012 ieee fifth …, 2012 - ieeexplore.ieee.org
Efficient namespace metadata management is increasingly important as next-generation file
systems are designed for peta and exascales. New schemes have been proposed, however …

On the benefits of transparent compression for cost-effective cloud data storage

B Nicolae - Transactions on Large-Scale Data-and Knowledge …, 2011 - Springer
Abstract Infrastructure-as-a-Service (IaaS) cloud computing has revolutionized the way we
think of acquiring computational resources: it allows users to deploy virtual machines (VMs) …

BlobSeer: Towards efficient data storage management for large-scale, distributed systems

B Nicolae - 2010 - theses.hal.science
With data volumes increasing at a high rate and the emergence of highly scalable
infrastructures (cloud computing, petascale computing), distributed management of data …

Scalable data management for map-reduce-based data-intensive applications: a view for cloud and hybrid infrastructures

G Antoniu, A Costan, J Bigot… - … Journal of Cloud …, 2013 - inderscienceonline.com
As map-reduce emerges as a leading programming paradigm for data-intensive computing,
today's frameworks which support it still have substantial shortcomings that limit its potential …

BlobSeer: Efficient data management for data-intensive applications distributed at large-scale

B Nicolae, G Antoniu, L Bougé - 2010 IEEE International …, 2010 - ieeexplore.ieee.org
As the rate, scale and variety of data increases in complexity, the need for flexible
applications that can crunch huge amounts of heterogeneous data fast and cost-effective is …

Recent advances and research challenges in desktop grid and volunteer computing

G Fedak - Grids, P2P and Services Computing, 2010 - Springer
For over a decade, Desktop Grid systems have paved the way to high throughput computing
over large scale network of Desktop PCs. Nowadays, the aggregate computing power of the …