Workflow systems for science: Concepts and tools
D Talia - International Scholarly Research Notices, 2013 - Wiley Online Library
The wide availability of high‐performance computing systems, Grids and Clouds, allowed
scientists and engineers to implement more and more complex applications to access and …
scientists and engineers to implement more and more complex applications to access and …
Distributed data mining: a survey
Most data mining approaches assume that the data can be provided from a single source. If
data was produced from many physically distributed locations like Wal-Mart, these methods …
data was produced from many physically distributed locations like Wal-Mart, these methods …
The WEKA data mining software: an update
More than twelve years have elapsed since the first public release of WEKA. In that time, the
software has been rewritten entirely from scratch, evolved substantially and now …
software has been rewritten entirely from scratch, evolved substantially and now …
Xel: A cloud-agnostic data platform for the design-driven building of high-availability data science services
This paper presents Xel, a cloud-agnostic data platform for the design-driven building of
high-availability data science services as a support tool for data-driven decision-making. We …
high-availability data science services as a support tool for data-driven decision-making. We …
Active learning for sentiment analysis on data streams: Methodology and workflow implementation in the ClowdFlows platform
Sentiment analysis from data streams is aimed at detecting authors' attitude, emotions and
opinions from texts in real-time. To reduce the labeling effort needed in the data collection …
opinions from texts in real-time. To reduce the labeling effort needed in the data collection …
ClowdFlows: Online workflows for distributed big data mining
The paper presents a platform for distributed computing, developed using the latest software
technologies and computing paradigms to enable big data mining. The platform, called …
technologies and computing paradigms to enable big data mining. The platform, called …
A parallel distributed weka framework for big data mining using spark
AK Koliopoulos, P Yiapanis, F Tekiner… - … congress on big …, 2015 - ieeexplore.ieee.org
Effective Big Data Mining requires scalable and efficient solutions that are also accessible to
users of all levels of expertise. Despite this, many current efforts to provide effective …
users of all levels of expertise. Despite this, many current efforts to provide effective …
GPU-based bees swarm optimization for association rules mining
Association rules mining (ARM) is a well-known combinatorial optimization problem aiming
at extracting relevant rules from given large-scale datasets. According to the state of the art …
at extracting relevant rules from given large-scale datasets. According to the state of the art …
Toolkit-based high-performance data mining of large data on MapReduce clusters
D Wegener, M Mock, D Adranale… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
The enormous growth of data in a variety of applications has increased the need for high
performance data mining based on distributed environments. However, standard data …
performance data mining based on distributed environments. However, standard data …
A semantic framework for automatic generation of computational workflows using distributed data and component catalogues
Computational workflows are a powerful paradigm to represent and manage complex
applications, particularly in large-scale distributed scientific data analysis. Workflows …
applications, particularly in large-scale distributed scientific data analysis. Workflows …