Distributed data mining in credit card fraud detection

PK Chan, W Fan, AL Prodromidis… - … Intelligent Systems and …, 1999 - ieeexplore.ieee.org
Credit card transactions continue to grow in number, taking an ever-larger share of the US
payment system and leading to a higher rate of stolen account numbers and subsequent …

[BOEK][B] Association rule mining: models and algorithms

C Zhang, S Zhang - 2002 - Springer
During decision making, we are often confronted by a huge amount of factors. These factors
may be either an advantage or a disadvantage to a decision objective. For the purpose of …

[PDF][PDF] Ensemble Pruning Via Semi-definite Programming.

Y Zhang, S Burer, W Nick Street, KP Bennett… - Journal of machine …, 2006 - jmlr.org
An ensemble is a group of learning models that jointly solve a problem. However, the
ensembles generated by existing techniques are sometimes unnecessarily large, which can …

[PDF][PDF] Meta-learning in distributed data mining systems: Issues and approaches

A Prodromidis, P Chan, S Stolfo - Advances in distributed and …, 2000 - academia.edu
Data mining systems aim to discover patterns and extract useful information from facts
recorded in databases. A widely adopted approach to this objective is to apply various …

A survey of methods for scaling up inductive algorithms

F Provost, V Kolluri - Data mining and knowledge discovery, 1999 - Springer
One of the defining challenges for the KDD research community is to enable inductive
learning algorithms to mine very large databases. This paper summarizes, categorizes, and …

Synthesizing high-frequency rules from different data sources

X Wu, S Zhang - IEEE Transactions on Knowledge and Data …, 2003 - ieeexplore.ieee.org
Many large organizations have multiple data sources, such as different branches of an
interstate company. While putting all data together from different sources might amass a …

Improved dataset characterisation for meta-learning

Y Peng, PA Flach, C Soares, P Brazdil - Discovery Science: 5th …, 2002 - Springer
This paper presents new measures, based on the induced decision tree, to characterise
datasets for meta-learning in order to select appropriate learning algorithms. The main idea …

Database classification for multi-database mining

X Wu, C Zhang, S Zhang - Information Systems, 2005 - Elsevier
Many large organizations have multiple databases distributed in different branches, and
therefore multi-database mining is an important task for data mining. To reduce the search …

[PDF][PDF] Multi-database mining

S Zhang, X Wu, C Zhang - IEEE Computational Intelligence …, 2003 - comp.hkbu.edu.hk
Multi-database mining is an important research area because (1) there is an urgent need for
analyzing data in different sources,(2) there are essential differences between mono-and …

Cost complexity-based pruning of ensemble classifiers

AL Prodromidis, SJ Stolfo - Knowledge and Information Systems, 2001 - Springer
In this paper we study methods that combine multiple classification models learned over
separate data sets. Numerous studies posit that such approaches provide the means to …