Frequent pattern mining: current status and future directions
Frequent pattern mining has been a focused theme in data mining research for over a
decade. Abundant literature has been dedicated to this research and tremendous progress …
decade. Abundant literature has been dedicated to this research and tremendous progress …
Frequent item set mining
C Borgelt - Wiley interdisciplinary reviews: data mining and …, 2012 - Wiley Online Library
Frequent item set mining is one of the best known and most popular data mining methods.
Originally developed for market basket analysis, it is used nowadays for almost any task that …
Originally developed for market basket analysis, it is used nowadays for almost any task that …
A survey on explainable anomaly detection
In the past two decades, most research on anomaly detection has focused on improving the
accuracy of the detection, while largely ignoring the explainability of the corresponding …
accuracy of the detection, while largely ignoring the explainability of the corresponding …
Krimp: mining itemsets that compress
One of the major problems in pattern mining is the explosion of the number of results. Tight
constraints reveal only common knowledge, while loose constraints lead to an explosion in …
constraints reveal only common knowledge, while loose constraints lead to an explosion in …
TacticFlow: Visual analytics of ever-changing tactics in racket sports
Event sequence mining is often used to summarize patterns from hundreds of sequences
but faces special challenges when handling racket sports data. In racket sports (eg, tennis …
but faces special challenges when handling racket sports data. In racket sports (eg, tennis …
Fast and reliable anomaly detection in categorical data
Spotting anomalies in large multi-dimensional databases is a crucial task with many
applications in finance, health care, security, etc. We introduce COMPREX, a new approach …
applications in finance, health care, security, etc. We introduce COMPREX, a new approach …
Maximum entropy models and subjective interestingness: an application to tiles in binary databases
T De Bie - Data Mining and Knowledge Discovery, 2011 - Springer
Recent research has highlighted the practical benefits of subjective interestingness
measures, which quantify the novelty or unexpectedness of a pattern when contrasted with …
measures, which quantify the novelty or unexpectedness of a pattern when contrasted with …
An information theoretic framework for data mining
T De Bie - Proceedings of the 17th ACM SIGKDD international …, 2011 - dl.acm.org
We formalize the data mining process as a process of information exchange, defined by the
following key components. The data miner's state of mind is modeled as a probability …
following key components. The data miner's state of mind is modeled as a probability …
Spatial-temporal traffic flow pattern identification and anomaly detection with dictionary-based compression theory in a large-scale urban network
Traffic flow pattern identification, as well as anomaly detection, is an important component
for traffic operations and control. To reveal the characteristics of regional traffic flow patterns …
for traffic operations and control. To reveal the characteristics of regional traffic flow patterns …
Tell me what i need to know: succinctly summarizing data with itemsets
Data analysis is an inherently iterative process. That is, what we know about the data greatly
determines our expectations, and hence, what result we would find the most interesting. With …
determines our expectations, and hence, what result we would find the most interesting. With …