Learning natural coding conventions
Every programmer has a characteristic style, ranging from preferences about identifier
naming to preferences about object relationships and design patterns. Coding conventions …
naming to preferences about object relationships and design patterns. Coding conventions …
Mining idioms from source code
We present the first method for automatically mining code idioms from a corpus of previously
written, idiomatic software projects. We take the view that a code idiom is a syntactic …
written, idiomatic software projects. We take the view that a code idiom is a syntactic …
Frequent approximate subgraphs as features for graph-based image classification
The use of approximate graph matching for frequent subgraph mining has been identified in
different applications as a need. To meet this need, several algorithms have been …
different applications as a need. To meet this need, several algorithms have been …
CSPM: Discovering compressing stars in attributed graphs
Graphs, also known as networks, are an expressive data representation used in many
domains. Numerous algorithms have been designed to find interesting patterns in graphs …
domains. Numerous algorithms have been designed to find interesting patterns in graphs …
Discovering representative attribute-stars via minimum description length
Graphs are a popular data type found in many domains. Numerous techniques have been
proposed to find interesting patterns in graphs to help understand the data and support …
proposed to find interesting patterns in graphs to help understand the data and support …
Mining patterns in source code using tree mining algorithms
Discovering regularities in source code is of great interest to software engineers, both in
academia and in industry, as regularities can provide useful information to help in a variety …
academia and in industry, as regularities can provide useful information to help in a variety …
Visual analytics of co-occurrences to discover subspaces in structured data
We present an approach that shows all relevant subspaces of categorical data condensed in
a single picture. We model the categorical values of the attributes as co-occurrences with …
a single picture. We model the categorical values of the attributes as co-occurrences with …
EvoMiner: frequent subtree mining in phylogenetic databases
The problem of mining collections of trees to identify common patterns, called frequent
subtrees (FSTs), arises often when trying to interpret the results of phylogenetic analysis …
subtrees (FSTs), arises often when trying to interpret the results of phylogenetic analysis …
The good, the bad, and the ugly: mining for patterns in student source code
Research on source code mining has been explored to discover interesting structural
regularities, API usage patterns, refactoring opportunities, bugs, crosscutting concerns, code …
regularities, API usage patterns, refactoring opportunities, bugs, crosscutting concerns, code …
Fast algorithms for semantic association search and pattern mining
Given a large graph representing relations between entities, searching for complex
relationships (called semantic associations, or SAs for short) between a set of entities is a …
relationships (called semantic associations, or SAs for short) between a set of entities is a …