Data mining and machine learning in astronomy

NM Ball, RJ Brunner - International Journal of Modern Physics D, 2010 - World Scientific
We review the current state of data mining and machine learning in astronomy. Data Mining
can have a somewhat mixed connotation from the point of view of a researcher in this field. If …

Coupling different methods for overcoming the class imbalance problem

L Nanni, C Fantozzi, N Lazzarini - Neurocomputing, 2015 - Elsevier
Many classification problems must deal with imbalanced datasets where one class–the
majority class–outnumbers the other classes. Standard classification methods do not …

Probabilistic cross-identification of astronomical sources

T Budavári, AS Szalay - The Astrophysical Journal, 2008 - iopscience.iop.org
We present a general probabilistic formalism for cross-identifying astronomical point sources
in multiple observations. Our Bayesian approach, symmetric in all observations, is the …

Evidence for H i replenishment in massive galaxies through gas accretion from the cosmic web

D Kleiner, KA Pimbblet, DH Jones… - Monthly Notices of …, 2017 - academic.oup.com
We examine the H i-to-stellar mass ratio (H i fraction) for galaxies near filament backbones
within the nearby Universe (d< 181 Mpc). This work uses the 6-degree Field Galaxy Survey …

Scientific data mining in astronomy

KD Borne - Next generation of data mining, 2008 - taylorfrancis.com
References............................................................................ 1095.1 Introduction It has been said
that astronomers have been doing data mining for centuries:“thedata are mine, and you …

Support vector machines and kd-tree for separating quasars from large survey data bases

D Gao, YX Zhang, YH Zhao - Monthly Notices of the Royal …, 2008 - academic.oup.com
We compare the performance of two automated classification algorithms, k-dimensional tree
(kd-tree) and support vector machines (SVMs), to separate quasars from stars in the data …

Bayesian matching for X-ray and infrared sources in the MYStIX project

T Naylor, PS Broos, ED Feigelson - The Astrophysical Journal …, 2013 - iopscience.iop.org
Identifying the infrared counterparts of X-ray sources in Galactic plane fields such as those of
the MYStIX project presents particular difficulties due to the high density of infrared sources …

A support vector machine for spectral classification of emission-line galaxies from the Sloan Digital Sky Survey

F Shi, YY Liu, GL Sun, PY Li, YM Lei… - Monthly Notices of the …, 2015 - academic.oup.com
The emission-lines of galaxies originate from massive young stars or supermassive
blackholes. As a result, spectral classification of emission-line galaxies into star-forming …

OCCT: A one-class clustering tree for implementing one-to-many data linkage

A Shabtai, L Rokach, Y Elovici - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
One-to-many data linkage is an essential task in many domains, yet only a handful of prior
publications have addressed this issue. Furthermore, while traditionally data linkage is …

A genetic encoding approach for learning methods for combining classifiers

L Nanni, A Lumini - Expert Systems with Applications, 2009 - Elsevier
Several studies have reported that the ensemble of classifiers can improve the performance
of a stand-alone classifier. In this paper, we propose a learning method for combining the …