Fast relevance-redundancy dominance: Feature selection for high dimensional data

Research output: Chapter in Book/Report/Conference proceedingsChapterpeer-review

Abstract

Feature selection is used to select a subset of features for model construction. This reduces dimensionality which is important for simplification, efficiency and reducing overfitting. Filter-based methods are the most scalable, rating features by their relevance to the target variable via appropriate statistics. Browne, et al. proposed a filter feature selection method, called Relevance-Redundancy Dominance (RRD) with useful properties (no threshold setting, adaptability to any statistics etc.), but with a poor scalability. In this paper, we present a scalable version of RRD, called Fast Relevance-Redundancy Dominance which holds the same properties as RRD while improving scalability. To show the effectiveness of the proposed approach we have carried out extensive numerical experiments on high dimensional datasets (DNA microarray datasets) which shows that it outperforms state-of-the-art algorithms.

Original languageEnglish
Title of host publicationProceedings of the International Conferences on Computer Graphics, Visualization, Computer Vision and Image Processing 2017 and Big Data Analytics, Data Mining and Computational Intelligence 2017 - Part of the Multi Conference on Computer Science and Information Systems 2017
EditorsLuis Rodrigues, Yingcai Xiao, Ajith P. Abraham
PublisherIADIS
Pages255-262
Number of pages8
ISBN (Electronic)9789898533661
Publication statusPublished - 2017
Event11th International Conferences on Computer Graphics, Visualization, Computer Vision and Image Processing, CGVCVIP 2017 and International Conference on Big Data Analytics, Data Mining and Computational Intelligence, BigDaCI 2017 - Lisbon, Portugal
Duration: 21 Jul 201723 Jul 2017

Publication series

NameProceedings of the International Conferences on Computer Graphics, Visualization, Computer Vision and Image Processing 2017 and Big Data Analytics, Data Mining and Computational Intelligence 2017 - Part of the Multi Conference on Computer Science and Information Systems 2017

Conference

Conference11th International Conferences on Computer Graphics, Visualization, Computer Vision and Image Processing, CGVCVIP 2017 and International Conference on Big Data Analytics, Data Mining and Computational Intelligence, BigDaCI 2017
Country/TerritoryPortugal
CityLisbon
Period21/07/1723/07/17

Keywords

  • Feature-Selection
  • Filter-Based
  • High-Dimensional
  • Microarray
  • Scalability

Fingerprint

Dive into the research topics of 'Fast relevance-redundancy dominance: Feature selection for high dimensional data'. Together they form a unique fingerprint.

Cite this