iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://doi.org/10.1145/3449726.3459481
Scatter search for high-dimensional feature selection using feature grouping | Proceedings of the Genetic and Evolutionary Computation Conference Companion skip to main content
10.1145/3449726.3459481acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
poster

Scatter search for high-dimensional feature selection using feature grouping

Published: 08 July 2021 Publication History

Abstract

In feature selection tasks, finding the optimal subset of features is unfeasible due to the increase of the search space with the dimensionality. In order to reduce the complexity of the space, feature grouping approach aims to generate subsets of correlated features. In this context, evolutionary algorithms have proven to achieve competitive solutions. In this work we propose a novel Scatter Search (SS) strategy that uses feature grouping to generate a population of diverse and high quality solutions. Solutions are evolved by applying random mechanisms in combination with the feature group structure to maintain the diversity and the quality of the solutions during the search. We test the proposed strategy on high dimensional data from biomedical domains and compare the performance against the first adaptation of the SS to the feature selection problem. Results show that our proposal is able to find smaller subsets of features while keeping a similar predictive power of the classifier models. Finally, a case of study regarding melanoma skin cancer is analysed using the proposed strategy.

References

[1]
M. García-Torres, F. Gómez-Vela, B. Melián-Batista, and J. M. Moreno-Vega. 2016. High-dimensional feature selection via feature grouping: A Variable Neighborhood Search approach. Information Sciences 326 (2016), 102--118.
[2]
F. C. García-López, M. García-Torres, B. Melián-Batista, J. A. Moreno-Pérez, and J. M. Moreno-Vega. 2006. Solving the Feature Selection Problem by a Parallel Scatter Search. European Journal of Operations Research 169, 2 (2006), 477--489.
[3]
M. A. Hall. 1999. Correlation-based Feature Subset Selection for Machine Learning. Ph.D. Dissertation. University of Waikato, Hamilton, New Zealand.
[4]
D. N. Leguizamon Correa, L. R. Bareiro Paniagua, J. L. Vazquez Noguera, D. P. Pinto-Roa, and L. A. Salgueiro Toledo. 2015. Computerized Diagnosis of Melanocytic Lesions Based on the ABCD Method. In 2015 Latin American Computing Conference (CLEI). 1--12.

Cited By

View all
  • (2024)Improving accuracy of vascular access quality classification in hemodialysis patients using deep learning with K highest score feature selectionJournal of International Medical Research10.1177/0300060524123251952:4Online publication date: 4-Apr-2024
  • (2024)A hybrid scatter search method for solving fuzzy no-wait flow-shop scheduling problemsEngineering Optimization10.1080/0305215X.2024.2367600(1-34)Online publication date: 4-Jul-2024
  • (2023)Review of feature selection approaches based on grouping of featuresPeerJ10.7717/peerj.1566611(e15666)Online publication date: 17-Jul-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GECCO '21: Proceedings of the Genetic and Evolutionary Computation Conference Companion
July 2021
2047 pages
ISBN:9781450383516
DOI:10.1145/3449726
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 July 2021

Check for updates

Author Tags

  1. feature grouping
  2. feature selection
  3. high dimensionality
  4. metaheuristic

Qualifiers

  • Poster

Funding Sources

  • This work was supported by the CONACYT, Paraguay, under Grant PINV18-1129

Conference

GECCO '21
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 08 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Improving accuracy of vascular access quality classification in hemodialysis patients using deep learning with K highest score feature selectionJournal of International Medical Research10.1177/0300060524123251952:4Online publication date: 4-Apr-2024
  • (2024)A hybrid scatter search method for solving fuzzy no-wait flow-shop scheduling problemsEngineering Optimization10.1080/0305215X.2024.2367600(1-34)Online publication date: 4-Jul-2024
  • (2023)Review of feature selection approaches based on grouping of featuresPeerJ10.7717/peerj.1566611(e15666)Online publication date: 17-Jul-2023
  • (2023)Artificial Intelligence for Multiple Sclerosis Management Using Retinal Images: Pearl, Peaks, and PitfallsSeminars in Ophthalmology10.1080/08820538.2023.229303039:4(271-288)Online publication date: 13-Dec-2023
  • (2023)Evolutionary feature selection on high dimensional data using a search space reduction approachEngineering Applications of Artificial Intelligence10.1016/j.engappai.2022.105556117:PAOnline publication date: 1-Jan-2023
  • (2022)Hybrid Feature Selection Method Based on Feature Subset and Factor AnalysisIEEE Access10.1109/ACCESS.2022.322281210(120792-120803)Online publication date: 2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media