Search Paper
  • Home
  • Login
  • Categories
  • Post URL
  • Academic Resources
  • Contact Us

 

Data-driven Feature Selection Methods for Text Classification: an Empirical Evaluation

google+
Views: 115                 

Author :  Rogerio C. P. Fragoso,Roberto H. W. Pinheiro,George D. C. Cavalcanti

Affiliation :  Universidade Federal de Pernambuco, Brazil

Country :  Brazil

Category :  Computer Science & Information Technology

Volume, Issue, Month, Year :  25, 4, April, 2019

Abstract :


: Dimensionality reduction is a crucial task in text classification. The most adopted strategy is feature selection using filter methods. This approach presents a difficulty in determining the best size for the final feature vector. At Least One FeaTure (ALOFT), Maximum f Features per Document (MFD), Maximum f Features per Document-Reduced (MFDR) and Class-dependent Maximum f Features per Document-Reduced (cMFDR) are feature selection methods that define automatically the number of features per Corpus. However, MFD, MFDR, and cMFDR require a parameter that defines the number of features to be selected per document. Automatic Feature Subsets Analyzer (AFSA) is an auxiliary method that automates such configuration. In this paper, we evaluate dimensionality reduction, classification performance and execution time of this family of methods: ALOFT, MFD, MFDR, cMFDR and AFSA. The experiments are conducted using three feature evaluation functions and twenty databases. MFD obtained the best results among the feature selection methods. In addition, the experiments showed that the use of AFSA does not significantly affect the classification performances or the dimensionality reduction rates of the feature selection methods, but considerably reduces their execution times.

Keyword :  feature selection, text classification

Journal/ Proceedings Name :  Journal of Universal Computer Science

URL :  http://www.jucs.org/jucs_25_4/data_driven_feature_selection/jucs_25_04_0334_0360_fragoso.pdf

User Name : alex
Posted 15-02-2020 on 14:44:51 AEDT



Related Research Work

  • An Adaptive And Smart System For Parental Control On Digital Games
  • Intelligent System For Solving Problems Of Veterinary Medicine On The Example Of Dairy Farms
  • A Diet Control And Fitness Assistant Application Using Deep Learning-based Image Classification
  • 2d Image Features Detector And Descriptor Selection Expert System

About Us | Post Cfp | Share URL Main | Share URL category | Post URL
All Rights Reserved @ Call for Papers - Conference & Journals