Emerging Trends in Computing |
Foundation of Computer Science USA |
ETC2016 - Number 4 |
March 2017 |
Authors: Apurva Y. Chaudhari, Satish. S. Banait |
8fc9b633-b883-4505-b372-4399339b4012 |
Apurva Y. Chaudhari, Satish. S. Banait . An Efficient Distributed Feature Subset Selection Technique on High Dimensional Small Sized Data. Emerging Trends in Computing. ETC2016, 4 (March 2017), 11-16.
Feature subset selection is a crucial phase in modeling accurate classifiers in data mining and machine learning, especially with High Dimensional Small Sized (HDSS) data. LDA can also be used for feature selection as an efficient measure for evaluation of the feature subset. While LDA is applied to feature selection on HDSS data and class imbalance, it meets some difficulties, such as singular scatter matrix, overwhelming, overfitting, and computational complexity. For this purpose, a new LDA based feature selection technique based is proposed which focuses more on minority class with a novel regularization technique. Main objective is to enhance the performance of feature subset selection process using LDA in distributed environment. Sample ratio between both classes has been determined.