Feature Selection ToolboxFST3 Library / Documentation

demo34.cpp File Reference

Example 34: Dependency-Aware Feature Ranking (DAF0). More...

#include <boost/smart_ptr.hpp>
#include <exception>
#include <iostream>
#include <cstdlib>
#include <string>
#include <vector>
#include "error.hpp"
#include "global.hpp"
#include "subset.hpp"
#include "data_intervaller.hpp"
#include "data_splitter.hpp"
#include "data_splitter_5050.hpp"
#include "data_splitter_cv.hpp"
#include "data_scaler.hpp"
#include "data_scaler_void.hpp"
#include "data_accessor_splitting_memTRN.hpp"
#include "data_accessor_splitting_memARFF.hpp"
#include "criterion_wrapper.hpp"
#include "distance_euclid.hpp"
#include "classifier_knn.hpp"
#include "search_monte_carlo.hpp"
#include "result_tracker_feature_stats.hpp"
Include dependency graph for demo34.cpp:

Functions

int main ()

Detailed Description

Example 34: Dependency-Aware Feature Ranking (DAF0).


Function Documentation

int main (  ) 

Example 34: Dependency-Aware Feature Ranking (DAF0).

Dependency-Aware Feature Ranking (DAF) is a novel approach to feature selection especially suitable for very-high-dimensional problems and over-fitting-prone feature selection scenarios. DAF evaluates a chosen criterion on a series of probe subsets to eventually rank features according to their estimated contextual quality. DAF has been shown capable of overperforming BIF quite significantly in many cases in terms of the quality of selected feature subsets, yet its stability and resistance against over-fitting remains on par with BIF. For details see UTIA Technical Report No. 2295 from February 2011. We demonstrate two slightly different forms of DAF (DAF0 and DAF1) on examples Example 34: Dependency-Aware Feature Ranking (DAF0). and Example 35t: Dependency-Aware Feature Ranking (DAF1) to enable Wrapper based FS on very-high-dimensional data.. Example34 illustrates the approach with k-NN accuracy wrapper criterion on 64-dimensional image data. Example 35t illustrates DAF with SVM wrapper applied to very-high-dimensional text categorization problem.

Note:
DAF (as BIF) provides feature ranking but does not determine final subset size.
This approach makes it possible to apply even the complex wrapper feature selection criteria in problems of very-high-dimensionality.

References FST::Search< RETURNTYPE, DIMTYPE, SUBSET, CRITERION >::enable_result_tracking(), FST::Search_Monte_Carlo< RETURNTYPE, DIMTYPE, SUBSET, CRITERION >::search(), FST::Search_Monte_Carlo< RETURNTYPE, DIMTYPE, SUBSET, CRITERION >::set_cardinality_randomization(), FST::Search< RETURNTYPE, DIMTYPE, SUBSET, CRITERION >::set_output_detail(), and FST::Search_Monte_Carlo< RETURNTYPE, DIMTYPE, SUBSET, CRITERION >::set_stopping_condition().


Generated on Thu Mar 31 11:36:14 2011 for FST3Library by  doxygen 1.6.1