Biomedical Imaging by Youxin Mao


Texture Analysis Methods for Medical Image Characterisation

Fig. 10. Plot of fractal island area against cumulative number of islands acquired within the area. The slope of the straight line plotted through the data is used to determine the fractal dimension.

5. Feature Selection, Reduction and Classification

The texture analysis approaches presented in the preceding sections calculate features that describe properties of the image, or region, being studied. This information is next used in a pattern recognition system to classify the objects, or texture patterns of interest, into an appropriate number of categories or classes (Therrien, 1989). However, some of the features calculated may be highly correlated and some may contain irrelevant information. Feature selection is used to select a subset of $s$ features from a given set of $p$ features such that $s \le p$ and there is no significant degradation in the performance of the classification system (Therrien, 1989; Zongker & Jain, 1996; Stearns, 1976). The reduction of the feature set reduces the dimensionality of the classification problem and in some cases can improve classification accuracy due to finite sample size effects (Jain & Chandrasekaran, 1982). Two powerful methods for reducing the number of features are presented. These are the sequential forward search (SFS) algorithm and its backward counterpart, the sequential backward search (SBS) algorithm (Devijver & Kittler, 1982). The pattern recognition system must also be capable of partitioning, or clustering, the reduced feature set into classes of similar observations. The K-means algorithm belongs to the collection of multivariate methods used for classifying, or clustering, data and is presented because of its general applicability in classification problems (Therrien, 1989).

5.1 Feature Selection Using the Sequential Forward Search Algorithm

The SFS algorithm is a bottom-up strategy for removing redundant or irrelevant features from the feature matrix (Devijver & Kittler, 1982). At each successive iteration the feature that produces the largest value of the selection criterion function $J$ is added to the current feature set. Given a set of candidate features $Y \subseteq R$, a subset $X \subseteq R$ is selected without significant degradation to the classification system (Jain & Zongker, 1997). The best subset

$$X = \{ x_i \mid i = 1, 2, \ldots, d,\ x_i \in Y \}, \tag{16}$$

of $d$ features, where $d \le D$, is selected from the set

$$Y = \{ y_j \mid j = 1, 2, \ldots, D \}, \tag{17}$$

by optimising the criterion function $J$, chosen here to be the estimated minimum probability of error. For the set of measurements taken from $Y$, ideally the probability of correct classification, with respect to any other combination, is given by

$$\Xi = \{ \xi_i \mid i = 1, 2, \ldots, d \}. \tag{18}$$

It follows that the minimum probability of error for the space spanned by $\Xi$, for each class $\omega_i$, is defined as

$$E(\Xi) = \int \left[ 1 - \max_i \bigl( P(\omega_i \mid \xi) \bigr) \right] p(\xi)\, d\xi, \tag{19}$$

and the desired criterion function is

$$J(X) = \min\bigl( E(\Xi) \bigr). \tag{20}$$

One of the disadvantages of the SFS approach is that it may suffer from nesting. That is, because features selected and included in the feature subset cannot be removed, already selected features determine the course of the remaining selection process. This is hazardous because after further iterations a feature may become superfluous. Another limitation of the SFS approach is that in the case of two feature variables, which alone provide little discrimination but together are very effective, the SFS approach may never detect this combination. To overcome this problem it is useful to start with a full set of available features and eliminate them one at a time. This is the method adopted by the SBS approach described in section 5.2.

5.2 Feature Selection Using the Sequential Backward Search Algorithm

The SBS is a top-down approach, which starts with the complete feature set and removes one feature at each successive iteration (Devijver & Kittler, 1982). The feature chosen for removal is the one whose removal results in the smallest reduction in the value of the selection criterion function. In general, the SBS algorithm requires more computation than the SFS algorithm because it initially evaluates the criterion on the complete feature set, so the subsets it considers are larger. Although the SBS overcomes some of the difficulties of the SFS approach, the resulting feature subset is not guaranteed to be optimal. Furthermore, like its counterpart, the SBS algorithm suffers from nesting because once a feature is discarded it cannot be re-selected. Implementation of the SBS approach is analogous to the SFS approach detailed in section 5.1. The SBS algorithm is computationally more expensive than the SFS algorithm; however, their performance is comparable. Despite
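The two greedy searches of sections 5.1 and 5.2 can be illustrated with a minimal Python sketch. It is not the implementation used in this chapter. It assumes the criterion is a score to be maximised (for example, one minus an estimated probability of error), and the function names and the toy criterion are hypothetical, chosen only for illustration. The toy criterion reproduces the case where two features are weak alone but effective together.

```python
from typing import Callable, FrozenSet, List

# Assumption: the criterion maps a subset of feature indices to a score
# where larger is better (e.g. 1 - estimated probability of error).
Criterion = Callable[[FrozenSet[int]], float]


def sequential_forward_search(n_features: int, d: int, criterion: Criterion) -> List[int]:
    """Bottom-up: start empty and add the best single feature at each step."""
    selected: List[int] = []
    remaining = set(range(n_features))
    while len(selected) < d:
        # Add the candidate that gives the largest criterion value.
        best = max(sorted(remaining),
                   key=lambda f: criterion(frozenset(selected + [f])))
        selected.append(best)
        remaining.remove(best)  # nesting: a selected feature is never dropped
    return selected


def sequential_backward_search(n_features: int, d: int, criterion: Criterion) -> List[int]:
    """Top-down: start with all features and remove one at each step."""
    selected = list(range(n_features))
    while len(selected) > d:
        # Remove the feature whose removal leaves the highest criterion value,
        # i.e. causes the smallest reduction.
        to_drop = max(selected,
                      key=lambda f: criterion(frozenset(x for x in selected if x != f)))
        selected.remove(to_drop)  # nesting: a removed feature never returns
    return selected


def toy_criterion(subset: FrozenSet[int]) -> float:
    """Hypothetical score: features 0 and 1 are weak alone but strong together."""
    individual = {0: 1, 1: 1, 2: 5, 3: 3}
    score = sum(individual[f] for f in subset)
    if 0 in subset and 1 in subset:
        score += 10  # joint bonus that a forward search cannot see early on
    return float(score)


if __name__ == "__main__":
    print("SFS:", sequential_forward_search(4, 2, toy_criterion))
    print("SBS:", sequential_backward_search(4, 2, toy_criterion))
```

With this toy criterion and d = 2, the forward search selects features 2 and 3 and never finds the pair 0 and 1, whereas the backward search retains that pair. Both searches remain greedy, so neither is guaranteed to return the optimal subset in general.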