Spectral methods are widely used to reduce data dimensionality in order to enable a more effective use of several pattern recognition techniques such as clustering algorithms. Here we review the most popular spectral methods.
Consider a collection of sample points $ \{\vec{x_1},\vec{x_2},\cdots,\vec{x_n}\} $ where $ \vec{x_i} \in R^m $. We divide the methods in two categories:
- Outer Characteristics of the point cloud: These methods require the spectral analysis of a positive definite kernel of dimension m, the extrinsic dimensionality of the data.
- Inner characteristics of the point cloud: These methods require the spectral analysis of a positive definite kernel of dimension n, or the number of samples in the sample cloud. This happens because this set of methods explore the structure of pairwise similarities among all the data samples.
Outer Characteristics of the point cloud Methods
Inner characteristics of the point cloud Methods
Nonlinear Methods: When Data Embedded in Low-Dimensional Non-Linear Manifold
Let's consider the case when the data is embedded in a low-dimensional non-linear manifold. Because of the non-linear structure of the manifold, the traditional MDS_Old Kiwi won't capture the true structure of the data. Recently several methods have been developed to deal with such a non-linar configuration. Here we briefly review two of them: