Whitening

Models make assumptions about the data for simplification
These assumptions may not be true for the initial data
Whitening is a preprocessing technique that makes such assumptions correct, or at least more reasonable
A common assumption models make is that data input components shares the same variance
This is not the case when using PCs (Principle Components) as features
Whitening scales the axes to make the covariance matrix the identity matrix (such that the variance is unit n-dimensional sphere)

The original variance of the PCs are $λ_{i}$ and therefore we must multiply each point by the square root (standard deviation) of the inverse (divide by the standard deviation) $λ_{i}^{- 0.5}$

Extending this to the matrix form yields:

Λ_{v}^{- \frac{1}{2}} = (\frac{Σ _{v}^{2}}{N})^{- \frac{1}{2}}

The transformed data is $Σ_{v} V_{v}^{T}$ such that the whitened data is now:

(\frac{Σ _{v}^{2}}{N})^{- \frac{1}{2}} Σ_{v} V_{v}^{T} = N V_{v}^{T}

📓 Daniel's Notes

Explorer

Whitening

Graph View

Backlinks