Day 26

Principal Components

Given data matrix $X$ with $N$ rows (samples) and $k$ columns (features) – assume each feature has mean zero.
The matrix $Q=\frac{1}{N}X^{\intercal}X$ is symmetric and its entries are the variances/covariances.
If $v$ is a vector, then $Xv$ is called a “score” – a synthetic measure of the data.
The variance of the score is $v^{\intercal}Qv$.
Critical points of variance are eigenvectors of $Q$.
These critical directions are called “principal components”.