Skip to main content Principal Components
- Given data matrix $X$ with $N$ rows (samples) and $k$ columns (features) – assume each feature has mean zero.
- The matrix $Q=\frac{1}{N}X^{\intercal}X$ is symmetric and its entries are the variances/covariances.
- If $v$ is a vector, then $Xv$ is called a “score” – a synthetic measure of the data.
- The variance of the score is $v^{\intercal}Qv$.
- Critical points of variance are eigenvectors of $Q$.
- These critical directions are called “principal components”.