Clustering, multicollinearity, and singular vectors
Published 7 Aug 2020 in cs.LG and stat.ML | (2008.03368v1)
Abstract: Let $A$ be a matrix with pseudo-inverse $A^{\dagger}$, and set $S = I - A^{\dagger}A$. We prove that, after re-ordering the columns of $A$, the matrix $S$ has a block-diagonal form in which each block corresponds to a set of linearly dependent columns. This allows us to identify redundant columns in $A$. We explore some applications in supervised and unsupervised learning, especially feature selection, clustering, and the sensitivity of least squares solutions.
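The block structure described in the abstract can be illustrated numerically. The following is a minimal sketch, not the paper's own procedure: it forms $S = I - A^{\dagger}A$ with NumPy and groups columns by the connected components of the nonzero pattern of $S$. The function name `dependency_blocks`, the thresholds `tol` and `rcond`, and the test matrix are all assumptions made for illustration.

```python
import numpy as np

def dependency_blocks(A, tol=1e-8, rcond=1e-10):
    """Hypothetical helper: group columns of A that share a linear dependency.

    Builds S = I - pinv(A) @ A (the orthogonal projector onto the null
    space of A) and returns S together with the connected components of
    its nonzero pattern. Columns whose row of S is (numerically) zero
    take part in no dependency and are left out.
    """
    n = A.shape[1]
    # rcond is an assumed cutoff for treating tiny singular values as zero.
    S = np.eye(n) - np.linalg.pinv(A, rcond=rcond) @ A
    adj = np.abs(S) > tol  # assumed threshold for "nonzero" entries
    seen, blocks = set(), []
    for start in range(n):
        # A zero diagonal entry of a projector implies a zero row.
        if start in seen or not adj[start, start]:
            continue
        stack, block = [start], []
        seen.add(start)
        while stack:  # depth-first search over the nonzero pattern
            i = stack.pop()
            block.append(i)
            for j in np.flatnonzero(adj[i]):
                j = int(j)
                if j not in seen:
                    seen.add(j)
                    stack.append(j)
        blocks.append(sorted(block))
    return S, blocks

# Illustrative data: columns 0, 1, 2 are dependent, and so are columns 3, 4.
rng = np.random.default_rng(0)
A = rng.standard_normal((6, 5))
A[:, 2] = A[:, 0] + A[:, 1]
A[:, 4] = 2.0 * A[:, 3]

S, blocks = dependency_blocks(A)
print(blocks)
```

In this constructed example the two dependencies involve disjoint sets of columns, so $S$ is already block-diagonal without any re-ordering, and each returned group lists the columns of one block. With noisy data the choice of `tol` and `rcond` matters and would need tuning.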