Correspondence Analysis of Mexican Discourses
This article is originally published at http://jean9208.github.io/
Correspondence analysis is a multivariate statistical technique that summarizes a set of categorical data in a two dimensional form. It’s like the equivalent of Principal Component Analysis but for categorical data.
Correspondence analysis is usually applied to contigency tables. In this post, we will apply it to a frequency matrix (term document matrix from bag of words representation).
The analysis can be done by row or by column. Below is an implementation of correspondence analysis, where row and column analysis are done at the same time.
Please visit source website for post related comments.