Population structure

In this notebook, we run a principal components analysis (PCA) and build a neighbour-joining tree (NJT) from the amplicon sequencing variant data. For the PCA, we plot PC1 vs PC2 and PC3 vs PC4, along with the variance explained by each principal component (PC).

Variance explained

As a rule of thumb, once the variance explained by successive PCs begins to flatten out (the "elbow" of the plot), the remaining PCs are no longer informative.
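To make the rule of thumb concrete, here is a minimal sketch of how the variance explained per PC can be computed. It is not the notebook's own code: it assumes NumPy, a samples-by-variants matrix of alternate allele counts, and a simulated toy dataset (two groups of ten samples). The helper name `explained_variance_ratio` is hypothetical.

```python
import numpy as np


def explained_variance_ratio(genotypes: np.ndarray) -> np.ndarray:
    """Fraction of the total variance captured by each PC.

    genotypes: (n_samples, n_variants) matrix of alt-allele counts (0, 1, 2).
    """
    # Centre each variant (column) so the PCs describe variation around the mean.
    centered = genotypes - genotypes.mean(axis=0)
    # The squared singular values of the centred matrix are proportional
    # to the variance along each principal component.
    singular_values = np.linalg.svd(centered, compute_uv=False)
    variances = singular_values ** 2
    return variances / variances.sum()


# Simulated toy data (an assumption, not the amplicon data): two groups of
# samples with very different allele frequencies at 50 variants.
rng = np.random.default_rng(0)
group_a = rng.binomial(2, 0.1, size=(10, 50))
group_b = rng.binomial(2, 0.9, size=(10, 50))
genotypes = np.vstack([group_a, group_b]).astype(float)

ratio = explained_variance_ratio(genotypes)
for i, value in enumerate(ratio[:6], start=1):
    print(f"PC{i}: {value:.3f}")
```

In this toy case PC1 should dominate and the later PCs should form a flat tail, which is the pattern the rule of thumb describes.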

PCA
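As a rough illustration of where the plotted coordinates come from, the sketch below projects samples onto the first four PCs and splits them into the PC1 vs PC2 and PC3 vs PC4 pairs. It assumes NumPy and simulated genotypes, not the notebook's data or its PCA implementation. The function name `pca_coordinates` is hypothetical.

```python
import numpy as np


def pca_coordinates(genotypes: np.ndarray, n_components: int = 4) -> np.ndarray:
    """Project samples onto the first n_components principal components."""
    # Centre each variant, then take the thin SVD of the samples x variants matrix.
    centered = genotypes - genotypes.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    # Sample coordinates are the left singular vectors scaled by the singular values.
    return u[:, :n_components] * s[:n_components]


# Simulated toy data (an assumption): two populations of 10 samples each.
rng = np.random.default_rng(1)
pop_a = rng.binomial(2, 0.1, size=(10, 50))
pop_b = rng.binomial(2, 0.9, size=(10, 50))
coords = pca_coordinates(np.vstack([pop_a, pop_b]).astype(float))

# Column pairs corresponding to the two scatter plots.
pc1_pc2 = coords[:, [0, 1]]
pc3_pc4 = coords[:, [2, 3]]
print(pc1_pc2[:3])
```

Each pair of columns can be passed to any scatter-plot routine. With data this strongly structured, the two simulated populations should separate along PC1.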

NJT

excluding extreme outliers from NJT []
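For readers unfamiliar with neighbour joining, the following is a minimal pure-Python sketch of the standard algorithm applied to a small hand-made distance matrix. The notebook does not state which distance metric, outlier criterion, or library it uses, so everything here is illustrative: the function `neighbour_joining`, the internal node names (`node0`, `node1`, ...), and the toy distances are all assumptions.

```python
from itertools import combinations


def neighbour_joining(names, matrix):
    """Build an unrooted tree; returns a list of (node, node, branch_length) edges."""
    # Pairwise distances keyed by unordered pairs of node names.
    d = {frozenset((a, b)): matrix[i][j]
         for (i, a), (j, b) in combinations(enumerate(names), 2)}
    nodes = list(names)
    edges = []
    next_id = 0
    while len(nodes) > 2:
        n = len(nodes)
        # Total distance from each node to all others.
        r = {a: sum(d[frozenset((a, b))] for b in nodes if b != a) for a in nodes}
        # Join the pair minimising Q(i, j) = (n - 2) * d(i, j) - r(i) - r(j).
        i, j = min(combinations(nodes, 2),
                   key=lambda p: (n - 2) * d[frozenset(p)] - r[p[0]] - r[p[1]])
        dij = d[frozenset((i, j))]
        len_i = dij / 2 + (r[i] - r[j]) / (2 * (n - 2))
        len_j = dij - len_i
        u = f"node{next_id}"  # hypothetical label for the new internal node
        next_id += 1
        edges += [(u, i, len_i), (u, j, len_j)]
        # Distance from the new internal node to every remaining node.
        for k in nodes:
            if k not in (i, j):
                d[frozenset((u, k))] = (
                    d[frozenset((i, k))] + d[frozenset((j, k))] - dij) / 2
        nodes = [k for k in nodes if k not in (i, j)] + [u]
    # Connect the last two nodes directly.
    a, b = nodes
    edges.append((a, b, d[frozenset((a, b))]))
    return edges


# Toy distance matrix (illustrative only, not derived from the amplicon data).
names = ["a", "b", "c", "d", "e"]
matrix = [
    [0, 5, 9, 9, 8],
    [5, 0, 10, 10, 9],
    [9, 10, 0, 8, 7],
    [9, 10, 8, 0, 3],
    [8, 9, 7, 3, 0],
]
edges = neighbour_joining(names, matrix)
for parent, child, length in edges:
    print(f"{parent} - {child}: {length}")
```

Any samples flagged as extreme outliers would simply be dropped from `names` and `matrix` before calling the function. How such outliers are identified is left open here, since the source does not specify it.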