Genetic diversity#

Output Directory: results/variantAnalysis/diversity/

Rules

  • variantAnalysis.smk

    • SummaryStatistics

Introduction

Genetic diversity is an important concept in population genetics, and it refers to the amount of variation in the genetics of a population. This variation can take many forms, including single nucleotide polymorphisms (SNPs), small insertions or deletions (INDELs), and larger structural changes and rearrangememts.

One measure of genetic diversity is nucleotide diversity (aka pi), which is a measure of the average number of nucleotide differences per site between two randomly chosen genomes within a population. Another measure is Watterson’s theta, which is a measure of the average number of segregating sites within a population. These measures can provide valuable insights into the evolutionary history of a population.

In RNA-Seq-Pop, we calculate genetic diversity from the called SNPs using scikit-allel. As indel calling is unreliable from RNA-Seq data, we filter out any indel calls prior to variant analysis.

Results