Incorporating spatial data to inform studies of the population demography of a species has a long history of interest. From inferring geographical clines in Principal Components Analyses (Menozzi et al. 1978), using location data as “informative priors” during model-based estimation of admixture (Hubisz et al. 2009), using phylogenetic trees (and other distance based methods) and superimposing them upon geographical distributions to make predictions about what has come to be known as “phylogeography” of a species, measuring the correlation between geographic distance matrices and genetic distance matrices (sensu Peakall and Smouse 1999), estimating spatial autocorrelation (eg. Moran’s I, correllograms – see Brian Epperson’s book for an excellent review, also Sokal and Oden 1991) to discover directional clines in the genetic-spatial distribution of a species, pruning variance-covariance matrices in genetic data using graph-theoretical/network algorithms to discover geographical-genetic structure, detecting differences in allele frequency spectra of populations to detect founder effects and range expansions (see Peter and Slatkin 2013), just to name a few.
At the core of all these methods is the variance-covariance structure in the genetics (primarily observed and/or ancestral allele frequency distribution), and the apparent geographical distribution of the species. Continue reading