P5: Genetically defined metapopulations

Metapopulations are notoriously difficult to define in a straightforward manner. Sometimes it is done by ancestral characteristics (e.g. language).

Roewer et al. 2005 [PubMed] [DOI] and Andersen et al. 2014 [PubMed] [DOI] have demonstrated two different methods to perform cluster analysis on Y-STR data. In this project we aim to repeat such analyses with the aim of being able to define metapopulations entirely by the genetic information.

We will use the following subset of slowly mutating loci with the indicated approximate mutation rates:

  • DYS438 : 4 in 10,000
  • DYS392 : 5 in 10,000
  • DYS393 : 11 in 10,000
  • DYS437 : 12 in 10,000
  • DYS643 (1) : 14 in 10,000
  • DYS448 (2) : 15 in 10,000
  • DYS390 : 21 in 10,000
  • DYS19 : 22 in 10,000

(1) only available in the Y17 dataset (289,405 profiles)

(2) only available in the Y23 dataset (103,280).

The rest of the loci are available in the Y12 dataset (309,090).

We request the Y17 dataset (289,405 profiles) for the above loci (except DYS643) and the Y23 dataset for the above loci. That is, we request all available profiles, but restricted to the loci listed above.


  • Tóra Oluffa Stenberg Olsen (Aalborg University, Denmark)
  • Mikkel Meyer Andersen (Aalborg University, Denmark)
  • Amke Caliebe (Kiel University, Germany)
  • James Curran (Auckland University, New Zealand)


Mikkel Meyer Andersen

Date of objection

Expired at June, 22nd, 2023

* See FAQ/Glossary (http://yhrd.org/pages/faq) for further explanations of abbreviated terms used here