HAPLOGROUPS
It sometimes comes as a surprise to people when
they first receive their Y test results to discover that the haplogroups are
actually defined by a type of marker called unique event polymorphisms (UEPs) that are not normally tested by the commercial
laboratories because of the expense.
These biallelic (i.e., two-valued) markers are
also called Single Nucleotide Polymorphisms (SNP). As the term UEP implies, changes in these
markers occur (except very rarely) only once at a given Y-chromosome location
in human history (ergo, the “unique” in UEP), and a new one pops up about every
7000 years in the regions of interest on the Y-chromosome. A mutation involves the substitution of one
of the four subunits of DNA for another.
There are 249 known haplogroups and subgroups. Major haplogroups
are labeled (named) with the letters of the alphabet, while numerals and
lower-case letters are used to designate the subgroups. For example, the most common subgroup in
The type of DNA testing normally used for
genealogical studies, tests an entirely different set of markers, the short
tandem repeats (STR). These are short,
usually four-letter, sequences that are repeated between 8 and 36 times. The “value” at a particular marker or
location is simply the number of times that the sequence is repeated. Y-chromosome testing is normally done by the
commercial testing services on 12, 25, or 37 markers. The results of such testing is simply a set
of 12, 25, or 37 two-digit (or occasionally one-digit) numbers. The region where the STRs are tested is a
region of the Y-chromosome that has no biological function. It is thought to represent “junk DNA.” There are no medical or health issues
connected to these, other than paternity, so there are no more privacy issues
for this set of numbers than there is for the public awareness of ones surname.
Because of the extensive anthropological research
that has been done on the origin and spread of the various (SNP-defined)
haplogroups, there is intense interest among “genetic genealogists” to predict
a person’s haplogroup from the STR testing results. This is done on the basis of a database of
STR test results for which the haplogroup is known. Using the observed allele (test value)
frequency for the different markers, one can often determine which haplogroup
that a given individual belongs to. For
some of the rarer haplogroups, this can be rather uncertain because of the
scarcity of data.
It is very important to note that membership in a
particular haplogroup does not (by itself) indicate the ethnic group from which
the patrilineal line derives. There is a
lot of misinformation posted on the Internet in this regard. You can see such statements as “R1b means
Celtic,” or “I1a means Viking.” While
those two haplogroups are common in those two populations, they also occur in
every country in
A particular set of values for a set of STR or SNP
markers is termed a “haplotype.” The
repeat value of a particular STR marker is called an “allele,” and the
distribution of values for a marker within a given haplogroup is called the
allele frequency distribution.
Anthropological
information about the haplogroups mentioned above and presented in more detail
below, is partly from FTDNA (http://www.familytreedna.com):
R1b Haplogroup R1b is the most common haplogroup in European
populations. It is believed to have expanded throughout
The R1b group is defined by a
set of mutations that go back tens of thousands of years, some early and some
late. The following table shows the
steps, each one involving a UEP mutation, that led
from earliest humans to the origin of the R1b group in time and location:
|
Haplogroup with The defining Y Biallelic SNP Marker (in parentheses) |
Years Before Present |
Migration Route |
|
[no name](M94) |
? |
In |
|
[no name](M168) |
50,000 |
|
|
F(M89) |
45,000 |
|
|
K(M9) |
40,000 |
South |
|
P(M45) |
35,000 |
North |
|
R(M207) |
? |
In |
|
1(M173) |
30,000 |
|
|
b(P25) |
13,000? |
In Ice Age Enclave in |
R1a The R1a lineage is believed to have originated in the
Eurasian Steppes north of the Black and
I The I lineages (I1a,
I1b, and I1c) are common in northwestern
|
Haplogroup with The defining Y Biallelic SNP Marker (in parentheses) |
Years Before Present |
Migration Route |
|
[no name](M94) |
? |
In |
|
[no name](M168) |
50,000 |
|
|
F(M89) |
45,000 |
|
|
I(M170) |
? |
? |
|
1(P38) |
? |
? |
|
b(P37b) |
? |
? |
|
Haplogroup with The defining Y Biallelic SNP Marker (in parentheses) |
Years Before Present |
Migration Route |
|
[no name](M94) |
? |
In |
|
[no name](M168) |
50,000 |
|
|
[no name](M145) |
? |
? |
|
E(M96) |
? |
? |
|
3(P2) |
? |
? |
|
b(M35) |
? |
? |
|
Haplogroup with The defining Y Biallelic SNP Marker (in parentheses) |
Years Before Present |
Migration Route |
|
[no name](M94) |
? |
In |
|
[no name](M168) |
50,000 |
|
|
F(M89) |
45,000 |
|
|
G(M201) |
? |
|
J
Haplogroup
The J haplogroup is common
in the