Friday 5 February 2016

Y-chromosomal haplogroup R1a - PART ONE.



Y-chromosomal haplogroup R1a - PART ONE.

Some of you have probably wondered why it is named like that and how the “fun” has started. In other words – what’s the genesis behind an R1a as an abstract relationship of letters and number. It will need a necessary review of background before final move to the point. 
Most of us have already met the developed form, to put it like that. We are tested using at least 12(+2) Y-STRs (Y-37 is a standard) and given prediction or genotyped using custom chip. Some of us even had a pleasure to undergo a full genome research or form of sequencing reserved for Y-Chromosome. Genetic industry is literally developing before our eyes. But…it had to begin someday.


It shouldn’t be especially difficult to explain, because timeline of genetic genealogy from perspective of Y-Chromosome is relatively brief. Do you want to hear some history? Here we go.

Y-Chromosome. It doesn't look so pretty in reality ;)
         Y-Chromosome is an inherent element of a male genetics and since it was discovered in 19051, it’s role has always been within a scope of interest of many people from scientific world. We had to wait few decades until new perspectives were opened after discovery of DNA. It turned out that DNA located on Y-Chromosome (aside pseudo-autosomal region of 3Mbp length2) doesn’t recombine and therefore, may be used for research on paternity. And soon, scientists started to seek for relatedness of males, this time using microsatellite polymorphisms. Polymorphism needs to be explained.  Let’s use a definition made by Cavalli-Sforza and Bodmer (1971). Genetic polymorphism is the occurrence in the same population of two or more alleles at one locus, each with appreciable frequency"(minimum - 1%3). Basically speaking, POLYMORPHISM means an existence of differences in the DNA of population. We can distinguish two types of polymorphisms, which are of our use – microsatellite and single-nucleotide ones (SNPs).  There were two reasons that prevailed in the initial success of microsatellite polymorphisms. First – microsatellite polymorphism can be read by simple measuring its length, counted in base pairs. Second reason was probably the most decisive one - one small set of chromosome locations can be measured ACROSS ALL MEN REGARDLESS OF HAPLOGROUPS. It is clear, isn’t it?

 Why it is called “micro”, then? DNA fragments looked for are really short (2-5 base pairs) and they are repeated, typically from 5 to 50 times4. They occur at various locations in human genome and aren’t limited to Y-Chromosome. Term “satellite”, on the other hand, originates from experiments with centrifugation of genomic DNA in a test tube – this process resulted with separating layer of bulk DNA from accompanying (thus: “satellite”) layers of repetitive DNA.

Microsatellite polymorphism when located on Y-Chromosome, is called Y-STR (STR - short tandem repeat, a term typically used by forensic geneticists). It took a lot of effort to find and select Y-STRs that were valuable enough for genetic genealogy. That’s because there is a lot of variance between them – with some microsatellites mutating too fast/too slow or with generally unstable character. It wasn’t easy to fish out the most worthy ones.
First was DYS19 (Roewer et al 19925), in the next row came YCAII a/b (Mathias et al 19946), DYS389 I/II, DYS390, DYS391, DYS392, DYS393 (Roewer et al 19967). In the year of 2000 and 2002 many useful markers from DYS4XX series were found.

After about 8 years from the first discovery, list of solid Y-STR markers was complete enough to sell it as a commercial product. Who was first? An American company Family Tree DNA (FTDNA) located in Houston, Texas. It was available for anyone who offered enough $. Do some of you still remember introductory prices?

Years have passed, number of testable markers grown and hence the accuracy – also the costs have lowered considerably over time.  Y-STRs generally allow to form clusters consisting of related males - especially in macro-scale, by which I mean a distant relationship.  Higher number of shared Y-STRs, particularly “innovations” (rare values not found elsewhere or combinations of such values) may indicate a close relationship between the tested. The problem begins when we want to evaluate loose relationship between totally random people or when we have a significant amount of data. Data that is coming from population study, for instance. The relevant question is -how to test hundreds of people and don’t get overwhelmed with results? Such huge amount of genetic data has to be orderly and Y-SNPs came here with support. This process resulted later with the division into named Y-haplogroups, like R1a, E, I1 etc. It will be explained at the second part of an entry, because it started to be quite lengthy.
We shouldn't be serious all the time...
 See you soon!

Sources: 



1. Brush, S G (Jun 1978). “Nettie M. Stevens and the discovery of sex determination by chromosomes". Isis; an international review devoted to the history of science and its cultural influences (United States) 69 (247):163–72. doi:10.1086/352001. ISSN 0021-1753. PMID 389882.
2. Mangs H, Morris.The Human Pseudoautosomal Region (PAR): Origin, Function and Future”. Curr Genomics. 2007 Apr; 8(2): 129–136. PMCID: PMC2435358
3Philip Hedrick. Genetics of Populations. Jones & Bartlett Learning. pp. 104–
 
ISBN 978-0-7637-5737-3
4. Turnpenny P, Ellard S (2005). Emery's Elements of Medical Genetics, 12th. ed. London: Elsevier.
5. Roewer L, Arnemann J, Spurr NK, Grzeschik KH and Epplen JT (1992). Simple repeat sequences on the human Y chromosome are equally polymorphic as their autosomal counterparts. Hum Genet 89: 389–394
6. Mathias N, Bayes M, Tyler-Smith C (1994) Highly informative compound haplotypes for the human Y chromosome. Hum Mol Genet 3: 115–123
7.  L Roewer, M Kayser, P Dieltjes, M Nagy, E Bakker, M Krawczak, P de Knijff. Analysis of molecular variance (AMOVA) of Y-chromosome-specific microsatellites in two closely related human populations. Hum Mol Genet, 5 (1996), pp. 1029–1033
 



 

16 comments:

  1. I have joined this Haplogroup, but I am lost as to what else to do, even after reading about it.Please help me. Thank you!

    ReplyDelete
    Replies
    1. Hello Kathy, I'm sorry for an inconvenience but it seems I wasn't notified about the comment at all.

      Have you tested at FTDNA, right? Please provide me with a kit number and I will try to answer your problem.

      Delete
  2. Hi Kathy,
    There is expertise in this group that goes far beyond mine (and that's not saying a lot), but I'd start here: You need to have a male family member who has been Y-DNA tested to get anything from this group, since Y-DNA only applies to males. Once you've got that, you can trace the ancestry through R, R1, R1a, and see where each subclade originated, and when, going back a few thousand years. In genealogy, it is mostly useful for determining who is not genetically related, who is, and how far back a common ancestor is likely to be found.

    ReplyDelete
  3. Hi Kathy,
    There is expertise in this group that goes far beyond mine (and that's not saying a lot), but I'd start here: You need to have a male family member who has been Y-DNA tested to get anything from this group, since Y-DNA only applies to males. Once you've got that, you can trace the ancestry through R, R1, R1a, and see where each subclade originated, and when, going back a few thousand years. In genealogy, it is mostly useful for determining who is not genetically related, who is, and how far back a common ancestor is likely to be found.

    ReplyDelete
  4. This is my dad's haplogroup 512 &198. What is the meaning that he got 2 ??
    second question, his family is part of the Jewish Levite group, does is shaw here?

    ReplyDelete
    Replies
    1. Hello, can you provide me with a kit number?

      Delete
    2. Unfortunately, I can't see such kit in the Project. Please join using this link: https://www.familytreedna.com/groups/r-1a/about :)

      Delete
  5. Although I am a surname Project Admin. I know very little about STR's and SNP's.
    There is the Farrar project kits 466114 and N11518. They share a most unusual (once in a thousand years mutation at DYS390=23. N11518 claims to share with me an ancestor born 1627 in Virginia, his family migrated to Macoupin/Green County, Ill before moving on. Kit 466124's appears to descend from an NPE, and their family also lived in Macoupin/greene County, Ill. They are both terminal Z93

    They are at a GD=2 to each other and a GD=3 to me, and other members of the project.

    Is this sufficient information to unequivocably state that both share the same ancestor, i.e. William Farrar born 1627 in Virginia?

    ReplyDelete
    Replies
    1. Hello,

      Only Big Y results would've given ineqivocal answers but it seems that independent development of such value as in DYS390 in otherwise closely matching people is highly unlikely.
      So - they most likely share the same ancestor, closer one than with other kits of Farrars, i.e. William Farrar.

      Anyway, we are very thankful for joining the kits to the R1a Project.

      Best

      Delete
  6. Hello, Artur.
    I did my Geno2 test, and uploaded results to family tree. My kit number N207698 .
    Geno2 put me in CTS1211
    I desiderata to do order z280 snp pack
    , but just realized that BigY is much better,
    But if I go for upgrades I don't see BigY option there , should I do y12 or y25 first to be able to do big Y? Or I should contact family tree and ask for big y?
    Thank you. Very exited about our R1a group!

    ReplyDelete
    Replies
    1. Hello Sasha.

      Such option was allowed previously, after a phone call to FTDNA, if I remember correctly. Unfortunately it changed and you can't do Big Y now without ordering at least y-12. That's what I've heard from a person who tried to order Big Y without Y-STR testing.

      Delete
  7. Hi Artur,
    My results were R-M198, which was a surprise to me, as we believed our family was from Scotland, maybe Wales. We are not able to trace our history beyond a male ancestor born in Pennsylvania in 1799. It looks like R1A in Scotland is considered Scandinavian, meaning my ancestors would have been Norman Vikings, not native Celts. However, based on our inability to find locate the ancestor who traveled to America, I'm not sure that he did not travel directly from Germany to Pennsylvania in the 1700's, or simply live in Scotland for a short period before migrating to America as many Germans immigrants to Pennsylvania had done. Is there any way to know from the test results whether my family settled Scotland hundreds of year ago as vikings or Saxons before coming to America, or whether they traveled more directly from Germany? My kit number is 502040. Thank you.

    ReplyDelete
    Replies
    1. Hello Kevin.

      Your results (predicted as M458>PF7521>L260)suggest ancient Slavic ancestry on a purely paternal side, what actually implies that your paternal line is much more likely to have been originally German. Many West Slavic people were germanised and then arrived as Germans in Pennsylvania, for example.

      L260 would've been really atypical for Scots and suggest German or Slavic background of a bearer. Scandinavians, on the other hand, belong mostly to a very different subbranch of R1a that is only distantly related to L260 - going back to the early Bronze Age ;).

      Delete
  8. Also my result came in R-M198 and it was surprise to me. especially that I am from Arab country. but as my little knowledge that M-198 is more that 10000 years old. my Kit number is 515351

    ReplyDelete
    Replies
    1. Hi,

      R1a is indeed one of the less frequent haplogroups on Arabian Peninsula, although it may have solid local representation. We predict that you may belong to subbranch of Z93, that is a typical kind of R1a to aforementioned region. I would've suggested ordering Big Y (if affordable) or R1a Backbone SNP Pack to help finding a proper subbranch you belong to. Don't be discouraged with lack of matches beyond 12 markers :).

      Delete