Abstract
The Roma, also known as âGypsiesâ, represent the largest and the most widespread ethnic minority of Europe. There is increasing evidence, based on linguistic, anthropological and genetic data, to suggest that they originated from the Indian subcontinent, with subsequent bottlenecks and undetermined gene flow from/to hosting populations during their diaspora. Further support comes from the presence of Indian uniparentally inherited lineages, such as mitochondrial DNA M and Y-chromosome H haplogroups, in a significant number of Roma individuals. However, the limited resolution of most genetic studies so far, together with the restriction of the samples used, have prevented the detection of other non-Indian founder lineages that might have been present in the proto-Roma population. We performed a high-resolution study of the uniparental genomes of 753 Roma and 984 non-Roma hosting European individuals. Roma groups show lower genetic diversity and high heterogeneity compared with non-Roma samples as a result of lower effective population size and extensive drift, consistent with a series of bottlenecks during their diaspora. We found a set of founder lineages, present in the Roma and virtually absent in the non-Roma, for the maternal (H7, J1b3, J1c1, M18, M35b, M5a1, U3, and X2d) and paternal (I-P259, J-M92, and J-M67) genomes. This lineage classification allows us to identify extensive gene flow from non-Roma to Roma groups, whereas the opposite pattern, although not negligible, is substantially lower (up to 6.3%). Finally, the exact haplotype matching analysis of both uniparental lineages consistently points to a Northwestern origin of the proto-Roma population within the Indian subcontinent.
Similar content being viewed by others
Log in or create a free account to read this content
Gain free access to this article, as well as selected content from this journal and more on nature.com
or
Accession codes
Change history
24 December 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41431-021-01020-7
References
Fraser A : The Gypsies. Blackwell Publishers: Oxford, 1992.
Kenrick D : Historical Dictionary of the Gypsies. The Scarecrow Press, Inc.: Lanham, Maryland, 2007.
Hancock I : The emergence of Romani as a koïné outside of Indiain. Acton T (ed): Scholarship and Gypsy Struggle: Commitment in Romani Studies. University of Hertfordshire Press: Hatfield, 1987, pp 1â13.
Kalaydjieva L, Calafell F, Jobling MA et al: Patterns of inter- and intra-group genetic diversity in the Vlax Roma as revealed by Y chromosome and mitochondrial DNA lineages. Eur J Hum Genet 2001; 9: 97â104.
Kalaydjieva L, Gresham D, Calafell F : Genetic studies of the Roma (Gypsies): a review. BMC Med Genet 2001; 2: 5â5.
Mendizabal I, Lao O, Marigorta UM et al: Reconstructing the population history of European Romani from genome-wide data. Curr Biol 2012; 22: 2342â2349.
Mendizabal I, Valente C, Gusmao A et al: Reconstructing the Indian origin and dispersal of the European Roma: a maternal genetic perspective. PLoS One 2011; 6: e15988.
Moorjani P, Patterson N, Loh P-R et al: Reconstructing Roma history from genome-wide data. PLoS One 2013; 8: e58633.
Morar B, Gresham D, Angelicheva D et al: Mutation history of the Roma/Gypsies. Am J Hum Gene 2004; 75: 596â609.
Marushiakova E, Popov V : Gypsies in the Ottoman Empire. University of Hertfordshire Press: Hatfield, 2001.
Horváthová E . Cigáni na Slovensku (Gypsies in Slovakia). Bratislava: Vydavatelstvo Slovenskej Akadémie Vied, 1964.
Liégeois JP : Roma, Gypsies, Travellers. Council of Europe Press: Strasbourg, 1994.
Marushiakova E, Popov V : Roma identities in Central, Southeastern and Eastern Europe. In: Kyuchukov H, Rawashdeh O (ed): Roma Identity and Anti-Gypsyism in Europe. Lincoln: München, 2013, pp 40â52.
Kalaydjieva L, Morar B, Chaix R, Tang H : A newly discovered founder population: the Roma/Gypsies. BioEssays 2005; 27: 1084â1094.
Morar B, Azmanov DN, Kalaydjieva L : Roma (Gypsies): genetic studies. eLS 2013.
Mendizabal I, Lao O, Marigorta UM, Kayser M, Comas D : Implications of population history of European Romani on genetic susceptibility to disease. Hum Hered 2013; 76: 194â200.
Gresham D, Morar B, Underhill PA et al: Origins and divergence of the Roma (Gypsies). Am J Hum Gene 2001; 69: 1314â1331.
Gusmao A, Valente C, Gomes V et al: A genetic historical sketch of European gypsies: the perspective from autosomal markers. Am J Phys Anthropol 2010; 141: 507â514.
Azmanov DN, Dimitrova S, Florez L et al: LTBP2 and CYP1B1 mutations and associated ocular phenotypes in the Roma/Gypsy founder population. Eur J Hum Genet 2011; 19: 326â333.
Bouwer S, Angelicheva D, Chandler D, Seeman P, Tournev I, Kalaydjieva L : Carrier rates of the ancestral Indian W24X mutation in GJB2 in the general gypsy population and individual subisolates. Genet Test 2007; 11: 455â458.
Gusmao A, Gusmao L, Gomes V et al: A perspective on the history of the Iberian gypsies provided by phylogeographic analysis of Y-chromosome lineages. Ann Hum Genet 2008; 72: 215â227.
Pamjav H, Zalan A, Beres J, Nagy M, Chang YM : Genetic structure of the paternal lineage of the Roma people. Am J Phys Anthropol 2011; 145: 21â29.
Gomez-Carballa A, Pardo-Seco J, Fachal L et al: Indian signatures in the Westernmost edge of the European Romani diaspora: new insight from mitogenomes. PLoS One 2013; 8: e75397.
Malyarchuk BA, Grzybowski T, Derenko MV, Czarny J, Miscicka-Sliwka D : Mitochondrial DNA diversity in the Polish Roma. Ann Hum Genet 2006; 70: 195â206.
Malyarchuk BA, Perkova MA, Derenko MV, Vanecek T, Lazur J, Gomolcak P : Mitochondrial DNA variability in Slovaks, with application to the Roma origin. Ann Hum Genet 2008; 72: 228â240.
Chandrasekar A, Kumar S, Sreenath J et al: Updating phylogeny of mitochondrial DNA macrohaplogroup M in India: dispersal of modern human in South Asian corridor. PLoS One 2009; 4: e7447.
Sun C, Kong QP, Palanichamy MG et al: The dazzling array of basal branches in the mtDNA macrohaplogroup M from India as inferred from complete genomes. Mol Biol Evol 2006; 23: 683â690.
Nagy M, Henke L, Henke J et al: Searching for the origin of Romanies: Slovakian Romani, Jats of Haryana and Jat Sikhs Y-STR data in comparison with different Romani populations. Forensic Sci Int 2007; 169: 19â26.
Volgyi A, Zalan A, Szvetnik E, Pamjav H : Hungarian population data for 11 Y-STR and 49 Y-SNP markers. Forensic Sci Int Genet 2009; 3: E27âE28.
Zalan A, Beres J, Pamjav H : Paternal genetic history of the Vlax Roma. Forensic Sci Int Genet 2011; 5: 109â113.
Martinez-Cruz B, Ioana M, Calafell F et al: Y-chromosome analysis in individuals bearing the Basarab name of the first dynasty of Wallachian Kings. PLoS One 2012; 7: e41803.
Martinez-Cruz B, Ziegle J, Sanz P et al: Multiplex single-nucleotide polymorphism typing of the human Y chromosome using TaqMan probes. Invest Genet 2011; 2: 13.
Martinez-Cruz B, Harmant C, Platt DE et al: Evidence of pre-Roman tribal genetic structure in Basques from uniparentally inherited markers. Mol Biol Evol 2012; 29: 2211â2222.
Karafet TM, Mendez FL, Meilerman MB, Underhill PA, Zegura SL, Hammer MF : New binary polymorphisms reshape and increase resolution of the human Y chromosomal haplogroup tree. Genome Res 2008; 18: 830â838.
Haak W, Balanovsky O, Sanchez JJ et al: Ancient DNA from European early Neolithic farmers reveals their near Eastern affinities. PLoS Biol 2010; 8: e1000536.
Kloss-Brandstaetter A, Pacher D, Schoenherr S et al: HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups. Hum Mutat 2011; 32: 25â32.
van Oven M, Kayser M : Updated Comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum Mutat 2009; 30: E386âE394.
Excoffier L, Laval G, Schneider S : Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evol Bioinformatics 2005; 1: 47â50.
Hurles ME, Nicholson J, Bosch E, Renfrew C, Sykes BC, Jobling MA : Y chromosomal evidence for the origins of Oceanic-speaking peoples. Genetics 2002; 160: 289â303.
Bandelt HJ, Forster P, Rohl A : Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 1999; 16: 37â48.
Drummond AJ, Rambaut A : BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol 2007; 7: 214.
Soares P, Ermini L, Thomson N et al: Correcting for purifying selection: an improved human mitochondrial molecular clock. Am J Hum Gene 2009; 84: 740â759.
Rambaut A, Suchard MA, Xie D, Drummond AJ : Tracer v1.6. http://beast.bio.ed.ac.uk/Tracer, 2014.
Sengupta S, Zhivotovsky LA, King R et al: Polarity and temporality of high-resolution Y-chromosome distributions in India identify both indigenous and exogenous expansions and reveal minor genetic influence of central Asian pastoralists. Am J Hum Gene 2006; 78: 202â221.
Dubut V, Murail P, Pech N, Thionville MD, Cartault F : Inter- and extra-Indian admixture and genetic diversity in Reunion Island revealed by analysis of mitochondrial DNA. Ann Hum Genet 2009; 73: 314â334.
Mendizabal I, Sandoval K, Berniell-Lee G et al: Genetic origin, admixture, and asymmetry in maternal and paternal human lineages in Cuba. BMC Evol Biol 2008; 8: 213.
Semino O, Magri C, Benuzzi G et al: Origin, diffusion, and differentiation of Y-chromosome haplogroups E and J: inferences on the neolithization of Europe and later migratory events in the Mediterranean area. Am J Hum Genet 2004; 74: 1023â1034.
Mallory JP, Adams DQ : The Oxford Introduction to Proto-Indo-European and the Proto-Indo-European World. Oxford University Press: Oxford, United Kingdom, 2006.
Regueiro M, Rivera L, Chennakrishnaiah S et al: Ancestral modal Y-STR haplotype shared among Romani and South Indian populations. Gene 2012; 504: 296â302.
Acknowledgements
We thank all the DNA donors who made this study possible. We are really grateful to the Banco Nacional de ADN (www.bancoadn.org) for providing part of the Spanish samples, and to Judit Béres for providing some Hungarian (Vlax and Romungro) samples, and Melinda Nagy for Slovakian Roma samples. We also thank Paula Sanz, Maria López-Valenzuela, Mònica Vallés, and the Genomic Core Facility at the UPF for their valuable technical help and advice. This study was partly supported by the Spanish Ministerio de EconomÃa y Competitividad Grant CGL2013-44351-P.
Author information
Authors and Affiliations
Consortia
Corresponding author
Ethics declarations
Competing interests
The authors declare no conflict of interest.
Additional information
The Genographic Consortium includes
Li Jin, Hui Li, and Shilin Li (Fudan University, Shanghai, China); Pandikumar Swamikrishnan (IBM, Somers, NY, USA); Asif Javed, Laxmi Parida, and Ajay K Royyuru (IBM, Yorktown Heights, NY, USA); R John Mitchell (La Trobe University, Melbourne, VIC, Australia); Pierre A Zalloua (Lebanese American University, Chouran, Beirut, Lebanon); Syama Adhikarla, Arun Kumar, Ganesh Prasad, Ramasamy Pitchappan, Arun Varatharajan Santhakumari, and Kavitha Valampuri (Madurai Kamaraj University, Madurai, Tamil Nadu, India); R Spencer Wells and Miguel G Vilar (National Geographic Society, Washington, DC, USA); Himla Soodyall (National Health Laboratory Service, Johannesburg, South Africa); Elena Balanovska and Oleg Balanovsky (Research Centre for Medical Genetics, Russian Academy of Medical Sciences, Moscow, Russia); Chris Tyler-Smith (The Wellcome Trust Sanger Institute, Hinxton, UK); FabrÃcio R Santos (Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil); Jaume Bertranpetit, Marc Haber, Marta Melé, and David Comas (Universitat Pompeu Fabra, Barcelona, Spain); Christina J Adler, Alan Cooper, Clio SI Der Sarkissian, and Wolfgang Haak (University of Adelaide, SA, Australia); Matthew E Kaplan and Nirav C Merchant (University of Arizona, Tucson, AZ, USA); Colin Renfrew (University of Cambridge, Cambridge, UK); Andrew C Clarke and Elizabeth A Matisoo-Smith (University of Otago, Dunedin, New Zealand); Jill B Gaieski and Theodore G Schurr (University of Pennsylvania, Philadelphia, PA, USA).
Supplementary Information accompanies this paper on European Journal of Human Genetics website
The original online version of this article was revised.
Access to the supplementary Tables in this article has been removed because of concerns that the data presented could be potentially identifiable.
Rights and permissions
About this article
Cite this article
MartÃnez-Cruz, B., Mendizabal, I., Harmant, C. et al. Origins, admixture and founder lineages in European Roma. Eur J Hum Genet 24, 937â943 (2016). https://doi.org/10.1038/ejhg.2015.201
Received:
Revised:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1038/ejhg.2015.201
This article is cited by
-
Europeâs Roma people are vulnerable to poor practice in genetics
Nature (2021)
-
Phylogeographic review of Y chromosome haplogroups in Europe
International Journal of Legal Medicine (2021)
-
Sex-biased patterns shaped the genetic history of Roma
Scientific Reports (2020)
-
âDo my Roma and non-Roma patients need different care?â A brief step-by-step guideline for clinical practitioners
International Journal of Public Health (2019)
-
Tau haplotypes support the Asian ancestry of the Roma population settled in the Basque Country
Heredity (2018)