Vol. XLI No. 26 June 25, 2017

Population Genetics Confirm Historians and Confound RSS Theories

Prabir Purkayastha

A RECENT paper on a genome wide study of the South Asian population (A genetic chronology for the Indian Subcontinent points to heavily sex-biased dispersals: BMC Evolutionary Biology, March 2017) strikingly confirms the work of historians, archaeologists and historical linguists. Data shows that there were indeed significant migrations from Eurasian steppes into South Asia, starting from 3,500 years ago. The South Asian population – current Indian, Pakistani, Bangladeshi, Nepalese, and Sri Lankan population – have a significant central Asian component in their genetic make-up. The Indo Aryan (Indic) group of speakers have a higher signature of Central Asian genes than the corresponding Dravidian speakers; and the signatures are significantly higher in the male population. Incidentally, this signature in South Asia, is the highest in Pakistan and Gujarat.

Why should there be even a dispute about the origin of the Indo Aryan group of languages? Or immigration from outside South Asia? After all, the much larger language group of which Indo Aryan is part, is Indo European. If a group of languages have close affinities, it is obvious that they originate from a common proto language. So why this passion in denying that Indo Aryan group of languages have an origin, which could conceivably be outside India?

Here is the problem for the Hindutva ideologues. For those who believe that land is the origin of the nation – punya bhumi (matri bhumi or pitri bhumi) than the Hindu nation, as defined by Savarkar, Golwalkar and others, simply have to originate in India. If it does not, characterising all other religions as non Indian would not be tenable. Hinduism or the Vedic religion, is then simply another foreign incursion into India, and as non Indian, as Islam or Christianity. At best an early mover, but by no means original. The whole intellectual edifice of Hindutva would be exposed for what it is – a house of cards. That is why the venom against historians, who “unfortunately” have based their research on actual data and not on myth.



We have reported (History, Genetics and Statistics, Peoples Democracy, November 18, 2012) some of the earlier genetic studies in these columns. They showed that there is an Ancestral North Indian (ANI) population that differs somewhat, particularly in the male population, from an Ancestral South Indian (ASI) population. ANI has a larger proportion of Central Asian genetic markers than the ASI population. They had also shown that “upper” castes or caste Hindus have a higher component of the ANI ancestry.

The new study has confirmed the broad conclusions of the earlier studies, but has fleshed this out in much more details. It has also looked, not only at the issue of Eurasian steppe migration into South Asia, but two other migrations. One is the “modern” human migration from Africa into South Asia. This now can be dated to be around 50,000 years ago, and followed a coastal route from Africa. The second is when did the neolithic farmers arrive in South Asia (Mehrgarh, Baluchistan) and from where. The evidence indicates that such a migration of neolithic farmers took place from the Fertile Crescent (modern day Syria and Iraq) through Iran, around 9,000 years back. This is also supported by the genetic makeup of the agricultural crops that entered South Asia then, and ancient DNA found in the burial sites.

Though the migration of people with Central Asian genes into South Asia started 3,500 years back, it did not occur at one go. There seems to have been a continuous influx of people carrying such genetic material over a millennia. The timing of the migration into South Asia and its extent, also fits into the larger picture of Eurasian migrations. Apart from the major neolithic expansion because of agriculture, the next big shift was domestication of horses and the use of chariots in the Eurasian steppes. Historians had already identified the area between the Black Sea and the Caspian Sea as the “home” of the proto Indo European language speakers based on archaeological and other data. The genetic data has confirmed their conclusions. While one arm of the migration from Eurasian steppes went towards Europe, the other was towards South Asia. 

Whether it is the near total domination of the Mongols of the Eurasian land mass 800 hundred years back, or the German blitzkrieg in the Second World War, mobility is much more important in war than in peace time. Therefore the argument that the expansion of the horse riding Indo European speakers into Europe and large parts of Asia, was much more of a military domination than a cultural one. While it is not possible to conclusively establish this through genetics, the marked difference between the male genetic makeup from the female one, (and the genetic composition of various caste groups in South Asia), would support the theory of military domination.


Each of us carry about 23 pairs of chromosomes and a total of 19,000-20,000 genes. This may not be very large. The problem is that each of these genes have a very large number of base pairs that constitutes their genetic code. The total number of such base pairs is about 3 billion, and this is what makes genome wide studies so difficult. Each of us have slight differences in the genetic code: they vary across locations and in the actual code itself. By studying the changes in the inherited code over different population samples, we can see genetic drifts in the larger population. We also know the rate at which genetic changes or mutations take place, and therefore when population bifurcations could have occurred.

What has changed the study of population genetics is the introduction of biochips or microarrays that allow for screening of a large number of DNA segments simultaneously. These tools can be used to check out the variations in the genetic makeup of different populations. Once we know the variations, it is possible to look at how far back they might have occurred, what are the direction of their flows, and therefore the possible migration paths.

The change that these biochips have brought in is the sheer quantity of data. We now have much larger databases to analyse than the earlier methods that looked for mutations at a small number of locations. Today, we simultaneously process a much larger number of such locations. The sheer amount of data is an order – or several orders – of higher magnitude.

The consequence of creating such a large amount of data, is that analysing such data sets manually or using even spreadsheets, is impossible. Various statistical tools have to be used to tease out the actual relations that are present in the data. This is the other major change that is taking place in population studies. These tools – from biochips to statistical programmes – require large, cross disciplinary teams to undertake research.


The above study conducted by a large group of researchers, most of them in public universities in Portugal, has been supported by grants from European Fund for Economic and Regional Development and from the Portuguese Foundation for Science and Technology. The lead author, Marina Silva, is a doctoral student in Huddersfield School of Applied Sciences.

Clearly, good quality research is possible, if there is a willingness to work on serious science and supported by public institutions. Contrast this with the approach of the current Head of the Indian Council of Historical Research (ICHR), YS Rao. He advocates that only Vedas, Ramayana and Mahabharata, should be used as evidence; all other evidence –  other texts, linguistics, archaeology and now genetics –  are to be disregarded. If, for example, carbon dating of artefacts shows that the Vedic Age is 3,500 years old, that evidence is to be dismissed. So also presumably genetic evidence of the kind showing significant incursion of Eurasian genes around 3,500 years ago.

The gutting of research bodies in India and its leading institutions – Indian Council of Historical Research and Jawaharlal Nehru University – is not simply an attack on its political and social ethos, but also an attack on serious research. Genetics, mathematics and history have to come together to make sense of our past. Unlike Non-science or nonsense of the YS Rao variety.