Telugu DNA Ancestry: The Genetics of Andhra Pradesh and Telangana
Telugu speakers number over 80 million across Andhra Pradesh and Telangana, making them one of the largest ethnolinguistic groups in the world. The Telugu diaspora is substantial — among the largest Indian communities in the United States, the Gulf, and Southeast Asia. Yet Telugu genetics has received significantly less research attention than North Indian populations, partly because North Indian diasporas were the first large South Asian communities to engage with consumer DNA testing in the West, and partly because academic genomics has historically over-sampled from the same populations.
This is changing. Several large-scale South Asian genomic studies now include Telugu populations at meaningful sample sizes, and the picture that is emerging is both more nuanced and more interesting than "Dravidian" as a single category would suggest. Here is what the evidence actually shows.
The Telugu Genetic Profile: ANI/ASI Balance
Modern South Asian populations are best understood as mixtures of several ancestral components: Ancestral North Indian (ANI), which relates to Iranian Neolithic farmers and later Bronze Age steppe pastoralists; and Ancient Ancestral South Indian (AASI), the oldest component in the subcontinent, representing the deep roots of populations who were present before agriculture arrived from the west. A third component — steppe ancestry associated with Indo-Aryan migrations — overlaps significantly with ANI but can be distinguished in fine-grained analyses.
Telugu populations sit in the middle of the Indian genetic spectrum. They carry more AASI ancestry than most North Indian groups but less than some Dravidian populations further south, particularly certain Tamil non-Brahmin communities and tribal populations of the Western Ghats. The ANI/AASI balance in Telugu non-Brahmin populations is broadly comparable to that of Karnataka non-Brahmin communities.
Crucially, the average figures hide significant community-level variation. Telugu genetics is not monolithic. The difference between a Telugu Brahmin and a Telugu Dalit community in terms of AASI proportion can be as large as the difference between some North and South Indian populations.
Community-Level Genetic Profiles
Telugu Brahmins
Telugu Brahmins — including Niyogi and Vaidiki subgroups — carry substantially higher ANI ancestry and higher steppe-related ancestry than non-Brahmin Telugu populations. This is consistent with the pattern observed across all South Indian Brahmin communities: Telugu, Tamil, Kannada, and Malayali Brahmins all cluster together genetically and sit closer to North Indian populations than to non-Brahmin populations of their own linguistic community.
This genetic distinctiveness reflects the historical origin of Brahmin communities in the Deccan, which involved migration of communities with high steppe ancestry from the north — a process that occurred over centuries and resulted in the maintenance of high ANI/steppe proportions through endogamous marriage practices that have persisted for millennia.
Kamma
Kamma is one of the most studied non-Brahmin Telugu communities in population genomics. Kamma populations show relatively high AASI proportions compared to Brahmins, with moderate ANI, reflecting a genetic profile more typical of South Indian cultivator communities. Historically associated with the coastal Krishna-Guntur belt, Kamma genetic diversity reflects strong endogamy within a community that has maintained geographic continuity for centuries.
Reddy
Reddys are genetically closely related to Kamma populations and cluster together in most analyses. Both groups share a broadly similar ANI/AASI balance, reflecting their shared non-Brahmin South Indian agricultural background. Regional variation within the Reddy community is worth noting: coastal Andhra Reddys show slightly different profiles from Rayalaseema Reddys and Telangana Reddys, with the inland and highland populations sometimes showing subtly higher AASI proportions.
Kapu
Kapu is a large agricultural community with significant internal diversity — the term encompasses multiple subgroups across different regions of Andhra Pradesh and Telangana. The genetic profile is broadly consistent with other non-Brahmin Telugu agrarian communities, though with more regional variation than tightly endogamous communities like Kamma or Velama. Coastal Kapu communities show some signal of historical maritime contact — a subtle but detectable trace of Southeast Asian or island Southeast Asian ancestry that appears in some coastal Andhra populations generally.
Velama
Velama is historically a warrior and landowning community in Telangana and northern Andhra. Genetically, Velama populations show some variation depending on lineage: some Velama subgroups show signals of northern admixture consistent with historical military service under Deccan Sultanates and later the Hyderabad Nizam's court. The community has practised strong endogamy, resulting in distinct genetic substructure between different Velama groups that reflects their varying historical origins.
BC/OBC Communities
The Backward Classes and Other Backward Classes of Andhra Pradesh and Telangana encompass enormous diversity — artisan communities, pastoralist communities, fishing communities, and others. Some coastal fishing communities (particularly in the Krishna and Godavari delta regions) show higher proportions of a Southeast Asian-related ancestry component compared to inland communities, consistent with ancient maritime interaction across the Bay of Bengal.
Telugu genetics is not monolithic. The genetic distance between a Telugu Brahmin and a Dalit community from the same district can be comparable to the distance between some North and South Indian populations. Community-level ancestry breakdown — not a single "Telugu" category — is the level at which population genetics becomes genuinely informative for people from this region.
Y-DNA Haplogroups Common in Telugu Populations
Y-chromosome haplogroups are inherited through the direct paternal line and provide a window into ancient male migration history. The following haplogroups are most relevant for Telugu ancestry:
Haplogroup H
Haplogroup H is the oldest and deepest-rooted South Asian Y-DNA lineage. It is ancient — likely among the original Y-chromosome lineages of the hunter-gatherers who populated the subcontinent before agriculture. Haplogroup H is particularly common in South Indian non-Brahmin communities and in Scheduled Caste and Scheduled Tribe populations across Andhra Pradesh and Telangana. Carrying haplogroup H is a direct paternal connection to the most ancient wave of human settlement in South Asia.
Haplogroup R2
R2 (technically R1b1c or R-M479) is a haplogroup particularly associated with the Indian subcontinent — it originated in South Asia and is found at meaningful frequencies across Telugu communities. Unlike its close relative R1a (which is strongly associated with steppe ancestry), R2 appears to have expanded within India and reflects an older South Asian presence. It is present across various Telugu communities with some variation by caste.
Haplogroup J2
J2 is associated with Near Eastern Neolithic agricultural ancestry — the farming communities who expanded from Iran and the Fertile Crescent into South Asia in the Neolithic period. It is elevated in Telugu Brahmin populations compared to non-Brahmin Telugu groups, consistent with the higher Iranian Neolithic and ANI ancestry found in Brahmin populations generally. J2 is also present in some non-Brahmin Telugu communities, where it likely reflects older ANI-related ancestry.
Haplogroup L
Haplogroup L is another South Asian–specific paternal lineage with ancient roots in the subcontinent. It is found across Telugu communities and is particularly associated with certain artisan and trading communities. Haplogroup L has deep origins in the South Asian Neolithic and likely spread with agricultural expansion within the subcontinent.
Haplogroup F* and Basal Lineages
Tribal and Adivasi communities in the Godavari and Nallamalla forest regions of Andhra Pradesh often carry basal F* haplogroups and other ancient Y-DNA lineages that represent some of the oldest surviving male lineages in South Asia. These are among the most ancient human Y-chromosome lineages in the world, carried by communities who have maintained continuity with pre-Neolithic populations.
Maternal (mtDNA) Lineages
Mitochondrial DNA (mtDNA) traces the direct maternal line. The pattern of maternal lineages in Telugu populations reflects the deep indigenous roots of South Asian maternal ancestry:
- M haplogroup supergroup: The M macro-haplogroup and its South Asian subclades (M2, M3, M5, M6, and others) dominate maternal lineages across Telugu communities. These are ancient South Asian–specific lineages that expanded within the subcontinent after the initial modern human settlement, likely over 50,000 years ago. Carrying an M subclade lineage is a direct maternal connection to the earliest anatomically modern humans in South Asia.
- U2: U2, particularly the South Asian–specific U2a, U2b, and U2c subclades, represents another deep ancient South Asian maternal lineage that predates the Neolithic period. It is present across Telugu communities and is particularly informative as a marker of pre-farming South Asian ancestry.
- R haplogroup: Various R subclades (R5, R6, R7, R8, and others — the South Asian-specific ones, not to be confused with R in a European context) are present across Telugu communities and represent further layers of deep South Asian maternal diversity.
- West Eurasian lineages (H, J, T, K): Present at lower frequency in Telugu Brahmin communities and some upper-caste groups, reflecting the maternal contribution of ANI and steppe-associated populations. These are notably rarer in non-Brahmin Telugu communities compared to their frequency in North Indian populations.
Ancient Connections: Deccan History in the Genome
The ancient DNA record from the Deccan is sparse but growing. What has been recovered from Iron Age and early historical sites in the Telugu-speaking region is broadly consistent with population continuity — the genetic profile of the people who built and maintained the Megalithic culture of the Deccan shows strong similarity to present-day South Indian non-Brahmin populations. There is no evidence of major population replacement in the Telugu-speaking region during the historical period.
The Satavahana Empire (2nd century BCE to 3rd century CE) was the dominant political formation in Telugu-speaking lands during the early historical period. Satavahana-era sites in the Krishna-Guntur region have been excavated extensively, and while full ancient DNA analysis from these sites remains limited, what exists suggests the same basic South Indian genetic profile that characterises the region today. The Kakatiya Kingdom (1000–1323 CE) and the Vijayanagara Empire (1336–1646 CE), which were both centred partly in Telugu-speaking territories, represent the high medieval period — a time of architectural and cultural efflorescence that is now visible in the genetics primarily through endogamy-reinforced community structures rather than wholesale population change.
The ancient DNA evidence from the Deccan region is consistent with population continuity. Present-day Telugu communities are genetically connected to the populations who occupied this landscape through the Iron Age and Megalithic periods. The deep AASI ancestry that characterises Telugu non-Brahmin populations is a genomic signature of this long-term continuity.
The Telugu Diaspora and the Reference Panel Problem
The Telugu diaspora in the United States is large — Telugu speakers are estimated to be among the top two or three Indian linguistic communities in the US, with particularly high concentrations in the technology sector in California, Texas, and New Jersey. The Gulf diaspora is also substantial, particularly in the UAE, Saudi Arabia, Qatar, and Kuwait.
Despite this, Telugu people have consistently been underserved by mainstream consumer DNA testing companies. 23andMe, AncestryDNA, and MyHeritage all built their South Asian reference panels primarily from communities that were disproportionately represented in their early customer bases — mainly Punjabi, Gujarati, and other North Indian communities from the UK and US diaspora. This means that the algorithms these companies use to assign ancestry categories have far more reference data for North Indian populations than for Telugu, Kannada, or other South Indian groups.
The result: a large proportion of Telugu people who test with 23andMe receive a result of "Broadly South Asian" rather than a meaningful regional or community-level assignment. This is not a reflection of their actual ancestry — it is a failure of reference panel design. The algorithm cannot make a confident assignment because it does not have enough reference individuals from Telugu populations to calibrate against.
Helixline addresses this directly. The reference panel used for ancestry assignment is built specifically to distinguish South Indian communities at a finer level, incorporating reference populations drawn from Telugu communities across Andhra Pradesh and Telangana. The result is a community-level ancestry breakdown that actually reflects what Telugu ancestry looks like — including the distinction between Telugu Brahmin and non-Brahmin genetic profiles, and the regional variation within the non-Brahmin Telugu community.
What a Helixline Result Looks Like for a Telugu Person
A Helixline Origins result for someone of Telugu background provides:
- Community-level ancestry breakdown: Rather than a single "South Asian" bar, you see proportions assigned to specific reference populations — Telugu Brahmin, Telugu non-Brahmin, Tamil, Kannada, and others — reflecting the actual genetic structure of South Indian populations. The proportions are shown alongside the AASI/ANI/steppe component breakdown.
- Y-DNA haplogroup (paternal lineage): Full haplogroup identification including subclade — not just "H" but the specific subclade and its deep geographic and time depth context.
- mtDNA haplogroup (maternal lineage): Full maternal haplogroup with subclade identification and historical context for the lineage.
- Comparison to reference populations: Where you sit relative to other Telugu communities, other South Indian communities, and the broader South Asian spectrum — the genetic context that makes your individual result meaningful.
Discover your Telugu community ancestry at subclade level.
Community-level South Asian reference panel. Haplogroup identification.
Origins from ₹6,999 — cheek swab at home, results in 6–8 weeks.
Frequently Asked Questions
Are Kamma and Reddy genetically similar or distinct?
Kamma and Reddy communities are genetically closely related — they cluster together in population genetic analyses and share broadly similar ANI/AASI proportions reflecting their shared non-Brahmin Telugu agricultural background. That said, there is measurable subgroup variation within each community based on region: coastal Andhra Reddys differ somewhat from Rayalaseema Reddys, and northern Kamma populations show some differences from those in the Krishna-Guntur districts. These are subtle shifts within a shared genetic neighbourhood rather than the sharp divergences you see between Telugu Brahmins and non-Brahmin Telugu communities.
How does Telugu ancestry compare to Tamil ancestry genetically?
Telugu non-Brahmin populations sit between Tamil non-Brahmin populations and North Indian populations on the ANI/AASI spectrum. Tamil populations — particularly non-Brahmin Tamil communities — tend to carry higher proportions of AASI ancestry, reflecting deeper continuity with ancient South Indian populations. Telugu non-Brahmin communities carry somewhat more ANI ancestry on average than their Tamil counterparts, though with significant community-level and regional variation. Both are distinctly Dravidian in genetic profile and both sit at the higher-AASI end of the South Asian spectrum compared to most North Indian populations.
Why do Telugu people get "Broadly South Asian" from 23andMe but not from Helixline?
23andMe's South Asian reference panel is heavily weighted toward communities that are disproportionately represented in their customer base — primarily North Indian, Punjabi, and Gujarati populations from the large US and UK diaspora. Telugu, Kannada, and other South Indian populations are underrepresented relative to their actual genetic diversity, causing the algorithm to default to the "Broadly South Asian" category when it lacks sufficient reference individuals to make a confident subcategory assignment. Helixline's reference panel is built specifically to distinguish South Indian communities at a finer level, using reference populations that accurately represent Telugu genetic diversity.
Can a DNA test distinguish between Andhra Pradesh and Telangana ancestry?
Not reliably as distinct political entities — Telangana was carved from Andhra Pradesh only in 2014, and the ancestral populations of both states are genetically continuous rather than sharply divided. What a Helixline result can do is identify your community-level ancestry with reference to specific Telugu communities (Kamma, Reddy, Kapu, Velama, Telugu Brahmin, etc.) and regional clustering within the Telugu genetic space. This is more meaningful than a state-level assignment anyway, since community identity rather than administrative geography is the relevant biological signal in South Asian genetics.
What haplogroup is most common among Telugu Brahmins?
Telugu Brahmins show a higher prevalence of haplogroup R1a than non-Brahmin Telugu communities, consistent with the pattern seen across South Indian Brahmin groups and reflecting the steppe ancestry associated with Brahmin migration into the Deccan. Haplogroup J2 is also present, associated with Near Eastern Neolithic agricultural ancestry. Haplogroup L and haplogroup R2 are present at lower frequencies. The overall haplogroup profile of Telugu Brahmins more closely resembles that of other South Indian Brahmin groups (Tamil Brahmin, Kannada Brahmin) than it resembles that of non-Brahmin Telugu populations.