Depending on the age of the language family under consideration, its homeland may be known with near-certainty (in the case of historical or near-historical migrations) or it may be very uncertain (in the case of deep prehistory). Next to internal linguistic evidence, the reconstruction of a prehistoric homeland makes use of a variety of disciplines, including archaeology and archaeogenetics.
Methods
There are several methods to determine the homeland of a given language family. One method is based on the vocabulary that can be reconstructed for the proto-language. This vocabulary – especially terms for flora and fauna – can provide clues for the geographical and ecological environment in which the proto-language was spoken. An estimate for the time-depth of the proto-language is necessary in order to account for prehistorical changes in climate and the distribution of flora and fauna.[1][2]
Another method is based on the linguistic migration theory (first proposed by Edward Sapir), which states that the most likely candidate for the last homeland of a language family can be located in the area of its highest linguistic diversity.[3] This presupposes an established view about the internal subgrouping of the language family. Different assumptions about high-order subgrouping can thus lead to very divergent proposals for a linguistic homeland (e.g. Isidore Dyen's proposal for New Guinea as the center of dispersal of the Austronesian languages).[4] The linguistic migration theory has its limits because it only works when linguistic diversity evolves continuously without major disruptions. Its results can be distorted e.g. when this diversity is wiped out by more recent migrations.[5]
The concept of a (single, identifiable) "homeland" of a given language family implies a purely genealogical view of the development of languages. This assumption is often reasonable and useful, but it is by no means a logical necessity, as languages are well known to be susceptible to areal change such as substrate or superstrate influence.
Over a sufficient period of time, in the absence of evidence of intermediary steps in the process, it may be impossible to observe linkages between languages that have a shared Urheimat: given enough time, natural language change will obliterate any meaningful linguistic evidence of a common genetic source.
This general concern is a manifestation of the larger issue of "time depth" in historical linguistics.[6]
For example, the languages of the New World are believed to be descended from a relatively "rapid" peopling of the Americas (relative to the duration of the Upper Paleolithic) within a few millennia (roughly between 20,000 and 15,000 years ago),[7] but their genetic relationship has become completely obscured over the more than ten millennia which have passed between their separation and their first written record in the early modern period. Similarly, the Australian Aboriginal languages are divided into some 28 families and isolates for which no genetic relationship can be shown.[8]
The Urheimaten reconstructed using the methods of comparative linguistics typically estimate separation times dating to the Neolithic or later. It is undisputed that fully developed languages were present throughout the Upper Paleolithic, and possibly into the deep Middle Paleolithic (see origin of language, behavioral modernity). These languages would have spread with the early human migrations of the first "peopling of the world", but they are no longer amenable to linguistic reconstruction. The Last Glacial Maximum (LGM) has imposed linguistic separation lasting several millennia on many Upper Paleolithic populations in Eurasia, as they were forced to retreat into "refugia" before the advancing ice sheets. After the end of the LGM, Mesolithic populations of the Holocene again became more mobile, and most of the prehistoric spread of the world's major linguistic families seem to reflect the expansion of population cores during the Mesolithic followed by the Neolithic Revolution.
The Nostratic theory is the best-known attempt to expand the deep prehistory of the main language families of Eurasia (excepting Sino-Tibetan and the languages of Southeast Asia) to the beginning of the Holocene. First proposed in the early 20th century, the Nostratic theory still receives serious consideration, but it is by no means generally accepted. The more recent and more speculative "Borean" hypothesis attempts to unite Nostratic with Dené–Caucasian and Austric, in a "mega-phylum" that would unite most languages of Eurasia, with a time depth going back to the Last Glacial Maximum.
The argument surrounding the "Proto-Human language", finally, is almost completely detached from linguistic reconstruction, instead surrounding questions of phonology and the origin of speech. Time depths involved in the deep prehistory of all the world's extant languages are of the order of at least 100,000 years.[9]
Language contact and creolization
The concept of an Urheimat only applies to populations speaking a proto-language defined by the tree model. This is not always the case.
For example, in places where language families meet, the relationship between a group that speaks a language and the Urheimat for that language is complicated by "processes of migration, language shift and group absorption are documented by linguists and ethnographers" in groups that are themselves "transient and plastic." Thus, in the contact area in western Ethiopia between languages belonging to the Nilo-Saharan and Afroasiatic families, the Nilo-Saharan-speaking Nyangatom and the Afroasiatic-speaking Daasanach have been observed to be closely related to each other but genetically distinct from neighboring Afroasiatic-speaking populations. This is a reflection of the fact that the Daasanach, like the Nyangatom, originally spoke a Nilo-Saharan language, with the ancestral Daasanach later adopting an Afroasiatic language around the 19th century.[10]
Creole languages are hybrids of languages that are sometimes unrelated. Similarities arise from the creole formation process, rather than from genetic descent.[11] For example, a creole language may lack significant inflectional morphology, lack tone on monosyllabic words, or lack semantically opaque word formation, even if these features are found in all of the parent languages of the languages from which the creole was formed.[12]
Isolates
Some languages are language isolates. That is to say, they have no well accepted language family connection, no nodes in a family tree, and therefore no known Urheimat. An example is the Basque language of Northern Spain and southwest France. Nevertheless, it is a scientific fact that all languages evolve. An unknown Urheimat may still be hypothesized, such as that for a Proto-Basque, and may be supported by archaeological and historical evidence.
Sometimes relatives are found for a language originally believed to be an isolate. An example is the Etruscan language, which, even though only partially understood, is believed to be related to the Rhaetic language and to the Lemnian language. A single family may be an isolate. In the case of the non-Austronesian indigenous languages of Papua New Guinea and the indigenous languages of Australia, there is no published linguistic hypothesis supported by any evidence that these languages have links to any other families. Nevertheless, an unknown Urheimat is implied. The entire Indo-European family itself is a language isolate: no further connections are known. This lack of information does not prevent some professional linguists from formulating additional hypothetical nodes (Nostratic) and additional homelands for the speakers.
Although Dravidian languages are now concentrated in southern India, isolated pockets further north, placenames and substrate influences on Indo-Aryan languages indicate that they were once spoken more widely across the Indian subcontinent.[18] Reconstructed Proto-Dravidian terms for flora and fauna support the idea that Dravidian is indigenous to India.[19] Proponents of a migration from the northwest cite the location of Brahui, a hypothesized connection to the undeciphered Indus script, and claims of a link to Elamite.[20]
Turkic
Turkic languages are today spoken across an area stretching from northwest China to the edge of Europe, but Proto-Turkic lexical items about the climate, topography, flora, fauna and subsistence point to a homeland in the taiga-steppe zone of southern Siberia and Mongolia around the Altai-Sayan region.[21] Early contact with Mongolic languages also points to this area.[22] Genetic studies suggest that most of the expansion of the language family was due to language replacement rather than migration, but have identified shared elements originating from the South Siberia-Mongolia area.[23]
Uralic
Inherited tree names seem to indicate a Uralic homeland to the east of the Ural Mountains. The internal branching of the family suggests an area between the Ob River and Yenisey River.[24] Uralic speakers are not genetically distinguished from their neighbours, but do share a genetic component that is of Siberian origin.[25][26]
All modern Koreanic varieties are descended from the language of Unified Silla, which ruled the southern two-thirds of the Korean peninsula between the 7th and 10th centuries.[30][31] Evidence for the earlier linguistic history of the peninsula is extremely sparse.[32] The orthodox view among Korean social historians is that the Korean people migrated to the peninsula from the north, but no archaeological evidence of such a migration has been found.[33][34]
Sino-Tibetan
The reconstruction of Sino-Tibetan is much less developed than for other major families, so its higher-level structure and time depth remain unclear.[35] Proposed homelands and periods include: the upper and middle reaches of the Yellow River about 4–8 kya, associated with the hypothesis of a top-level branching between Chinese and the rest (most likely); southwestern Sichuan around 9 kya, associated with the hypothesis that Chinese and Tibetan form a subbranch; Northeast India (the area of maximal diversity) 9–10 kya.[36]
Austroasiatic is widely held to be the oldest family in mainland Southeast Asia, with its current discontinuous distribution resulting from the later arrival of other families. The various branches share a great deal of vocabulary concerning rice cultivation, but few related to metals.[39] Identification of the homeland of the family has been hampered by the lack of progress on its branching. The main proposals are Northern India (favoured by those who assume an early branching of Munda), Southeast Asia (the area of maximal diversity; most likely) and southern China (based on claimed loanwords in Chinese).[40]
Austronesian
The homeland of the Austronesian languages is widely accepted by linguists to be Taiwan, since nine of its ten branches are found there, with all Austronesian languages found outside Taiwan belonging to the remaining Malayo-Polynesian branch.[41]
Some authorities on the history of the Uto-Aztecan language family place the Proto-Uto-Aztecan homeland in the border region between the USA and Mexico, namely the upland regions of Arizona and New Mexico and the adjacent areas of the Mexican states of Sonora and Chihuahua, roughly corresponding to the Sonoran Desert. The proto-language would have been spoken by foragers, about 5,000 years ago. Hill (2001) proposes instead a homeland further south, making the assumed speakers of Proto-Uto-Aztecan maize cultivators in Mesoamerica, who were gradually pushed north, bringing maize cultivation with them, during the period of roughly 4,500 to 3,000 years ago, the geographic diffusion of speakers corresponding to the breakup of linguistic unity.[49]
South America
Tupian
Proto-Tupian, the reconstructed common ancestor of the Tupian languages of South America, was probably spoken in the region between the Guaporé and Aripuanã rivers, around 5,000 years ago.[50]
Africa and Middle East
Afroasiatic
As Semitic is the only branch of Afroasiatic found outside Africa, northeast Africa is considered the most likely location of the Afroasiatic homeland. An alternative theory, based on lexical comparisons with Indo-European, proposes a Neolithic expansion from the Middle East.[51][52][53] Proto-Afroasiatic is estimated to have begun to break up in the 8th millennium BCE.[51]Proto-Semitic is thought to have been spoken in the Near East between 4400 and 7400 BCE, with Akkadian representing its earliest known branch.[54]
Niger–Congo
Although the membership and subgrouping of the Niger–Congo remains unsettled, the widely-accepted core of the group includes over 1000 languages spoken from West Africa through most of Southern Africa.[55] The homeland is thought to have been somewhere in the savanna belt of West Africa, with the Bantu expansion through the equatorial rainforests of Central Africa beginning around 3000 BCE.[56]
Mande
Valentin Vydrin concluded that "the Mande homeland at the second half of the 4th millennium BC was located in Southern Sahara, somewhere to the North of 16° or even 18° of Northern latitude and between 3° and 12° of Western longitude."[57] That is now Mauritania and/or southern Western Sahara.[58]
Nilo-Saharan
The validity of the Nilo-Saharan family remains controversial. Proponents of the family view the border area between Chad, Sudan, and the Central African Republic as a likely candidate for its homeland from around the start of the Holocene.[59]
^Bowern, Claire; Atkinson, Quentin (2012). "Computational Phylogenetics and the Internal Structure of Pama-Nyungan". Language. 84 (4): 817–845. Kayser, Manfred (2010), "The Human Genetic History of Oceania: Near and Remote Views of Dispersal", Current Biology, 20 (4): R194–201, Bibcode:2010CBio...20.R194K, doi:10.1016/j.cub.2009.12.004, PMID20178767, S2CID7282462
^Bengtson and Ruhlen (1994) offered a list of 27 "global etymologies". Bengtson, John D. and Merritt Ruhlen. 1994. "Global etymologies"Archived 2007-09-28 at the Wayback Machine. In Ruhlen 1994a, pp. 277–336. This approach has been criticized as flawed by Campbell and Poser (2008) who used the same criteria employed by Bengtson and Ruhlen to identify "cognates" in Spanish known to be false. Campbell, Lyle, and William J. Poser. 2008. Language Classification: History and Method. Cambridge: Cambridge University Press, 370–372.
^McWhorter, John H. (December 1998). "Identifying the Creole Prototype: Vindicating a Typological Class". Language. 74 (4): 788–818. doi:10.2307/417003. JSTOR417003.
^McWhorter, John H. (1999), "The Afrogenesis Hypothesis of Plantation Creole Origin", in Huber, Magnus; Parkvall, Mikael (eds.), Spreading the Word: The Issue of Diffusion among the Atlantic Creoles, London: Westminster University Press, pp. 111–152
^Golden, Peter B. (2011). "Ethnogenesis in the tribal zone: The Shaping of the Turks". Studies on the Peoples and Cultures of the Eurasian Steppes. Editura Academiei Române. pp. 17–63. ISBN978-973-27-2152-0. pp. 35–37.
^Janhunen, Juha (2003). "Ethnicity and language in prehistoric Northeast Asia". In Blench, Roger; Spriggs, Matthew (eds.). Archaeology and Language II: Archaeological Data and Linguistic Hypotheses. Routledge. pp. 195–208. ISBN0-415-11761-5. p. 203.
^Saag, Lehti; Laneman, Margot; Varul, Liivi; Malve, Martin; Valk, Heiki; Razzak, Maria A.; Shirobokov, Ivan G.; Khartanovich, Valeri I.; Mikhaylova, Elena R.; Kushniarevich, Alena; Scheib, Christiana Lyn; Solnik, Anu; Reisberg, Tuuli; Parik, Jüri; Saag, Lauri; Metspalu, Ene; Rootsi, Siiri; Montinaro, Francesco; Remm, Maido; Mägi, Reedik; D'Atanasio, Eugenia; Crema, Enrico Ryunosuke; Díez-del-Molino, David; Thomas, Mark G.; Kriiska, Aivar; Kivisild, Toomas; Villems, Richard; Lang, Valter; Metspalu, Mait; Tambets, Kristiina (May 2019). "The Arrival of Siberian Ancestry Connecting the Eastern Baltic to Uralic Speakers further East". Current Biology. 29 (10): 1701–1711.e16. Bibcode:2019CBio...29E1701S. doi:10.1016/j.cub.2019.04.026. PMC6544527. PMID31080083.
^Serafim, Leon A. (2008). "The uses of Ryukyuan in understanding Japanese language history". In Frellesvig, Bjarke; Whitman, John (eds.). Proto-Japanese: Issues and Prospects. John Benjamins. pp. 79–99. ISBN978-90-272-4809-1. p. 98.
^Lee, Ki-Moon; Ramsey, S. Robert (2011). A History of the Korean Language. Cambridge University Press. p. 4. ISBN978-1-139-49448-9.
^Lee, Iksop; Ramsey, S. Robert (2000). The Korean Language. SUNY Press. p. 31. ISBN978-0-7914-4831-1.
^Yi, Seonbok (2014). "Korea: archaeology". In Bellwood, Peter (ed.). The Global Prehistory of Human Migration. Wiley. pp. 586–597. ISBN978-1-118-97059-1. pp. 586–587.
^Handel, Zev (May 2008). "What is Sino-Tibetan? Snapshot of a Field and a Language Family in Flux: Sino-Tibetan: a Snapshot". Language and Linguistics Compass. 2 (3): 422–441. doi:10.1111/j.1749-818X.2008.00061.x.
^"Kra-dai and Austronesian: Notes on phonological correspondences and vocabulary distribution WEERA OSTAPIRAT". The Peopling of East Asia. 2005. pp. 135–159. doi:10.4324/9780203343685-20. ISBN9780203343685.
^Sidwell, Paul (2015). "Austroasiatic Classification". In Jenny, Mathias; Sidwell, Paul (eds.). The Handbook of the Austroasiatic Languages. Leiden: BRILL. pp. 144–220. ISBN978-90-04-28295-7. p. 146.
^Rau, Felix; Sidwell, Paul (12 September 2019). "The Munda Maritime Hypothesis". Journal of the Southeast Asian Linguistics Society. 12 (2): 35–57. hdl:10524/52454. GaleA651276154.
^Potter, Ben A. (2010). "Archaeological Patterning in Northeast Asia and Northwest North America: An Examination of the Dene-Yeniseian Hypothesis". Anthropological Papers of the University of Alaska. 5 (1–2): 138–167.
^Yanovich, Igor (16 September 2020). "Phylogenetic linguistic evidence and the Dene-Yeniseian homeland". Diachronica. 37 (3): 410–446. doi:10.1075/dia.17038.yan. S2CID209542004.
^Bakker, Peter (2013). "Diachrony and typology in the history of Cree". In Folke Josephson; Ingmar Söhrman (eds.). Diachronic and typological perspectives on verbs. Amsterdam: John Benjamins. pp. 223–260.
^Golla, Victor (2011). California Indian Languages. Berkeley: University of California Press. p. 256.
^Hill, Jane H. (2001). "Proto-Uto-Aztecan: A Community of Cultivators in Central Mexico?". American Anthropologist. 103 (4): 913–934. doi:10.1525/aa.2001.103.4.913. JSTOR684121.
^Rodrigues, Aryon Dall'Igna; Cabral, Ana Suelly Arruda Câmara (2012). "Tupían". In Campbell, Lyle; Grondona, Verónica (eds.). The Indigenous Languages of South America: A Comprehensive Guide. Walter de Gruyter. pp. 495–574. ISBN978-3-11-025803-5.
^ abPorkhomovsky, Victor (2020). "Afro-Asiatic Overview". In Vossen, Rainer; Dimmendaal, Gerrit J. (eds.). The Oxford Handbook of African Languages. Oxford University Press. pp. 269–274. ISBN978-0-19-960989-5. p. 273.
^Güldemann, Tom (2018). "Historical linguistics and genealogical language classification in Africa". In Güldemann, Tom (ed.). The Languages and Linguistics of Africa. De Gruyter Mouton. pp. 58–444. ISBN978-3-11-042606-9. p. 311.
^Blažek, Václav (2013). "Levant and North Africa: Afroasiatic linguistic history". In Ness, Immanuel (ed.). The Encyclopedia of Global Human Migration. Wiley. pp. 125–132. doi:10.1002/9781444351071.wbeghm815. ISBN978-1-4443-3489-0. p. 126.
Anthony, David W. (2007), The Horse, the Wheel and Language: How Bronze-Age Riders from the Eurasian Steppes Shaped the Modern World, Princeton University Press
Anthony, David; Ringe, Don (2015), "The Indo-European Homeland from Linguistic and Archaeological Perspectives", Annual Review of Linguistics, 1: 199–219, doi:10.1146/annurev-linguist-030514-124812
Bellwood, Peter (2007) [1997]. Prehistory of the Indo-Malaysian Archipelago: Revised Edition. ANU E Press. ISBN978-1-921313-12-7.
Bellwood, Peter S. (2000), Presejarah Kepulauan Indo-Malaysia (Translation of Bellwood 2007)
Blust, Robert (1984). "The Austronesian Homeland: A Linguistic Perspective". Asian Perspectives. 26 (1): 45–67. JSTOR42928105.
Campbell, Lyle (1997), American Indian Languages: The Historical Linguistics of Native America, Oxford University Press, ISBN978-0-19-509427-5.
——— (2013), Historical Linguistics: An Introduction (3rd ed.), Edinburgh University Press, ISBN978-0-7486-4601-2.
Mallory, J.P. (1989), In Search of the Indo-Europeans: Language, Archaeology, and Myth, London: Thames & Hudson.
Mallory, James P. (1997), "The homelands of the Indo-Europeans", in Blench, Roger; Spriggs, Matthew (eds.), Archaeology and Language, vol. I: Theoretical and Methodological Orientations, London: Routledge, ISBN978-0-415-11760-9.
Mallory, J.P.; Adams, D.Q. (2006), The Oxford introduction to Proto-Indo-European and the Proto-Indo-European world (Repr. ed.), Oxford [u.a.]: Oxford Univ. Press, ISBN978-0-19-928791-8
Pereltsvaig, Asya; Lewis, Martin W. (2015), "Searching for Indo-European origins", The Indo-European Controversy, Cambridge University Press, ISBN978-1-107-05453-0
Sohn, Ho-Min (1999), The Korean Language, Cambridge: Cambridge University Press, ISBN978-0-521-36123-1.