In historical linguistics , the tree model (also Stammbaum , genetic , or cladistic model ) is a model of the evolution of languages analogous to the concept of a family tree , particularly a phylogenetic tree in the biological evolution of species . As with species, each language is assumed to have evolved from a single parent or "mother" language, with languages that share a common ancestor belonging to the same language family .
92-674: Gawar-Bati or Narsati is an Indo-Aryan language spoken in Kunar Province of eastern Afghanistan and across the border in Pakistan. It is also known as Kohistani in Kunar. Gawar-Bati has an estimated 75,000 speakers, of whom 50,000 are in Kunar and 25,000 in Chitral, Pakistan. The Gawar-Bati language has not been given serious study by linguists, except that it is mentioned by George Morgenstierne (1926) and Kendall Decker (1992). It
184-425: A clade and the discovery of clades was cladistics . Greenberg began writing during a time when phylogenetic systematics lacked the tools available to it later: the computer (computational systematics) and DNA sequencing ( molecular systematics ). To discover a cladistic relationship researchers relied on as large a number of morphological similarities among species as could be defined and tabulated. Statistically
276-532: A lexicostatistical study of the New Indo-Aryan languages based on a 100-word Swadesh list , using techniques developed by the glottochronologist and comparative linguist Sergei Starostin . That grouping system is notable for Kogan's exclusion of Dardic from Indo-Aryan on the basis of his previous studies showing low lexical similarity to Indo-Aryan (43.5%) and negligible difference with similarity to Iranian (39.3%). He also calculated Sinhala–Dhivehi to be
368-484: A "natural arrangement" based on evolution. He says: "It may be worth while to illustrate this view of classification, by taking the case of languages. If we possessed a perfect pedigree of mankind, a genealogical arrangement of the races of man would afford the best classification of the various languages now spoken throughout the world; and if all extinct languages, and all intermediate and slowly changing dialects, had to be included, such an arrangement would, I think, be
460-425: A and b, which are closest only to each other, are assumed to have a common ancestor, a-b. The next closest language, c, is assumed to have a common ancestor with a-b, and so on. The result is a projected series of historical paths leading from the overall common ancestor (the root) to the languages (the leaves). Each path is unique. There are no links between paths. Every leaf and node have one and only one ancestor. All
552-532: A biological hierarchy under several phyla , or most general groups, branching ultimately to the various species. The basis for this biological classification was the observed shared physical features of the species. Darwin, however, reviving another ancient metaphor, the tree of life , hypothesized that the groups of the Linnaean classification (today's taxa ), descended in a tree structure over time from simplest to most complex. The Linnaean hierarchical tree
644-819: A branch of the Indo-Iranian languages in the Indo-European language family . As of the early 21st century, they have more than 800 million speakers, primarily concentrated east of the Indus river in Bangladesh , North India , Eastern Pakistan , Sri Lanka , Maldives and Nepal . Moreover, apart from the Indian subcontinent , large immigrant and expatriate Indo-Aryan–speaking communities live in Northwestern Europe , Western Asia , North America ,
736-430: A considerable number of such items ... we necessarily draw a conclusion of common historical origin. Such genetic classifications are not arbitrary ... the analogy here to biological classification is extremely close ... just as in biology we classify species in the same genus or high unit because the resemblances are such as to suggest a hypothesis of common descent, so with genetic hypotheses in language." In this analogy,
828-399: A dialect continuum rather than from linguistically isolated child languages of a single language. Augustine of Hippo supposed that each of the descendants of Noah founded a nation and that each nation was given its own language: Assyrian for Assur , Hebrew for Heber , and so on. In all he identified 72 nations, tribal founders and languages. The confusion and dispersion occurred in
920-806: A language family is like a clade , the languages are like species , the proto-language is like an ancestor taxon , the language tree is like a phylogenetic tree and languages and dialects are like species and varieties. Greenberg formulated large tables of characteristics of hitherto neglected languages of Africa, the Americas, Indonesia and northern Eurasia and typed them according to their similarities. He called this approach " typological classification", arrived at by descriptive linguistics rather than by comparative linguistics . The comparative method has been used by historical linguists to piece together tree models utilizing discrete lexical, morphological, and phonological data. Chronology can be found but there
1012-639: A language in all parts, ... may very well be doubted." By then, discovery of the New World and exploration of the Far East had brought knowledge of numbers of new languages far beyond the 72 calculated by St. Augustine. Citing the Native American languages, Browne suggests the "confusion of tongues at first fell only upon those present in Sinaar at the work of Babel ...." For those "about the foot of
SECTION 10
#17327832280101104-621: A number of reconstructive activities by the scholars of the times: "The learned Casaubon conceiveth that a dialogue might be composed in Saxon, only of such words as are derivable from the Greek ... Verstegan made no doubt that he could contrive a letter that might be understood by the English, Dutch, and East Frislander ... And if, as the learned Buxhornius contendeth, the Scythian language as
1196-439: A series of essays beginning about 1950. Since the adoption of the family tree metaphor by the linguists, the concept of evolution had been proposed by Charles Darwin and was generally accepted in biology. Taxonomy , the classification of living things, had already been invented by Carl Linnaeus . It used a binomial nomenclature to assign a species name and a genus name to every known living organism. These were arranged in
1288-461: A significant number of characters, which could not be explained by genetic descent from the group's calculated ancestor, were borrowed. Presumably, if the wave model , which explained borrowing, were a complete explanation of the group's characters, no phylogeny at all could be found for it. If both models were partially effective, then a tree would exist, but it would need to be supplemented by non-genetic explanations. The researchers therefore modified
1380-421: A single parent." The purpose of phylogenetic software is to generate cladograms , a special kind of tree in which the links only bifurcate; that is, at any node in the same direction only two branches are offered. The input data is a set of characters that can be assigned states in different languages, such as present (1) or absent (0). A language therefore can be described by a unique coordinate set consisting of
1472-499: A standardised and Sanskritised register of Dehlavi , is the official language of the Government of India (along with English ). Together with Urdu , it is the third most-spoken language in the world. The Eastern Indo-Aryan languages, also known as Magadhan languages, are spoken throughout the eastern subcontinent, including Odisha and Bihar , alongside other regions surrounding the northwestern Himalayan corridor. Bengali
1564-613: A subfamily of Indo-Aryan. The Dardic group as a genetic grouping (rather than areal) has been scrutinised and questioned to a degree by recent scholarship: Southworth, for example, says "the viability of Dardic as a genuine subgroup of Indo-Aryan is doubtful" and "the similarities among [Dardic languages] may result from subsequent convergence". The Dardic languages are thought to be transitional with Punjabi and Pahari (e.g. Zoller describes Kashmiri as "an interlink between Dardic and West Pahāṛī"), as well as non-Indo-Aryan Nuristani; and are renowned for their relatively conservative features in
1656-578: A tree of languages appears fully developed short of being a professional linguist's view: "I will now hoist sail for the Netherlands, whose language is the same dialect with the English, and was so from the beginning, being both of them derived from the high Dutch [Howell is wrong here]: The Danish also is but a branch of the same tree ... Now the High Dutch or Teutonick Tongue, is one of the prime and most spacious Maternal Languages of Europe ... it
1748-424: A unique path from root to leaf, but its origin could be either the root under consideration or a contact language. If all the states of the experiment could be accounted for by the network, it was termed a perfect phylogenetic network. The generation of networks required two phases. In the first phase, the researchers devised a number of phylogenies, called candidate trees, to be tested for compatibility. A character
1840-530: A universal descent from the language of paradise, located in Kashmir central to the total range of the 500. Young begins by pointing out Adelung's indebtedness to Conrad Gesner 's Mithridates, de Differentiis Linguarum of 1555 and other subsequent catalogues of languages and alphabets. Young undertakes to present Adelung's classification. The monosyllabic type is most ancient and primitive, spoken in Asia, to
1932-1201: Is classified as an Indo-Aryan language of the Dardic subgroup . However, the term Dardic is not linguistic but merely geographic. The following tables set out the phonology of the Gawar-Bati language: The status of short /e/ and /o/ is unclear. A breathy voiced series, /bʱ dʱ gʱ/, existed recently in older speakers—and may still do so. It is rarely written. This alphabet is used in Pakistan: 35°19′38″N 71°35′05″E / 35.32722°N 71.58472°E / 35.32722; 71.58472 Indo-Aryan languages Pontic Steppe Caucasus East Asia Eastern Europe Northern Europe Pontic Steppe Northern/Eastern Steppe Europe South Asia Steppe Europe Caucasus India Indo-Aryans Iranians East Asia Europe East Asia Europe Indo-Aryan Iranian Indo-Aryan Iranian Others European The Indo-Aryan languages (or sometimes simply Indic languages ) are
SECTION 20
#17327832280102024-485: Is compatible when its origin is explained by the phylogeny generated. In a perfect phylogeny, all the characters are compatible and the compatibility of the tree is 100%. By the principle of parsimony, or Occam's razor , no networks are warranted. Candidate trees were obtained by first running the phylogeny-generation software using the Indo-European dataset (the strings of character states) as input, then modifying
2116-591: Is dear" (Mayrhofer II 182), Priyamazda ( priiamazda ) as Priyamedha "whose wisdom is dear" (Mayrhofer II 189, II378), Citrarata as Citraratha "whose chariot is shining" (Mayrhofer I 553), Indaruda/Endaruta as Indrota "helped by Indra " (Mayrhofer I 134), Shativaza ( šattiṷaza ) as Sātivāja "winning the race price" (Mayrhofer II 540, 696), Šubandhu as Subandhu "having good relatives" (a name in Palestine , Mayrhofer II 209, 735), Tushratta ( tṷišeratta, tušratta , etc.) as *tṷaiašaratha, Vedic Tvastar "whose chariot
2208-656: Is in many cases somewhat arbitrary. The classification of the Indo-Aryan languages is controversial, with many transitional areas that are assigned to different branches depending on classification. There are concerns that a tree model is insufficient for explaining the development of New Indo-Aryan, with some scholars suggesting the wave model . The following table of proposals is expanded from Masica (1991) (from Hoernlé to Turner), and also includes subsequent classification proposals. The table lists only some modern Indo-Aryan languages. Anton I. Kogan , in 2016, conducted
2300-401: Is no absolute date estimates utilizing this system. Glottochronology enables absolute dates to be estimated. Shared cognates (cognates meaning to have common historical origin) are calculate divergence times. However the method was found to be later discredited due to the data being unreliable. Due to this historical linguists have trouble with exact age estimation when pinpointing the age of
2392-577: Is suggested that "proto-Munda" languages may have once dominated the eastern Indo-Gangetic Plain , and were then absorbed by Indo-Aryan languages at an early date as Indo-Aryan spread east. Marathi-Konkani languages are ultimately descended from Maharashtri Prakrit , whereas Insular Indo-Aryan languages are descended from Elu Prakrit and possess several characteristics that markedly distinguish them from most of their mainland Indo-Aryan counterparts. Insular Indo-Aryan languages (of Sri Lanka and Maldives ) started developing independently and diverging from
2484-424: Is the earliest stage of the Indo-Aryan branch, from which all known languages of the later stages Middle and New Indo-Aryan are derived, some documented Middle Indo-Aryan variants cannot fully be derived from the documented form of Old Indo-Aryan (on which Vedic and Classical Sanskrit are based), but betray features that must go back to other undocumented dialects of Old Indo-Aryan. Tree model Popularized by
2576-620: Is the official language of Gujarat , and is spoken by over 50 million people. In Europe, various Romani languages are spoken by the Romani people , an itinerant community who historically migrated from India. The Western Indo-Aryan languages are thought to have diverged from their northwestern counterparts, although they have a common antecedent in Shauraseni Prakrit . Within India, Central Indo-Aryan languages are spoken primarily in
2668-567: Is the seventh most-spoken language in the world, and has a strong literary tradition; the national anthems of India and Bangladesh are written in Bengali. Assamese and Odia are the official languages of Assam and Odisha , respectively. The Eastern Indo-Aryan languages descend from Magadhan Apabhraṃśa and ultimately from Magadhi Prakrit . Eastern Indo-Aryan languages display many morphosyntactic features similar to those of Munda languages , while western Indo-Aryan languages do not. It
2760-661: Is vehement" (Mayrhofer, Etym. Wb., I 686, I 736). The earliest evidence of the group is from Vedic Sanskrit , that is used in the ancient preserved texts of the Indian subcontinent , the foundational canon of the Hindu synthesis known as the Vedas . The Indo-Aryan superstrate in Mitanni is of similar age to the language of the Rigveda , but the only evidence of it is a few proper names and specialized loanwords. While Old Indo-Aryan
2852-453: The (London) Quarterly Review of late 1813–1814, Thomas Young published a review of Johann Christoph Adelung 's Mithridates, oder allgemeine Sprachenkunde ("Mithridates, or a General History of Languages"), Volume I of which had come out in 1806, and Volumes II and III, 1809–1812 , continued by Johann Severin Vater. Adelung's work described some 500 "languages and dialects" and hypothesized
Gawar-Bati language - Misplaced Pages Continue
2944-745: The Caribbean , Southeast Africa , Polynesia and Australia , along with several million speakers of Romani languages primarily concentrated in Southeastern Europe . There are over 200 known Indo-Aryan languages. Modern Indo-Aryan languages descend from Old Indo-Aryan languages such as early Vedic Sanskrit , through Middle Indo-Aryan languages (or Prakrits ). The largest such languages in terms of first-speakers are Hindi–Urdu ( c. 330 million ), Bengali (242 million), Punjabi (about 150 million), Marathi (112 million), and Gujarati (60 million). A 2005 estimate placed
3036-473: The Galapagos Islands , who had appeared to be modifications of a common ancestor. Selection of domestic species to produce a new variety also played a role in his conclusions. The first edition of Origin of Species in 1859 discusses the language tree as though de novo under the topic of classification. Darwin criticises the synchronic method devised by Linnaeus , suggesting that it be replaced by
3128-440: The Indo-European languages . However, this is largely a theoretical, qualitative pursuit, and linguists have always emphasized the inherent limitations of the tree model due to the large role played by horizontal transmission in language evolution, ranging from loanwords to creole languages that have multiple mother languages. The wave model was developed in 1872 by Schleicher's student Johannes Schmidt as an alternative to
3220-540: The Kurgan theory and the Anatolian theory , both claiming origins of Info-European languages. The comparative method compares features of various languages to assess how similar one language is to another. The results of such an assessment are data-oriented; that is, the results depend on the number of features and the number of languages compared. Until the arrival of the computer on the historical linguistics landscape,
3312-464: The Stammbaum in an article of 1853, six years before the first edition of Origin of Species in 1859. The concept of descent of languages was by no means new. Thomas Jefferson , a devout linguist himself, had proposed that the continual necessity for neologisms implies that languages must "progress" or "advance." These ideas foreshadow evolution of either biological species or languages, but after
3404-609: The lingua primaeva or the lingua primigenia. In English it was the Adamic language; in German, the Ursprache or the hebräische Ursprache if one believed it was Hebrew. This mysterious language had the aura of purity and incorruption about it, and those qualities were the standards used to select candidates. This concept of Ursprache came into use well before the neo-grammarians adopted it for their proto-languages. The gap between
3496-474: The German linguist August Schleicher in 1853, the tree model has always been a common method of describing genetic relationships between languages since the first attempts to do so. It is central to the field of comparative linguistics , which involves using evidence from known languages and observed rules of language feature evolution to identify and describe the hypothetical proto-languages ancestral to each language family, such as Proto-Indo-European and
3588-653: The Indo-European language family. It could range from 4000 BP to 40,000 BP, or anywhere in-between those dates according to Dixon sourced from the rise and fall of language, (Cambridge University Press). As seen in the article here. Possible solutions for Glottochronology are forthcoming due to computational phylogenetic methods. Techniques such as using models of evolution improves accuracy of tree branch length and topology. There for, using computational phylogenetic methods computational methods enable researchers to analyze linguistic data from evolutionary biology. This further assists in testing theories against each other, such as
3680-448: The Indo-European phylogeny with greater confidence. To obtain a reasonably valid phylogeny, the researchers found they needed to enter as input all three types of characters: phonological, lexical and morphological, which were all required to present a picture that was sufficiently detailed for calculation of phylogeny. Only qualitative characters produced meaningful results. Repeated states were too ambiguous to be correctly interpreted by
3772-750: The Mittani are either in Hurrian (which appears to have been the predominant language of their kingdom) or Akkadian (the main diplomatic language of the Late Bronze Age Near East), these apparently Indo-Aryan names suggest that an Indo-Aryan elite imposed itself over the Hurrians in the course of the Indo-Aryan expansion . If these traces are Indo-Aryan, they would be the earliest known direct evidence of Indo-Aryan, and would increase
Gawar-Bati language - Misplaced Pages Continue
3864-563: The Science of Language." In a scenario reminiscent of that between Darwin and Wallace over the discovery of evolution (both discovered it independently), Schleicher endorsed Darwin's presentation, but criticised it for not inserting any species. He then presented a Stammbaum of languages, which, however, was not the first he had published. The evolution of languages was not the source of Darwin's theory of evolution. He had based that on variation of species, such as he had observed in finches in
3956-399: The author of the successful German-language candidate, Indo-Germanisch, who criticised Jones for his uncritical method, knew Chinese, Japanese, Tibetan and a number of other languages with their scripts.) The concept of a Biblical Ursprache appealed to their imagination. As hope of finding it gradually died they fell back on the growing concept of common Indo-European spoken by nomadic tribes on
4048-470: The best compatibility scores. The incompatibilities were all lexical, and A's were a subset of C's. Subsequent generation of networks found that all incompatibilities could be resolved with a minimum of three contact edges except for Tree E. As it did not have a high compatibility, it was excluded. Tree A had 16 possible networks, which a feasibility inspection reduced to three. Tree C had one network, but as it required an interface to Baltic and not Slavic, it
4140-452: The concept of species classification in biology. The limitations of the tree model, in particular its inability to handle the non-discrete distribution of shared innovations in dialect continua , have been addressed through the development of non-cladistic (non-tree-based) methodologies. They include the Wave model ; and more recently, the concept of linkage . An additional limitation of
4232-426: The contact of Schleicher with Darwin's ideas, and perhaps Darwin's contact with the historical linguists, Evolution and language change were inextricably linked, and would become the basis for classification. Now, as then, the main problems would be to prove specific lines of descent, and to identify the branch points. The old metaphor was given an entirely new meaning under the old name by Joseph Harold Greenberg in
4324-612: The context of Proto-Indo-Aryan . The Northern Indo-Aryan languages , also known as the Pahari ('hill') languages, are spoken throughout the Himalayan regions of the subcontinent. Northwestern Indo-Aryan languages are spoken in the northwestern region of India and eastern region of Pakistan. Punjabi is spoken predominantly in the Punjab region and is the official language of the northern Indian state of Punjab , in addition to being
4416-471: The continental Indo-Aryan languages from around 5th century BCE. The following languages are otherwise unclassified within Indo-Aryan: Dates indicate only a rough time frame. Proto-Indo-Aryan (or sometimes Proto-Indic ) is the reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the pre-Vedic Indo-Aryans . Proto-Indo-Aryan is meant to be
4508-804: The early 21st century, the Computational Phylogenetics in Historical Linguistics (CPHL) project, a consortium of historical linguists, received funding from the National Science Foundation to study phylogenies. The Indo-European family is a major topic of study. As of January, 2012, they had collected and coded a "screened" database of "22 phonological characters, 13 morphological characters, and 259 lexical characters," and an unscreened database of more. Wordlists of 24 Indo-European languages are included. Larger numbers of features and languages increase
4600-423: The east of Eden, in the direction of Adam's exit from Eden. Then follows Jones' group, still without a name, but attributed to Jones: "Another ancient and extensive class of languages united by a greater number of resemblances than can well be altogether accidental." For this class he offers a name, "Indoeuropean," the first known linguistic use of the word, but not its first known use. The British East India Company
4692-588: The exceptionlessness of the sound laws and the regularity of the process. The linguist perhaps most responsible for establishing the link to Darwinism was August Schleicher. That he was comparing his Stammbaum , or family tree of languages, to Darwin's presentation of evolution shortly after that presentation, is proved by the open letter he wrote in 1863 to Ernst Haeckel , published posthumously, however. In 1869, Haeckel had suggested he read Origin of Species . After reading it Schleicher wrote Die Darwinische Theorie und die Sprachwissenschaft , "Darwinism tested by
SECTION 50
#17327832280104784-484: The gradual uncovering of an ancient master language from which all the other languages derive. Browne undoubtedly did his writing and thinking well before 1684. In that same revolutionary century in Britain James Howell published Volume II of Epistolae Ho-Elianae , quasi-fictional letters to various important persons in the realm containing valid historical information. In Letter LVIII the metaphor of
4876-435: The greater the number of similarities the more likely species were to be in the same clade. This approach appealed to Greenberg, who was interested in discovering linguistic universals . Altering the tree model to make the family tree a phylogenetic tree he said: "Any language consists of thousands of forms with both sound and meaning ... any sound whatever can express any meaning whatever. Therefore, if two languages agree in
4968-460: The hills, whereabout the ark rested ... their primitive language might in time branch out into several parts of Europe and Asia ...." This is an inkling of a tree. In Browne's view, simplification from a larger aboriginal language than Hebrew could account for the differences in language. He suggests ancient Chinese, from which the others descended by "confusion, admixtion and corruption". Later he invokes "commixture and alteration." Browne reports
5060-405: The horse race). The numeral aika "one" is of particular importance because it places the superstrate in the vicinity of Indo-Aryan proper as opposed to Indo-Iranian in general or early Iranian (which has aiva ). Another text has babru ( babhru , "brown"), parita ( palita , "grey"), and pinkara ( pingala , "red"). Their chief festival was the celebration of the solstice ( vishuva ) which
5152-611: The language of paradise, and brands it as mainly speculative. Young's designation, successful in English, was only one of several candidates proposed between 1810 and 1867: indo-germanique ( Conrad Malte-Brun , 1810), japetisk ( Rasmus Christian Rask , 1815), Indo-Germanisch ( Julius Klaproth , 1823), indisch-teutsch (F. Schmitthenner, 1826), sanskritisch ( Wilhelm von Humboldt , 1827), indokeltisch (A. F. Pott, 1840), arioeuropeo ( Graziadio Isaia Ascoli , 1854), Aryan ( Max Müller , 1861) and aryaque (H. Chavée, 1867). These men were all polyglots and prodigies in languages. (Klaproth, for example,
5244-497: The lineages and alter the classification to reflect them, which it did under the overall guidance of the Nomenclature Codes , rule books kept by international organizations to authorize and publish proposals to reclassify species and other taxa. The new approach was called phylogeny , the "generation of phyla," which devised a new tree metaphor, the phylogenetic tree . One unit in the tree and all its offspring units were
5336-502: The most divergent Indo-Aryan branch. Nevertheless, the modern consensus of Indo-Aryan linguists tends towards the inclusion of Dardic based on morphological and grammatical features. The Inner–Outer hypothesis argues for a core and periphery of Indo-Aryan languages, with Outer Indo-Aryan (generally including Eastern and Southern Indo-Aryan, and sometimes Northwestern Indo-Aryan, Dardic and Pahari ) representing an older stratum of Old Indo-Aryan that has been mixed to varying degrees with
5428-526: The most widely-spoken language in Pakistan. Sindhi and its variants are spoken natively in the Pakistani province of Sindh and neighbouring regions. Northwestern languages are ultimately thought to be descended from Shauraseni Prakrit , with influence from Persian and Arabic . Western Indo-Aryan languages are spoken in central and western India, in states such as Madhya Pradesh and Rajasthan , in addition to contiguous regions in Pakistan. Gujarati
5520-544: The mother tongue runs throughout the nations of Europe, and even as far as Persia, the community on many words, between so many nations, hath more reasonable traduction and were rather derivable from the common tongue diffused through them all, than from any particular nation, which hath also borrowed and holdeth but at second hand." The confusion at the Tower of Babel was thus removed as an obstacle by setting it aside. Attempts to find similarities in all languages were resulting in
5612-570: The network was highest. As it turned out, the number of compatible networks generated might vary from none to over a dozen. However, not all the possible interfaces were historically feasible. Interfaces between some languages were geographically and chronologically not very likely. Inspecting the results, the researchers excluded the non-feasible interfaces until a list of only feasible networks remained, which could be arranged in order of compatibility score. The researchers began with five candidate trees for Indo-European, lettered A-E, one generated from
SECTION 60
#17327832280105704-542: The newer stratum that is Inner Indo-Aryan. It is a contentious proposal with a long history, with varying degrees of claimed phonological and morphological evidence. Since its proposal by Rudolf Hoernlé in 1880 and refinement by George Grierson it has undergone numerous revisions and a great deal of debate, with the most recent iteration by Franklin Southworth and Claus Peter Zoller based on robust linguistic evidence (particularly an Outer past tense in -l- ). Some of
5796-521: The numbers in both cases were necessarily small. The effect was of trying to depict a photograph using a small number of large pixels, or picture units. The limitations of the Tree Model were all too painfully apparent, resulting in complaints from the major historical linguists. In the late 20th century, linguists began using software intended for biological classification to classify languages. Programs and methods became increasingly sophisticated. In
5888-402: The only possible one. Yet it might be that some very ancient language had altered little, and had given rise to few new languages, whilst others (owing to the spreading and subsequent isolation and states of civilisation of the several races, descended from a common race) had altered much, and had given rise to many new languages and dialects. The various degrees of difference in the languages from
5980-424: The perfect phylogeny. If not, then one or more contact edges, or bidirectional interfaces, could be added to the phylogeny. A language therefore might have more than one source of states: the parent or a contact language. A tree so modified was no longer a tree as such: there could be more than one path from root to leaf. The researchers called this arrangement a network. The states of a character still evolved along
6072-459: The phylogenetic software, two modifications of it and two suggested by Craig Melchert , a historical linguist and Indo-Europeanist. The trees differed mainly in the placement of the most ambiguous group, the Germanic languages, and Albanian, which did not have enough distinctive characters to place it exactly. Tree A contained 14 incompatible characters; B, 19; C, 17; D, 21; E,18. Trees A and C had
6164-555: The plains of Eurasia, and although they made a good case that this language can be deduced by the methods of comparative linguistics, in fact that is not how they obtained it. It was the one case in which their efforts to find the Ursprache succeeded. The model is due in its most strict formulation to the Neogrammarians . The model relies on earlier conceptions of William Jones , Franz Bopp and August Schleicher by adding
6256-651: The precision in dating the split between the Indo-Aryan and Iranian languages (as the texts in which the apparent Indicisms occur can be dated with some accuracy). In a treaty between the Hittites and the Mitanni, the deities Mitra , Varuna , Indra , and the Ashvins ( Nasatya ) are invoked. Kikkuli 's horse training text includes technical terms such as aika (cf. Sanskrit eka , "one"), tera ( tri , "three"), panza ( panca , "five"), satta ( sapta , seven), na ( nava , "nine"), vartana ( vartana , "turn", round in
6348-437: The precision, provided they meet certain criteria. Using specialized computer software, they test various phylogenetic hypotheses for their ability to account for the characters by genetic descent. One endemic limitation of the tree model is the very founding presumption on which it is based: it requires a classification based on languages or, more generally, on language varieties . Since a variety represents an abstraction from
6440-581: The predecessor of Old Indo-Aryan (1500–300 BCE), which is directly attested as Vedic and Mitanni-Aryan . Despite the great archaicity of Vedic, however, the other Indo-Aryan languages preserve a small number of conservative features lost in Vedic . Some theonyms, proper names, and other terminology of the Late Bronze Age Mitanni civilization of Upper Mesopotamia exhibit an Indo-Aryan superstrate. While what few written records left by
6532-429: The researchers. In order to find the factors that did bear on phylogeny the researchers needed to have some measure of the accuracy of their results; i.e., the results needed to be calibrated against known phylogenies. They ran the experiment using different assumptions looking for the ones that would produce the closest matches to the most secure Indo-European phylogenies. Those assumptions could be used on problem areas of
6624-403: The resultant tree into other hypotheses to be tested. None of the original candidate trees were perfect phylogenies, although some of the subtrees within them were. The next phase was to generate networks from the trees of highest compatibility scores by adding interfaces one at a time, selecting the interface of highest compatibility, until sufficiency was obtained; that is, the compatibility of
6716-406: The roots of the verbs and in the forms of grammar, than could possibly have been produced by accident; ...." He went on to postulate that they sprang from "some common source, which, perhaps, no longer exists." To them he added Gothic, Celtic and Persian as "to the same family." Jones did not name his "common source" nor develop the idea further, but it was taken up by the linguists of the times. In
6808-462: The same stock, would have to be expressed by groups subordinate to groups; but the proper or even only possible arrangement would still be genealogical; and this would be strictly natural, as it would connect together all languages, extinct and modern, by the closest affinities, and would give the filiation and origin of each tongue." Schleicher had never heard of Darwin before Haeckel brought him to Schleicher's attention. He had published his own work on
6900-405: The software and method to include the possibility of borrowing. The researchers introduced into the experiment the concept of the interface, or allowed boundary over which character states would flow. A one-way interface, or edge, existed between a parent and a child. If only one-way edges were sufficient to explain the presence of all the states in a language, then there was no need to look beyond
6992-473: The software; therefore characters that were subject to back formation and parallel development, which reverted a character to a prior state or adopted a state that evolved in another character, respectively, were screened from the input dataset. Despite their care to code the best qualitative characters in sufficient numbers, the researchers could obtain no perfect phylogenies for some groups, such as Germanic and Albanian within Indo-European. They reasoned that
7084-464: The state values for all of the characters considered. These coordinates can be like each other or less so. Languages that share the most states are most like each other. The software massages all the states of all the characters of all the languages by one of several mathematical methods to accomplish a pairwise comparison of each language with all the rest. It then constructs a cladogram based on degrees of similarity; for example, hypothetical languages,
7176-429: The states are accounted for by descent from other states. A cladogram that conforms to these requirements is a perfect phylogeny. At first there seemed to be little consistency of results in trials varying the factors presumed to be relevant. A new cladogram resulted from any change, which suggested that the method was not capturing the underlying evolution of languages but only reflecting the extemporaneous judgements of
7268-545: The theory's skeptics include Suniti Kumar Chatterji and Colin P. Masica . The below classification follows Masica (1991) , and Kausen (2006) . Percentage of Indo-Aryan speakers by native language: The Dardic languages (also Dardu or Pisaca) are a group of Indo-Aryan languages largely spoken in the northwestern extremities of the Indian subcontinent. Dardic was first formulated by George Abraham Grierson in his Linguistic Survey of India but he did not consider it to be
7360-453: The time of Peleg , son of Heber, son of Shem , son of Noah. Augustine made a hypothesis not unlike those of later historical linguists, that the family of Heber "preserved that language not unreasonably believed to have been the common language of the race ... thenceforth named Hebrew." Most of the 72 languages, however, date to many generations after Heber. St. Augustine solves this first problem by supposing that Heber, who lived 430 years,
7452-402: The total number of native speakers of the Indo-Aryan languages at nearly 900 million people. Other estimates are higher suggesting a figure of 1.5 billion speakers of Indo-Aryan languages. The Indo-Aryan family as a whole is thought to represent a dialect continuum , where languages are often transitional towards neighboring varieties. Because of this, the division into languages vs. dialects
7544-438: The totality of linguistic features , there is the possibility for information loss during the translation of data (from a map of isoglosses ) into a tree. For example, there is the issue of dialect continua . They provide varieties that are not unequivocally one language or another but contain features characteristic of more than one. The issue of how they are to be classified is similar to the issue presented by ring species to
7636-430: The tree model involves mixed and hybrid languages, as well as language mixing in general since the tree model allows only for divergences. For example, according to Zuckermann (2009:63), "Israeli", his term for Modern Hebrew , which he regards as a Semito-European hybrid, "demonstrates that the reality of linguistic genesis is far more complex than a simple family tree system allows. 'Revived' languages are unlikely to have
7728-404: The tree model that incorporates horizontal transmission. The tree model also has the same limitations as biological taxonomy with respect to the species problem of quantizing a continuous phenomenon that includes exceptions like ring species in biology and dialect continua in language. The concept of a linkage was developed in response and refers to a group of languages that evolved from
7820-554: The western Gangetic plains , including Delhi and parts of the Central Highlands , where they are often transitional with neighbouring lects. Many of these languages, including Braj and Awadhi , have rich literary and poetic traditions. Urdu , a Persianised derivative of Dehlavi descended from Shauraseni Prakrit , is the official language of Pakistan and also has strong historical connections to India , where it also has been designated with official status. Hindi ,
7912-530: The widely divergent families of languages remained unclosed. On February 2, 1786, Sir William Jones delivered his Third Anniversary Discourse to the Asiatic Society as its president on the topic of the Hindus . In it he applied the logic of the tree model to three languages, Greek, Latin and Sanskrit, but for the first time in history on purely linguistic grounds, noting "a stronger affinity, both in
8004-587: Was common in most cultures in the ancient world. The Mitanni warriors were called marya , the term for "warrior" in Sanskrit as well; note mišta-nnu (= miẓḍha , ≈ Sanskrit mīḍha ) "payment (for catching a fugitive)" (M. Mayrhofer, Etymologisches Wörterbuch des Altindoarischen , Heidelberg, 1986–2000; Vol. II:358). Sanskritic interpretations of Mitanni royal names render Artashumara ( artaššumara ) as Ṛtasmara "who thinks of Ṛta " (Mayrhofer II 780), Biridashva ( biridašṷa, biriiašṷ a) as Prītāśva "whose horse
8096-460: Was not feasible. Tree A, the most compatible and feasible tree, hypothesizes seven groups separating from Proto-Indo-European between about 4000 BC and 2250 BC, as follows. Trees B and E offer the alternative of Proto-Germano-Balto-Slavic (northern Indo-European), making Albanian an independent branch. The only date for which authors vouch is the last, based on the continuity of the Yamna culture ,
8188-441: Was still alive when God assigned the 72. St. Augustine's hypothesis stood without major question for over a thousand years. Then, in a series of tracts, published in 1684, expressing skepticism concerning various beliefs, especially Biblical, Sir Thomas Browne wrote: "Though the earth were widely peopled before the flood ... yet whether, after a large dispersion, and the space of sixteen hundred years, men maintained so uniform
8280-404: Was synchronic; Darwin envisioned a diachronic process of common descent . Where Linnaeus had conceived ranks , which were consistent with the great chain of being adopted by the rationalists , Darwin conceived lineages . Over the decades after Darwin it became clear that the ranks of Linnaeus' hierarchy did not correspond exactly to the lineages. It became the prime goal of taxonomy to discover
8372-509: Was the language of the Goths and Vandals , and continueth yet of the greatest part of Poland and Hungary , who have a Dialect of hers for their vulgar tongue ... Some of her writers would make this world believe that she was the language spoken in paradise." The search for "the language of paradise" was on among all the linguists of Europe. Those who wrote in Latin called it the lingua prima ,
8464-608: Was using "Indo-European commerce" to mean the trade of commodities between India and Europe. All the evidence Young cites for the ancestral group are the most similar words: mother, father, etc. Adelung's additional classes were the Tataric (which would later be known as the disputed family Altaic ), the African and the American, which depend on geography and a presumed descent from Eden. Young does not share Adelung's enthusiasm for
#9990