Tocharian is one of the most obscure branches of the group of Indo-European languages. It consisted of two languages, Tocharian A (Turfanian, Arsi, or East Tocharian) and Tocharian B (Kuchean or West Tocharian). These languages were spoken roughly from the 6th to 8th centuries; before they became extinct, their speakers were absorbed into the expanding Uyghur tribes.
Both languages were once spoken in the Tarim Basin in Central Asia, now the Xinjiang province of China. The name of the language is taken from the Tocharians (Greek: Τόχαροι, "Tokharoi") of the Greek historians (Ptolemy VI, 11, 6). These are sometimes identified with the Yuezhi and the Kushans, and the term Tokharistan usually refers to 1st millennium Bactria. A Turkic text refers to the Turfanian language (Tocharian A) as twqry. Interpretation is difficult, but F. W. K. Müller has associated this with the name of the Bactrian Tokharoi. In Tocharian, the language is referred to as arish-käna and the Tocharians as arya.
Note that the above consonantal values are largely based on the writing of Sanskrit/Prakrit loanwords. A retroflex value for /ʂ/ is particularly suspect as it has nothing to do with /r/ historically, being derived from palatalized /s/; it was probably a low-frequency sibilant // (like German spelling as opposed to the higher-frequency sibilant // (like Mandarin pin-yin spelling [x).
Tocharian is documented in manuscript fragments, mostly from the 8th century (with a few earlier ones) that were written on palm leaves, wooden tablets and Chinese paper, preserved by the extremely dry climate of the Tarim Basin. Samples of the language have been discovered at sites in Kucha and Karasahr, including many mural inscriptions.
Tocharian A and B are not intercomprehensible. Properly speaking, based on the tentative interpretation of twqry as related to Tokharoi, only Tocharian A may be referred to as Tocharian, while Tocharian B could be called Kuchean (its native name may have been kuśiññe), but since their grammars are usually treated together in scholarly works, the terms A and B have proven useful. The common Proto-Tocharian language must precede the attested languages by several centuries, probably dating to the 1st millennium BC.
The alphabet the Tocharians were using is derived from the North Indian Brahmi alphabetic syllabary (abugida) and is referred to as slanting Brahmi. It soon became apparent that a large proportion of the manuscripts were translations of known Buddhist works in Sanskrit and some of them were even bilingual, facilitating decipherment of the new language. Besides the Buddhist and Manichaean religious texts, there were also monastery correspondence and accounts, commercial documents, caravan permits, and medical and magical texts, and one love poem. Many Tocharians embraced Manichaean duality or Buddhism.
Tocharian has upset some theories about the relations of Indo-European languages and is revitalizing linguistic studies. The Tocharian languages are a major geographic exception to the usual pattern of Indo-European branches, being the only one that spread directly east from the theoretical Indo-European starting point in the Pontic steppe.
Tocharian probably died out after 840, when the Uyghurs were expelled from Mongolia by the Kirghiz, retreating to the Tarim Basin. This theory is supported by the discovery of translations of Tocharian texts into Uyghur. During Uyghur rule, the peoples mixed with the Uyghurs to produce much of the modern population of what is now Xinjiang.
| Tocharian vocabulary (sample) | ||||||||||||
| Modern English | Tocharian A | Tocharian B | Irish | Latin | Ancient Greek | Sanskrit | *Proto-Indo-European | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| one | sas | e | aon | ūnus | heis | eka | ||||||
| two | wu | wi | dó | duo | duo | dvāu | *duoh1 | |||||
| three | tre | trai | trí | trēs | treis | tri | *treyes | |||||
| four | śtwar | śtwer | ceathair | quattuor | téssares | catvāras | *kwetwores | |||||
| five | päñ | piś | cúig | quīnque | pente | pañka | *penkwe | |||||
| six | äk | kas | sé | sex | héx | * | ||||||
| seven | pät | ukt | seacht | septem | heptá | saptá | *septm | |||||
| eight | okät | okt | hocht | octō | októ | * | ||||||
| nine | ñu | ñu | naoi | novem | ennéa | náva | *newn | |||||
| ten | śäk | śak | deich | decem | deka | daśa | * | |||||
| hundred | känt | kante | cead | centum | hekatón | śatám | * | |||||
| father | pācar | pācer | athair | pater | patēr | pitá | *ph2tēr | |||||
| mother | mācar | mācer | máthair | mater | mētér | mātá | *me2tēr | |||||
| brother | pracar | procer | bráthair | frāter | phrátēr¹ | bhrātā | *bhre2tēr | |||||
| sister | ar | er | siúr | soror | éor¹ | svas | *swesor | |||||
| (horse)³ | yuk | yakwe | each | equus | híppos | áśva | * | |||||
| cow | ko | keu | bó | bos² | boûs | gāús | *gwou- | |||||
| (voice)² | vak | vek | focal¹ | vōx | épos¹ | vāk | *wekw- | |||||
| to milk | malk | mälk | bligh | mulgēre | amélgein | marjati¹ | *melg- | |||||
| name | ñom | ñem | ainm | nomen | ónoma | nāman | *nomn | |||||
¹ = Cognate, with shifted meaning ² = Borrowed cognate, not native. ³ = English meaning, unrelated word
Medieval languages | Indo-European languages | Languages of China | Central Asia | Extinct languages of Asia | Tocharians
Togaars | Tocariu | Tokhari | Tocharische Sprache | Τοχαρικά | Idioma tocario | 토카리아어 | Toharski jezici | Bahasa Tokharia | Tokkaríska | Tocario | Tochaars | トカラ語 | Tokariske språk | Języki tocharskie | Тохарские языки | Тохарски jeзици | Tokaarilaiset kielet | Tochariska språk | ภาษากลุ่มโตคาเรียน | Toharca | 吐火罗语
This article is licensed under the GNU Free Documentation License.
It uses material from the
"Tocharian languages".
Home Page • arts • business • computers • games • health • hospitals • home • kids & teens • news • physicians • recreation• reference • regional • science • shopping • society • sports • world