A word is a unit of language that carries meaning and consists of one or more morphemes which are linked more or less tightly together. Typically a word will consist of a root or stem and zero or more affixes. Words can be combined to create phrases, clauses and sentences. A word consisting of two or more stems joined together is called a compound.
Even in writing systems that use interword separation, word boundaries are not always clear; for example, even though ice cream is written like two words, it is a single compound because it cannot be separated by another morpheme or rephrased like iced cream or cream of ice. Likewise, a proper noun is a word, however long it is. A space may not be even the main morpheme boundary in a word; the word New Yorker is a compound of New York and -er, not of New and Yorker. In English, many common words have historically progressed from being written as two separate words (e.g. to day) to hyphenated (to-day) to a single word (today), a process which is still ongoing, as in the common spelling of all right as alright.
In synthetic languages, a single word stem (for example, love) may have a number of different forms (for example, loves, loving, and loved). However, these are not usually considered to be different words, but different forms of the same word. In these languages, words may be considered to be constructed from a number of morphemes (such as love and -s).
In polysynthetic languages, the number of morphemes per word can become so large that the word performs the same grammatical role as a phrase or clause in less synthetic languages (for example, in Yupik, angyaghllangyugtuq means "he wants to acquire a big boat"). These large-construction words are still single words, because they contain only one content word; the other morphemes are grammatical bound morphemes, which cannot stand alone.
Matters seem easier for analytic languages. For these languages, a word usually consists of only a root morpheme, which is often single-syllable. However, it is common even in those languages to combine roots into a compound stem.
In spoken language, the distinction of individual words is even more complex: short words are often run together, and long words are often broken up. Spoken French has some of the features of a polysynthetic language: je ne le sais pas ("I do not know it") tends towards //. As the majority of the world's languages are not written, the scientific determination of word boundaries becomes important.
There are five ways to determine where the word boundaries of spoken language should be placed:
In practice, linguists apply a mixture of all these methods to determine the word boundaries of any given sentence. Even with the careful application of these methods, the exact definition of a word is often still elusive.
Linguistic morphology | Syntax
Дума | Paraula | Slovo | Ord | Wort | Sõna | Palabra | Vorto | واژه | Sana | Mot | מלה | Vorto | Parola | Vocabulo | Збор | Woord | 語 | ord | ord | Paraula | Wyraz | Palavra | Cuvînt | Слово | Word | Beseda | Slovo (lingvistika) | Ord | Слово | װאָרט | 词语