This article deals with the phonology (i.e. the sound system) of the Japanese language.
| | Bilabial | Dental | Alveolar | Post-alveolar | Palatal | Velar | Glottal | Placeless |
|---|---|---|---|---|---|---|---|---|
| Stop | | | | | | | | |
| Flap | | | | | | | | |
| Fricative | | | | | | | | |
| Affricate | | | | | | | | |
| Nasal | | | | | | | | |
| Approximant | | | | | | | | |
Note that this table does not cover all the consonantal variation in the Japanese language. Please refer below for the details of pronunciation.
Japanese has five vowels, transliterated as a, i, u, e, and o.
Japanese vowels are pronounced as monophthongs, unlike in English; they are similar to their Spanish or Italian counterparts. However, the high back vowel is somewhat centralized and "compression rounded", rather than protrusion rounded like [u] or unrounded like [ɯ]. More precisely, it is pronounced with the lips compressed toward each other but not spread to the sides. The IPA transcriptions on the right side of the vowel diagram are those suggested by Okada (1999). Note, however, that there is no IPA symbol for lip compression, so no transcription will be complete. This vowel is transliterated as u.
Japanese a is a low, non-palatal, non-retracted vowel. It lies between the English a in "father" and the English a in "dad." The Japanese o is a "flat" o, unlike the English one, which is a diphthong; try to keep your tongue lowered while pronouncing it. The i is like the English ee in "feet." To English speakers, the e sounds like a mix between the short e in "bed" and the vowel in "lay," though it is closer to the former than the latter.
Vowels have a phonemic length distinction (i.e., short vs. long). Compare contrasting pairs of words such as ojisan "uncle" vs. ojiisan "grandfather", or tsuki "moon" vs. tsūki "airflow".
In most phonological analyses, all vowels are treated as occurring within the time frame of one mora. Phonetically long vowels, then, are treated as a sequence of two identical vowels; i.e., the long vowel of ojiisan is analyzed as /ii/, not /iː/.
Although the phonotactics of Japanese lead some to believe that the language lacks diphthongs, this is not correct. A diphthong is defined as "two vowels pronounced in one syllable," so Japanese, like many other languages, has them. However, unlike in English, where diphthongs are perceived as single vowels, Japanese diphthongs are perceived as sequences of two different vowels, and these sequences are phonetically different from English diphthongs. In English, a diphthong such as the one in eye is pronounced as a vowel with a following off-glide, [aɪ] or [aj], while in Japanese the sequence in ai 愛 'love' is pronounced as [ai] (as in naïve), with each vowel segment of equal length. A glide plus a vowel is analyzed as a sequence of consonant and vowel. Furthermore, unlike English, Japanese distinguishes between au 'to meet' and ao 'blue,' between ai 'love' and ae 'dressing,' and between oi, e.g., koi 'carp,' and oe, e.g., koe 'voice.' Because Japanese has long vowels, it also distinguishes between "regular" and "long" diphthongs, e.g., between oi 'hey' and ōi 'to be numerous.'
Within words and phrases, Japanese allows long sequences of phonetic vowels without intervening consonants, although the pitch accent and slight rhythm breaks help track the timing when the vowels are identical.
Japanese contains a number of phonological processes which greatly alter the phonetic realization of consonants and vowels. A few are listed below.
Non-coronal voiced stops between vowels may be weakened to fricatives, especially in fast and/or casual speech:
| /b/ → bilabial fricative [β]: | abareru 暴れる 'to behave violently' |
| /ɡ/ → velar fricative [ɣ]: | hage はげ 'baldness' |
However, /ɡ/ is further complicated by its variant realization as a velar nasal [ŋ]. Standard Japanese speakers can be categorized into three groups (A, B, C), which are explained below. If a speaker pronounces a given word consistently with the allophone [ŋ] (i.e. a B-speaker), that speaker will never have [ɣ] as an allophone in that same word. If a speaker varies between [ŋ] and [ɡ] (i.e. an A-speaker) or is generally consistent in using [ɡ], then the velar fricative [ɣ] is always another possible allophone in fast speech.
/ɡ/ may be weakened to the nasal [ŋ] when it occurs within words: this includes not only the position between vowels but also the position between a vowel and a consonant. There is a fair amount of variation between speakers, however. Some, such as Vance (1987), have suggested that the variation follows social class; others, such as Akamatsu (1997), suggest that the variation follows age and geographic location. The generalized situation is as follows.
At the beginning of words:
In the middle of simple words (i.e. non-compounds):
In the middle of compound words morpheme-initially:
So, for some speakers the following two words are a minimal pair while for others they are homophonous:
To summarize using the example of hage はげ 'baldness':
The palatals /i/ and /j/ palatalize the consonants they follow:
| /m/ → palatalized [mʲ]: | umi 海 'sea' |
| /ɡ/ → palatalized [ɡʲ]: | gyōza ぎょうざ 'fried dumpling' |
| etc. |
The coronals /s/, /z/, /n/, and /t/ and the glottal /h/ are affected as follows:
| /s/ → alveolopalatal fricative [ɕ]: | shio 塩 'salt' |
| /z/ → alveolopalatal [dʑ] or [ʑ]: | jishin 地震 'earthquake'; gojuu 五十 'fifty' |
| /n/ → alveolopalatal nasal: | niwa 庭 'garden' |
| /t/ → alveolopalatal affricate [tɕ]: | chijin 知人 'acquaintance' |
| /h/ → palatal fricative [ç]: | hito 人 'person' |
Of the allophones of /z/, the affricate [dʑ] is most common, especially at the beginning of utterances and after /N/ (or /n/, depending on the analysis), while the fricative [ʑ] may occur between vowels. Both sounds, however, are in free variation. The (laminodorso-)alveolopalatal nasal allophone differs from a palatalized apico-dental nasal, a palatalized apico-alveolar nasal, or a palatal nasal. Similarly, while symbols for palatal stops may be encountered in transcriptions of the affricates, they are not strictly correct, because the Japanese sounds are articulated further forward, as alveolopalatals. Since there are no IPA symbols for alveolopalatal stops, [tɕ] and [dʑ] are reasonable compromises, if properly explained.
In the case of /s/, /z/, and /t/ followed by /j/, the consonants were historically palatalized, with /j/ merging with the consonant into a single pronunciation. In modern Japanese, these have become separate phonemes:
| /ɕ/ (romanized as sh): | shabon シャボン 'soap' |
| /dʑ/ (romanized as j): | jagaimo じゃがいも 'potato' |
| /tɕ/ (romanized as ch): | cha 茶 'tea' |
The vowel /u/ also affects consonants that it follows:
| /h/ → bilabial fricative [ɸ]: | futa ふた 'lid' |
| /t/ → dental affricate [ts]: | tsugi 次 'next' |
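The effect of these rules is visible in the romanization used in this article (shio, jishin, chijin, futa, tsugi, shabon, cha). The following is a minimal sketch, not a full romanization tool: it assumes a hypothetical phonemic-style input spelling in which the underlying consonant is written unchanged (for example "tugi" for tsugi), and the function name `to_hepburn` and its rule table are illustrative assumptions. It covers only the alternations that show up in the spelling, so the change in hito, which the romanization does not record, is left alone.

```python
import re

# Hypothetical phonemic-style spellings mapped to the romanization used in
# this article. Only alternations that are visible in the spelling are listed.
RULES = {
    "sy": "sh",   # s + palatal glide, e.g. syabon -> shabon
    "zy": "j",    # z + palatal glide, e.g. zyagaimo -> jagaimo
    "ty": "ch",   # t + palatal glide, e.g. tya -> cha
    "si": "shi",  # s before i, e.g. sio -> shio
    "zi": "ji",   # z before i, e.g. zisin -> jishin
    "ti": "chi",  # t before i, e.g. tizin -> chijin
    "tu": "tsu",  # t before u, e.g. tugi -> tsugi
    "hu": "fu",   # h before u, e.g. huta -> futa
}
PATTERN = re.compile("|".join(RULES))


def to_hepburn(phonemic: str) -> str:
    """Rewrite a phonemic-style spelling in a single left-to-right pass."""
    # re.sub scans the original string only, so text produced by one rule
    # (such as the "hu" inside "shu") is never rewritten by another rule.
    return PATTERN.sub(lambda m: RULES[m.group()], phonemic)


if __name__ == "__main__":
    for word in ["sio", "zisin", "tizin", "huta", "tugi", "syabon", "tya"]:
        print(word, "->", to_hepburn(word))
```

Doing the substitution in one pass, rather than chaining several replace calls, avoids turning an already rewritten "shu" into "sfu".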
Some analyses of Japanese treat the moraic nasal as the archiphoneme /N/. However, other, less abstract approaches treat a syllable-final nasal as a regular coronal /n/. In either case, it always follows vowels (never consonants) and undergoes a variety of assimilatory processes. Within words, it is variously:
Some speakers produce before , while others produce a nasalized vowel before (see Akamatsu 1997).
In some analyses of Japanese, the archiphoneme /Q/ is posited. However, not all scholars agree that this is the best analysis. In those approaches that incorporate the moraic obstruent, it is said to assimilate completely to the following obstruent, resulting in a geminate (that is, double) consonant. The assimilated /Q/ remains unreleased, and thus the geminates are phonetically long consonants. /Q/ does not occur before vowels or nasal consonants. This archiphoneme has a wide variety of phonetic realizations, for example:
| before p: | nippon 日本 'Japan' |
| before py: | happyaku '800' |
| before s: | kassen 合戦 'battle' |
| before ch: | satchi 察知 'inference' |
| etc. |
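The assimilation of the moraic obstruent can be sketched in a few lines. The example below is only an illustration under stated assumptions: the input notation (a romanized word with a capital "Q" standing for the archiphoneme), the function name `realize_q`, and the spelling convention that writes the geminate before ch as t (as in satchi above) are choices made for this sketch, not part of any standard tool.

```python
VOWELS = set("aiueo")
NASALS = set("mnN")  # plain nasals plus "N" for the moraic nasal


def realize_q(phonemic: str) -> str:
    """Replace each "Q" with a copy of the obstruent that follows it."""
    out = []
    for i, ch in enumerate(phonemic):
        if ch != "Q":
            out.append(ch)
            continue
        nxt = phonemic[i + 1 : i + 2]  # empty string at the end of the word
        # The text states that /Q/ does not occur before vowels or nasals;
        # word-final Q is also rejected here because nothing follows it.
        if not nxt or nxt in VOWELS or nxt in NASALS:
            raise ValueError(f"Q cannot stand at position {i} in {phonemic!r}")
        # Spelling convention assumed here: the geminate before "ch" is "t".
        out.append("t" if phonemic.startswith("ch", i + 1) else nxt)
    return "".join(out)


if __name__ == "__main__":
    for word in ["niQpon", "haQpyaku", "kaQsen", "saQchi"]:
        print(word, "->", realize_q(word))
```

Raising an error for an impossible environment, instead of silently dropping the Q, keeps the sketch honest about the restriction described above.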
Another analysis of Japanese dispenses with /Q/ and other archiphonemes entirely. In this approach, the words above are phonemicized as shown below:
| before : | → nippon 日本 'Japan' | ||
| before : | → happyaku '800' | ||
| before : | → kassen 合戦 'battle' | ||
| before : | → satchi 察知 'inference' | ||
| etc. |
Japanese vowels, especially /i/ and /u/, tend to be devoiced when they occur between voiceless consonants, except when they are in accented moras. Additionally, /i/ and /u/ are optionally devoiced following a voiceless consonant at the end of an utterance.
| kutsu 靴 'shoe' |
| suhada すはだ 'bare skin' (the u is not devoiced, since it is accented) |
| hikan 悲観 'pessimism' |
| himo 紐 'string' (the i is not devoiced, since it is accented) |
| hikaku 比較 'comparison' |
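The devoicing rule for the high vowels can be approximated in code. This is a rough sketch under stated assumptions, not a phonetic model: it works on lowercase romanized words, treats the letters k, s, t, p, h, f, and c as spelling voiceless consonants, and takes the accented vowel as a character index supplied by the caller. The function name `devoiced_vowels` and its parameters are hypothetical.

```python
VOICELESS = set("kstphfc")  # letters that spell voiceless consonants
HIGH_VOWELS = set("iu")


def devoiced_vowels(word, accented=None, utterance_final=False):
    """Return the indices of i/u that are candidates for devoicing."""
    hits = []
    for i, ch in enumerate(word):
        if ch not in HIGH_VOWELS or i == accented:
            continue  # only unaccented high vowels are considered
        if i == 0 or word[i - 1] not in VOICELESS:
            continue  # a voiceless consonant must precede the vowel
        if i == len(word) - 1:
            # Optional devoicing at the end of an utterance.
            if utterance_final:
                hits.append(i)
        elif word[i + 1] in VOICELESS:
            # Vowel sandwiched between two voiceless consonants.
            hits.append(i)
    return hits


if __name__ == "__main__":
    print(devoiced_vowels("kutsu"))
    print(devoiced_vowels("kutsu", utterance_final=True))
    print(devoiced_vowels("suhada", accented=1))
```

The sketch reports candidates only; as the text notes, the utterance-final case is optional, and real speech varies by speaker and register.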
To a lesser extent, /o/ may be devoiced (other non-high vowels even more rarely), with the further requirement that there be two or more adjacent moras containing the vowel.
| kokoro 心 'heart' |
Devoicing is common even in normal, slow speech and is not restricted to fast speech.
The common sentence-ending copula desu is pronounced with its final vowel devoiced, roughly des.
Gender roles also play a part: it is regarded as effeminate to fully pronounce vowels that would normally be devoiced, particularly the final u, as in arimasu. Basilectal varieties of Japanese can sometimes be recognized by their hyper-devoicing, while in some Western dialects and some registers of formal speech, every vowel is pronounced.
Japanese vowels are slightly nasalized when adjacent to the nasals /m, n/. Before the moraic nasal /N/, vowels are heavily nasalized:
| seisan 生産 'production' |
At the beginning and end of utterances, Japanese vowels may be preceded and followed by a glottal stop [ʔ], respectively. This is demonstrated below with the following words (as pronounced in isolation):
| en 円 'yen' |
| kishi 岸 'shore' |
| u 鵜 'cormorant' |
When an utterance-final word is uttered with emphasis, this glottal stop is plainly audible, and is often indicated in the writing system with a small letter tsu っ called a sokuon.
If the language is analyzed as a system of morae (or moras) rather than syllables, as the katakana and hiragana phonetic writing systems explicitly do, the sound structure is very simple: the language is made of morae, each with approximately the same time value and stress (stress here being correlated with loudness, not pitch). A Japanese mora may consist of either a vowel or one of the two moraic consonants, /N/ and /Q/. (The less abstract analysis that dispenses with archiphonemes defines the possible moraic consonants as any voiceless obstruent, or a nasal, in syllable coda position. Scholars disagree over whether the coda nasal is limited to /n/ or can also include other nasals.) A vowel may be preceded by an optional (non-moraic) consonant, with or without a palatal glide /j/.
| Mora type | Example | Morae per word |
|---|---|---|
| V | i 胃 'stomach' | 1-mora word |
| CV | te 手 'hand' | 1-mora word |
| CjV | kya きゃ '(surprised or scared scream)' | 1-mora word |
| N | yon 四 'four' | 2-mora word |
| Q | mittsu 三つ 'three' | 3-mora word |
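The mora types in the table can be recovered from a romanized word with a small tokenizer. The sketch below is illustrative only and rests on several assumptions: the input is lowercase romanization without macrons, any consonant letter that is not followed by a vowel is taken to be a moraic consonant, and the names `MORA`, `split_morae`, and `mora_types` are hypothetical. Digraph spellings such as ch and sh are treated as a single consonant, so a word like cha is labelled CV here.

```python
import re

# An optional consonant (including the digraphs ch, sh, ts), an optional
# palatal glide y, and a vowel; otherwise a lone letter, which is read as
# a moraic consonant.
MORA = re.compile(r"(?:ch|sh|ts|[kgsztdnhbpmyrwfj])?y?[aiueo]|[a-z]")


def split_morae(word: str) -> list:
    """Split a lowercase romanized word into morae."""
    return MORA.findall(word)


def mora_types(word: str) -> list:
    """Label each mora as V, CV, CjV, N, or Q, following the table."""
    labels = []
    for mora in split_morae(word):
        if mora in "aiueo":
            labels.append("V")
        elif mora[-1] in "aiueo":
            # A glide counts as "j" only when a consonant precedes it.
            labels.append("CjV" if len(mora) > 2 and mora[-2] == "y" else "CV")
        elif mora == "n":
            labels.append("N")
        else:
            labels.append("Q")
    return labels


if __name__ == "__main__":
    for word in ["i", "te", "kya", "yon", "mittsu"]:
        print(word, split_morae(word), mora_types(word))
```

Counting the labels gives the number of morae per word, for example two for yon and three for mittsu.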
Consonantal morae are restricted from occurring word-initially, though utterances starting with the moraic nasal are possible. Vowels may be long, and consonants may be geminate (doubled). Geminate consonants are limited to a sequence of /Q/ plus a voiceless obstruent, though some words are written with geminate voiced obstruents. In the analysis without archiphonemes, geminate clusters are simply two identical consonants, one after the other.
In the writing system, each kana corresponds to a mora. The moraic /Q/ (i.e., the first half of a geminate cluster) is indicated by a small "tsu" symbol called a sokuon (a small ッ in katakana, or っ in hiragana). Long vowels are usually indicated in katakana by a long dash following the first vowel, as in sābisu サービス 'service'. The direction of this dash follows the direction of writing.
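Because each kana corresponds to a mora, a mora count can be estimated directly from kana text. The sketch below is a simplification under stated assumptions: it treats the small glide kana (ゃ, ゅ, ょ and their katakana forms) as part of the preceding mora, counts the sokuon and the long-vowel dash as one mora each, and ignores other small kana used in loanwords. The names `SMALL_GLIDE_KANA` and `count_morae` are hypothetical.

```python
import unicodedata

# Small kana that combine with the preceding kana into a single CjV mora.
SMALL_GLIDE_KANA = set("ゃゅょャュョ")


def count_morae(kana: str) -> int:
    """Count morae in a kana string, one per kana except small glide kana."""
    # NFC keeps voiced kana such as ビ as single characters, so a
    # decomposed voicing mark is not counted as a separate mora.
    text = unicodedata.normalize("NFC", kana)
    return sum(1 for ch in text if ch not in SMALL_GLIDE_KANA)


if __name__ == "__main__":
    for word in ["サービス", "きゃ", "みっつ", "よん"]:
        print(word, count_morae(word))
```

The sokuon and the long-vowel dash need no special handling, since each already occupies one character and one mora.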
In English, stressed syllables in a word are pronounced louder, longer, and with higher pitch, while unstressed syllables are relatively shorter in duration. In Japanese, all morae are pronounced with equal length and loudness. Japanese is therefore said to be a mora-timed language.
On the other hand, since all syllables have equal stress in Japanese, some unstressed syllables in European languages tend to be inaudible to the Japanese ear, leading to confusion.
(Compare to the syllable system of Finnish and Italian.)
See also Japanese pitch accent.
Japanese does have a distinct intonation pattern. This pattern can be heard not only in individual words, but also in whole sentences. Intonation is produced by a rise and fall in pitch over certain syllables. In the case of questions, the Japanese intonation patterns bear little resemblance to the English ones. This is a large source of confusion for many non-native speakers.
The Japanese intonation pattern varies with regional dialect.
This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article "Japanese phonology".