Umlaut
The term umlaut is used for two closely related notions: a special kind of vowel modification and a particular diacritic mark.
Vowel modification
In linguistics, the process of umlaut (from German um- "around", "transformation" + laut "sound") is a modification of a vowel which causes it to be pronounced more to the front of the mouth to accommodate a vowel in the following syllable, especially when that syllable is an inflectional suffix. This process is found in many — especially Germanic — languages.
For example, the German noun Mann (man) with the a pronounced as in English "father", becomes Männer [m'En@r, m'En6] in the plural, with the ä pronounced like the ai in "hair", a front vowel sound that is assimilated to the vowel in the -er suffix. The original conditioning environment in German was an i or j in the following syllable (the plural suffix originally was -ir). Later, umlaut acquired a grammatical function and was extended by analogy, for example to form plurals like Ofen ['o:f@n] / Öfen ['2:f@n] ("oven"/"ovens"). Note that English, being a Germanic language, has preserved some of these changes in irregular inflected forms such as man/men, tooth/teeth, long (adj)/length (n), ox/oxen, etc., even though it has lost the suffixes that originally caused them, and has changed their spelling. In English, the process was called i-mutation.
An umlaut should be distinguished from a change in vowel indicating a difference in grammatical function, called an ablaut, as in sing/sang/sung. Ablaut originated in the Proto-Indo-European language, whereas umlaut originated later, in Proto-Germanic. These terms may also be used for similar changes in other language families.
Diacritical mark
The word is also used to refer to the diacritical mark composed of two small dots placed over a vowel (¨) to indicate this change in German (the same mark is used to indicate diaeresis in other languages). The origin of the graphical symbol lies in the following e, which was originally written above in tiny form. In handwritings of the Middle Ages until (in Germany) 1941, the e looked like two tiny strokes (compare the Sütterlin minuscule "e"). In script form this simplified to two bars above the letter. These bars became confused with the diaeresis, and are now normally written as two small dots above. The umlauts are ä, ö, and ü.
In Finnish, Hungarian and North Germanic languages (i.e., Danish, Icelandic, Norwegian and Swedish) characters (ü, ä, ö, and å) looking similar to German umlauts are in fact considered letters in their own right, despite their representing sounds similar to the corresponding sounds in German. As it is not a case of marking grammatical variation, i.e., of tempus or modus, nor of syllable modification, it is in fact neither a case of umlaut nor of diacritical marking. Hence it ought to be improper to call these characters umlauts; however, there is no more precise descriptor in English.
When typing in German, if umlaut letters are not available, they have to be replaced by the underlying vowel and a following e (Ersatzdarstellung). In Switzerland, capital umlauts are sometimes printed as digraphss, i.e., Ae, Oe, Ue, instead of ÃÂ, ÃÂ, ÃÂ.
Entering umlauts in HTML
In HTML umlauts can be entered with an &?uml; entity reference. All umlauts are part of all latin versions of the ISO 8859 character sets and thus have the same codepoints in ISO-8859-1 (-2, -3, -4, -9, -10, -13, -14, -15, -16) and Unicode. See the following table:
| Character | Replacement | HTML entity | Codepoint |
|---|---|---|---|
| ä | ae | ä
| 0x00E4 |
| ö | oe | ö
| 0x00F6 |
| ü | ue | ü
| 0x00FC |
| ÃÂ | Ae | Ä
| 0x00C4 |
| ÃÂ | Oe | Ö
| 0x00D6 |
| ÃÂ | Ue | Ü
| 0x00DC |
In addition to the umlauts given above, the dotted vowels below may, or may not, be valid in different alphabets:
| Character | Replacement | HTML entity | Codepoint |
|---|---|---|---|
| ë | (none) | ë
| 0x00EB |
| ï | (none) | ï
| 0x00EF |
| ÿ | (none) | ÿ
| 0x00FF |
| ÃÂ | (none) | Ë
| 0x00CB |
| ÃÂ | (none) | Ï
| 0x00CF |
| Ÿ | (none) | Ÿ
| 0x0178 |