Mojibake
Mojibake (文字化け, moji character + bake change, literally ghost characters or changed characters) is Japanese for broken characters: the result of trying to display text in character encodings which a piece of software is not configured to deal with. This is often because they are "foreign" alphabets with respect to the makers of the software, but the problem can also arise between different encodings of the same language - such as between EUC-JP and Shift-JIS, both encodings of Japanese characters.In mid 1990s as this problem became common, several website featured mojibake not as a problem to be tackled but as a computer joke. Words and even sentences were "deciphered" with meanings made up to deliver funny messages. It was even joked that this must be the work of extraterrestrials or ghosts trying to deliver secret messages.
It is called luan ma (亂碼 or 乱码 luan4 ma3), or "chaotic code(s)", in Chinese.
Example: "文字化け" might be displayed as "•ÃÂ¶Žš‰Ã»‚ï" (of course, depending on the software you use to view this article, that example may not show up correctly).
This problem is not unique to the Asian users. In the early 1990s Russian Internet users had to endure several different competing encodings (Unix KOI8-R, Windows CP-1251, DOS 866, standard ISO 8859-5, and several others) for the Cyrillic alphabet. Badly configured servers and lack of compatibility made garbled text a common and frustrating experience. Russian users, scared of the strange and unusual characters apearing instead of familiar Cyrillic letters, called them krokozyabry (). Unicode notwithstanding, the phenomenon is still widespread today. For example, some servers still strip the 8th bit from the characters as permitted by earlier email standards (which makes complete hash out of UTF-8 as well as all of the above). For this reason many Cyrillic users resort to Roman transliteration.
External Links