Hyphen
| Punctuation marks |
|---|
|
apostrophe ( ' ) parentheses ( ( ) ), brackets ( [ ] ); ( { } ); ( < > ) colon ( : ) comma ( , ) dash ( ‒ ); ( – ); ( — ); ( ― ) ellipsis ( ... ) exclamation mark ( ! ); ( á ! ) full stop/period ( . ) hyphen ( - ); ( ‐ ) interrobang ( ‽ ) question mark ( ? ); ( ÿ ? ) quotation marks ( ‘ ’ ); ( “ ” ); ( ‹ › ); ( ë û ); ( ‚ ‘ ); ( „ “ ); ( àà); ( 「 」 ); ( 『 』 ) semicolon ( ; ) slash ( / ) and backslash ( \\ ) space ( ) and interpunct ( ÷ ) vertical bar / pipe ( | ) asterisk ( * ) and dagger ( àà) |
A hyphen is a punctuation mark. It is used both to join words and to separate syllables. It is often confused with a dash, which is longer. Hyphenation is the use of hyphens.
| Table of contents |
|
2 Examples of usage 3 Hyphens in computing 4 External link |
Traditionally, the hyphen has been used in several ways:
Rules and customs of usage
However, the use of hyphens has in general been steadily declining, both in popular writing and in scholarly journals. Its use is almost always avoided by those who write advertising copy or labels on packaging, since they are often more concerned with visual cleanliness than semantic clarity. However, it is still used in most newspapers and magazines, so people remain accustomed to seeing and understanding it. Most writers who are obstreperous about other things are compliant when editors tell them to hyphenate compounds.
Some strong examples of semantic changes caused by the placement of hyphens:
Examples of usage
Additional examples of proper use:
Hyphens in computing
In the ASCII character encoding, the hyphen was encoded as character 45. Technically, this character is called the hyphen-minus, as it is also used as the minus sign and dasheses. In Unicode, this same character is encoded as U+002D so that Unicode remains compatible with ASCII. However, Unicode also encodes the hyphen and minus separately, as U+2010 ( ‐ ) and U+2212 ( − ), respectively.
When flowing text, it is sometimes preferable to break a word in half so that it continues on another line rather than moving the entire word to the next line. However, doing so requires some knowledge of the conventions of language, making the writing of a computer programs capable of doing so automatically and accurately difficult. To avoid this problem, Unicode encodes a soft hyphen character, U+00AD: when flowing text, a system may consider the soft hyphen to be a point at which a word may be broken, and display a hyphen at the end of the broken line; otherwise, the hyphen is not displayed. In HTML, the soft hyphen is encoded as the character entity ­.
Most text systems consider a hyphen to be a word boundary and a valid point at which to break a line when flowing text. However, this is not always desirable behavior, especially when it could lead to ambiguity. For this purpose, Unicode also encodes a non-breaking hyphen as U+2011. This character looks identical to the regular hyphen, but is not treated as a word boundary.