A typographic approximation is a replacement of an element of the writing system (usually a glyph) with another glyph or glyphs. The replacement may be a nearly homographic character, a digraph, or a character string. An approximation is different from a typographical error in that an approximation is intentional and aims to preserve the visual appearance of the original. The concept of approximation also applies to the World Wide Web and other forms of textual information available via digital media, though usually at the level of characters, not glyphs.
Historically, the main cause of typographic approximation was the small number of glyphs (such as letterforms and symbols) available for printing. In the age of the World Wide Web and digital typesetting, especially after the advent of Unicode and the proliferation of digital fonts, typographic approximations are usually caused either by people's limited ability to distinguish and find the symbols they need or by inadequate replacement patterns in word processors, rather than by a shortage of available characters.
|Normative:||3 × 2 − 1|
|Approximated:||3 x 2 - 1|
|An ASCII approximation of an arithmetical expression|
Typewriter and line printer approximations
Merger of characters
On typewriters, several characters were merged because of the limited size of the glyph repertoire. Several modern computing characters arose from mergers of different symbols, such as the "typewriter" apostrophe, ', which can denote an apostrophe proper, ’, a single quotation mark, or the prime symbol.
Some typewriters have non-spacing keys used for diacritical marks. After the typist presses, say, the acute accent ◌́, the carriage does not advance. This allows the typist to overstrike the mark with a spacing letter, say, e, and obtain é, an accented letter. Because of the geometrical restrictions of a monospaced font, the result was not always perfect. For example, overstriking was unlikely to be a feasible method of producing uppercase accented letters, such as É.
Overstrike was used on line printers for the same purpose. This contributed to the standardization of such characters as U+0060 ` .
Overstrike was also used to simulate boldface letters with line printers.
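The overstrike technique described above can be sketched in code. The example below is a minimal illustration, assuming a printer-style text stream in which the backspace control character (U+0008) moves the print head back one column, which is one common convention for encoding overstrike. The function names `overstrike_bold` and `render_cells` are hypothetical and chosen only for this sketch.

```python
def overstrike_bold(text: str, strikes: int = 2) -> str:
    """Encode simulated boldface: strike each visible character
    `strikes` times, with a backspace between consecutive strikes."""
    out = []
    for ch in text:
        if ch.isspace():
            out.append(ch)  # whitespace leaves no ink, so strike it once
        else:
            out.append("\b".join(ch * strikes))
    return "".join(out)


def render_cells(stream: str) -> list[str]:
    """Simulate a print head over a monospaced line.

    Returns one string per column, listing the glyphs struck in that
    column in order. Assumption: backspace moves the head left by one
    column and every other character advances it by one.
    """
    cells: list[str] = []
    col = 0
    for ch in stream:
        if ch == "\b":
            col = max(0, col - 1)
            continue
        while len(cells) <= col:
            cells.append("")
        cells[col] += ch
        col += 1
    return cells
```

For instance, `render_cells("e\b'")` reports a single column holding both `e` and `'`, which is how a letter and an accent-like mark end up superimposed. Running `render_cells(overstrike_bold("ab"))` shows each column struck twice with the same letter.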
The US-ASCII character set and other variants of ISO/IEC 646 contain 95 graphic characters. This repertoire is comparable to that of a (Latin script) typewriter and is insufficient for quality typography. But the high availability and robustness of the ASCII character encoding prompted computer users to invent ASCII substitutes for various glyphs.
The following ASCII characters are used to approximate certain characters. Many Latin letters are homographic to letters of other scripts; those Latin letters are not listed below.
- U+0020 (space): alignment and justification.
- U+0022 " : various types of double quotes, double prime ″.
- U+0023 # : sharp symbol ♯.
- U+0027 ' : various types of single quotes, apostrophe ’, prime ′.
- Parentheses U+0028 ( U+0029 ) : encircled characters, such as (c) for the copyright symbol ©.
- U+002A * – see Asterisk.
- U+002B + – various symbols with strokes extending to left, up, right and down.
- U+002D - – probably the ASCII character most used for approximations. It is the conventional representation of the hyphen and approximates dashes (especially as -- and ---), the minus sign −, and the line drawing horizontal line ─ (see the image).
- U+002E . : various dot-like symbols, see Full stop.
- U+002F / – see Slash (punctuation).
- U+0031 1 : Turkish dotless ı, Cyrillic palochka Ӏ.
- U+0033 3 : IPA reversed epsilon ɜ, Cyrillic letter З.
- U+0034 4 : Cyrillic letter Ч.
- U+0038 8 : various non-Latin letters and symbols of similar shape.
- U+003A : – see Colon (punctuation).
- U+003C < and U+003E > : chevrons ⟨ ⟩, angle quotes ‹ ›, horizontal arrows (especially as digraphs <- and ->).
- U+003D = : line drawing horizontal double line ═ (see the image), double hyphen.
- U+003F ? – although not an approximation, the question mark sometimes replaces unrepresented and unrecognized characters.
- U+0040 @ – see At sign.
- U+004E N : Numero sign №.
- U+0054 T : various symbols with strokes extending to left, right and down, but not up.
- U+0055 U : set union ∪.
- U+0056 V : ∨.
- U+0058 X : X mark ✗.
- U+005B [ and U+005D ] : checkbox and similar rectangular pictograms.
- U+005E ^ : ∧, upwards arrow ↑, and similar symbols with the wedge at the top.
- U+005F _ – see Underscore.
- U+0060 ` – opening single quote ‘.
- U+0062 b : flat symbol ♭.
- U+006F o : bullets and various circle-like symbols such as ∘ and ∞ (using two consecutive characters).
- U+0075 u : μ, the SI prefix micro- or the lowercase Greek letter mu.
- U+0076 v : downwards arrow ↓, and similar symbols with the wedge at the bottom.
- U+0078 x : multiplication sign ×.
- U+007C | (on the image, this ASCII character is rendered as a broken bar ¦): line drawing vertical symbols.
- U+007E ~ – see Tilde.
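A few of the substitutions listed above can be expressed as a lookup table. The sketch below is illustrative only: the table `ASCII_APPROXIMATIONS` and the function `approximate_ascii` are hypothetical names, the table covers only a handful of entries from the list, and mapping the en dash to -- and the em dash to --- is an assumption made for this example. Characters with no entry are replaced with a question mark, following the fallback usage noted for U+003F.

```python
# Hypothetical, deliberately incomplete table of ASCII approximations.
ASCII_APPROXIMATIONS = {
    "\u00d7": "x",    # multiplication sign
    "\u2212": "-",    # minus sign
    "\u2018": "`",    # opening single quote
    "\u2019": "'",    # apostrophe / closing single quote
    "\u2032": "'",    # prime
    "\u201c": '"',    # opening double quote
    "\u201d": '"',    # closing double quote
    "\u2033": '"',    # double prime
    "\u00a9": "(c)",  # copyright symbol
    "\u2190": "<-",   # leftwards arrow
    "\u2192": "->",   # rightwards arrow
    "\u2013": "--",   # en dash (assumed mapping)
    "\u2014": "---",  # em dash (assumed mapping)
    "\u266f": "#",    # sharp symbol
    "\u266d": "b",    # flat symbol
    "\u00b5": "u",    # micro sign
    "\u03bc": "u",    # Greek small letter mu
    "\u221e": "oo",   # infinity, using two consecutive characters
}


def approximate_ascii(text: str, unknown: str = "?") -> str:
    """Return an ASCII-only approximation of `text`.

    ASCII characters pass through unchanged. Other characters are
    looked up in ASCII_APPROXIMATIONS; anything not in the table is
    replaced by `unknown`.
    """
    out = []
    for ch in text:
        if ch.isascii():
            out.append(ch)
        else:
            out.append(ASCII_APPROXIMATIONS.get(ch, unknown))
    return "".join(out)
```

Applied to the normative expression `3 × 2 − 1`, this sketch returns `3 x 2 - 1`. The mapping is lossy: several distinct characters collapse onto the same ASCII character, so the original text cannot in general be recovered from the approximation.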
Approximation of non-glyphs
In digital technologies, there are still some conditions where typographic approximations are appropriate. Some devices, such as mobile phones, cannot support the huge character repertoires and powerful text formatting tools that are ubiquitous on desktop computers of the 2000s.