utf8_sequence_0-0xff_assigned_printable.txt: (Extrans (html tags to text); looked okay when I pasted it in; did NOT preview -- immediately clicked Submit)
Sentences that contain all letters commonly used in a language --------------------------------------------------------------
Markus Kuhn -- 2012-04-11
This is an example of a plain-text file encoded in UTF-8.
Danish (da) ---------
Quizdeltagerne spiste jordbær med fløde, mens cirkusklovnen
Wolther spillede på xylofon.
(= Quiz contestants were eating strawbery with cream while Wolther
the circus clown played on xylophone.)
German (de) -----------
Falsches Üben von Xylophonmusik quält jeden größeren Zwerg
(= Wrongful practicing of xylophone music tortures every larger dwarf)
Zwölf Boxkämpfer jagten Eva quer über den Sylter Deich
(= Twelve boxing fighters hunted Eva across the dike of Sylt)
Heizölrückstoßabdämpfung
(= fuel oil recoil absorber)
(jqvwxy missing, but all non-ASCII letters in one word)
Greek (el) ----------
Γαζέες καὶ μυρτιὲς δὲν θὰ βρῶ πιὰ στὸ χρυσαφὶ ξέφωτο
(= No more shall I see acacias or myrtles in the golden clearing)
Ξεσκεπάζω τὴν ψυχοφθόρα βδελυγμία
(= I uncover the soul-destroying abhorrence)
English (en) ------------
The quick brown fox jumps over the lazy dog
Spanish (es) ------------
El pingüino Wenceslao hizo kilómetros bajo exhaustiva lluvia y
frío, añoraba a su querido cachorro.
(Contains every letter and every accent, but not every combination
of vowel + acute.)
French (fr) -----------
Portez ce vieux whisky au juge blond qui fume sur son île intérieure, à
côté de l'alcôve ovoïde, où les bûches se consument dans l'âtre, ce
qui lui permet de penser à la cænogenèse de l'être dont il est question
dans la cause ambiguë entendue à Moÿ, dans un capharnaüm qui,
pense-t-il, diminue çà et là la qualité de son œuvre.
l'île exiguë
Où l'obèse jury mûr
Fête l'haï volapük,
Âne ex aéquo au whist,
Ôtez ce vœu déçu.
Le cœur déçu mais l'âme plutôt naïve, Louÿs rêva de crapaüter en
canoë au delà des îles, près du mälström où brûlent les novæ.
Irish Gaelic (ga) -----------------
D'fhuascail Íosa, Úrmhac na hÓighe Beannaithe, pór Éava agus Ádhaimh
Hungarian (hu) --------------
Árvíztűrő tükörfúrógép
(= flood-proof mirror-drilling machine, only all non-ASCII letters)
Icelandic (is) --------------
Kæmi ný öxi hér ykist þjófum nú bæði víl og ádrepa
Sævör grét áðan því úlpan var ónýt
(some ASCII letters missing)
latin capital ligature OE, U+0152 ISOlat2:
"Œ" = raw bytes;
"Œ" = Œ
"Œ" =
"Œ" = latin small ligature oe, U+0153 ISOlat2:
"œ" = raw bytes;
"œ" = œ
"œ" =
"œ" =
latin capital letter S with caron, U+0160 ISOlat2:
"Š" = raw bytes;
"Š" = Š
"Š" =
"Š" = latin small letter s with caron, U+0161 ISOlat2:
"š" = raw bytes;
"š" = š
"š" =
"š" = latin capital letter Y with diaeresis, U+0178 ISOlat2:
"Ÿ" = raw bytes;
"Ÿ" = Ÿ
"Ÿ" =
"Ÿ" =
modifier letter circumflex accent, U+02C6 ISOpub:
"ˆ" = raw bytes;
"ˆ" = ˆ
"ˆ" =
"ˆ" = small tilde, U+02DC ISOdia:
"˜" = raw bytes;
"˜" = ˜
"˜" =
"˜" =
en space, U+2002 ISOpub:
" " = raw bytes;
" " =
" " =
" " = em space, U+2003 ISOpub:
" " = raw bytes;
" " =
" " =
" " = thin space, U+2009 ISOpub:
" " = raw bytes;
" " =
" " =
" " = zero width non-joiner, U+200C NEW RFC 2070:
"" = raw bytes;
"" =
"" =
"" = zero width joiner, U+200D NEW RFC 2070:
"" = raw bytes;
"" =
"" =
"" = left-to-right mark, U+200E NEW RFC 2070:
"" = raw bytes;
"" =
"" =
"" = right-to-left mark, U+200F NEW RFC 2070:
"" = raw bytes;
"" =
"" =
"" = en dash, U+2013 ISOpub:
"–" = raw bytes;
"–" = –
"–" =
"–" = em dash, U+2014 ISOpub:
"—" = raw bytes;
"—" = —
"—" =
"—" = left single quotation mark, U+2018 ISOnum:
"‘" = raw bytes;
"‘" = ‘
"‘" =
"‘" = right single quotation mark, U+2019 ISOnum:
"’" = raw bytes;
"’" = ’
"’" =
"’" = single low-9 quotation mark, U+201A NEW:
"‚" = raw bytes;
"‚" = ‚
"‚" =
"‚" = left double quotation mark, U+201C ISOnum:
"“" = raw bytes;
"“" = “
"“" =
"“" = right double quotation mark, U+201D ISOnum:
"”" = raw bytes;
"”" = ”
"”" =
"”" = double low-9 quotation mark, U+201E NEW:
"„" = raw bytes;
"„" = „
"„" =
"„" = dagger, U+2020 ISOpub:
"†" = raw bytes;
"†" = †
"†" =
"†" = double dagger, U+2021 ISOpub:
"‡" = raw bytes;
"‡" = ‡
"‡" =
"‡" = per mille sign, U+2030 ISOtech:
"‰" = raw bytes;
"‰" = ‰
"‰" =
"‰" = single left-pointing angle quotation mark, U+2039 ISO proposed:
"‹" = raw bytes;
"‹" = ‹
"‹" =
"‹" =
single right-pointing angle quotation mark, U+203A ISO proposed:
"›" = raw bytes;
"›" = ›
"›" =
"›" =
euro sign, U+20AC NEW:
"€" = raw bytes;
"€" = €
"€" =
"€" =
(Score: 2) by martyb on Thursday July 03 2014, @11:28PM
<!-- Relevant ISO entity set is given unless names are newly introduced.
New names (i.e., not in ISO 8879 list) do not clash with any
existing ISO 8879 entity names. ISO 10646 character numbers
are given for each character, in hex. CDATA values are decimal
conversions of the ISO 10646 values and refer to the document
character set. Names are ISO 10646 names.
-->
<!-- Latin Extended-B -->
latin small f with hook = function = florin, U+0192 ISOtech:
"ƒ" = UTF-8 encoded character
"ƒ" = ƒ
"ƒ" = ƒ
"ƒ" = ƒ
<!-- Greek -->
greek capital letter alpha, U+0391:
"Α" = UTF-8 encoded character
"Α" = Α
"Α" = Α
"Α" = Α
greek capital letter beta, U+0392:
"Β" = UTF-8 encoded character
"Β" = Β
"Β" = Β
"Β" = Β
greek capital letter gamma, U+0393 ISOgrk3:
"Γ" = UTF-8 encoded character
"Γ" = Γ
"Γ" = Γ
"Γ" = Γ
greek capital letter delta, U+0394 ISOgrk3:
"Δ" = UTF-8 encoded character
"Δ" = Δ
"Δ" = Δ
"Δ" = Δ
greek capital letter epsilon, U+0395:
"Ε" = UTF-8 encoded character
"Ε" = Ε
"Ε" = Ε
"Ε" = Ε
greek capital letter zeta, U+0396:
"Ζ" = UTF-8 encoded character
"Ζ" = Ζ
"Ζ" = Ζ
"Ζ" = Ζ
greek capital letter eta, U+0397:
"Η" = UTF-8 encoded character
"Η" = Η
"Η" = Η
"Η" = Η
greek capital letter theta, U+0398 ISOgrk3:
"Θ" = UTF-8 encoded character
"Θ" = Θ
"Θ" = Θ
"Θ" = Θ
greek capital letter iota, U+0399:
"Ι" = UTF-8 encoded character
"Ι" = Ι
"Ι" = Ι
"Ι" = Ι
greek capital letter kappa, U+039A:
"Κ" = UTF-8 encoded character
"Κ" = Κ
"Κ" = Κ
"Κ" = Κ
greek capital letter lambda, U+039B ISOgrk3:
"Λ" = UTF-8 encoded character
"Λ" = Λ
"Λ" = Λ
"Λ" = Λ
greek capital letter mu, U+039C:
"Μ" = UTF-8 encoded character
"Μ" = Μ
"Μ" = Μ
"Μ" = Μ
greek capital letter nu, U+039D:
"Ν" = UTF-8 encoded character
"Ν" = Ν
"Ν" = Ν
"Ν" = Ν
greek capital letter xi, U+039E ISOgrk3:
"Ξ" = UTF-8 encoded character
"Ξ" = Ξ
"Ξ" = Ξ
"Ξ" = Ξ
greek capital letter omicron, U+039F:
"Ο" = UTF-8 encoded character
"Ο" = Ο
"Ο" = Ο
"Ο" = Ο
greek capital letter pi, U+03A0 ISOgrk3:
"Π" = UTF-8 encoded character
"Π" = Π
"Π" = Π
"Π" = Π
greek capital letter rho, U+03A1:
"Ρ" = UTF-8 encoded character
"Ρ" = Ρ
"Ρ" = Ρ
"Ρ" = Ρ
<!-- there is no Sigmaf, and no U+03A2 character either -->
greek capital letter sigma, U+03A3 ISOgrk3:
"Σ" = UTF-8 encoded character
"Σ" = Σ
"Σ" = Σ
"Σ" = Σ
greek capital letter tau, U+03A4:
"Τ" = UTF-8 encoded character
"Τ" = Τ
"Τ" = Τ
"Τ" = Τ
greek capital letter upsilon, U+03A5 ISOgrk3:
"Υ" = UTF-8 encoded character
"Υ" = Υ
"Υ" = Υ
"Υ" = Υ
greek capital letter phi, U+03A6 ISOgrk3:
"Φ" = UTF-8 encoded character
"Φ" = Φ
"Φ" = Φ
"Φ" = Φ
greek capital letter chi, U+03A7:
"Χ" = UTF-8 encoded character
"Χ" = Χ
"Χ" = Χ
"Χ" = Χ
greek capital letter psi, U+03A8 ISOgrk3:
"Ψ" = UTF-8 encoded character
"Ψ" = Ψ
"Ψ" = Ψ
"Ψ" = Ψ
greek capital letter omega, U+03A9 ISOgrk3:
"Ω" = UTF-8 encoded character
"Ω" = Ω
"Ω" = Ω
"Ω" = Ω
greek small letter alpha, U+03B1 ISOgrk3:
"α" = UTF-8 encoded character
"α" = α
"α" = α
"α" = α
greek small letter beta, U+03B2 ISOgrk3:
"β" = UTF-8 encoded character
"β" = β
"β" = β
"β" = β
greek small letter gamma, U+03B3 ISOgrk3:
"γ" = UTF-8 encoded character
"γ" = γ
"γ" = γ
"γ" = γ
greek small letter delta, U+03B4 ISOgrk3:
"δ" = UTF-8 encoded character
"δ" = δ
"δ" = δ
"δ" = δ
greek small letter epsilon, U+03B5 ISOgrk3:
"ε" = UTF-8 encoded character
"ε" = ε
"ε" = ε
"ε" = ε
greek small letter zeta, U+03B6 ISOgrk3:
"ζ" = UTF-8 encoded character
"ζ" = ζ
"ζ" = ζ
"ζ" = ζ
greek small letter eta, U+03B7 ISOgrk3:
"η" = UTF-8 encoded character
"η" = η
"η" = η
"η" = η
greek small letter theta, U+03B8 ISOgrk3:
"θ" = UTF-8 encoded character
"θ" = θ
"θ" = θ
"θ" = θ
greek small letter iota, U+03B9 ISOgrk3:
"ι" = UTF-8 encoded character
"ι" = ι
"ι" = ι
"ι" = ι
greek small letter kappa, U+03BA ISOgrk3:
"κ" = UTF-8 encoded character
"κ" = κ
"κ" = κ
"κ" = κ
greek small letter lambda, U+03BB ISOgrk3:
"λ" = UTF-8 encoded character
"λ" = λ
"λ" = λ
"λ" = λ
greek small letter mu, U+03BC ISOgrk3:
"μ" = UTF-8 encoded character
"μ" = μ
"μ" = μ
"μ" = μ
greek small letter nu, U+03BD ISOgrk3:
"ν" = UTF-8 encoded character
"ν" = ν
"ν" = ν
"ν" = ν
greek small letter xi, U+03BE ISOgrk3:
"ξ" = UTF-8 encoded character
"ξ" = ξ
"ξ" = ξ
"ξ" = ξ
greek small letter omicron, U+03BF NEW:
"ο" = UTF-8 encoded character
"ο" = ο
"ο" = ο
"ο" = ο
greek small letter pi, U+03C0 ISOgrk3:
"π" = UTF-8 encoded character
"π" = π
"π" = π
"π" = π
greek small letter rho, U+03C1 ISOgrk3:
"ρ" = UTF-8 encoded character
"ρ" = ρ
"ρ" = ρ
"ρ" = ρ
greek small letter final sigma, U+03C2 ISOgrk3:
"ς" = UTF-8 encoded character
"ς" = ς
"ς" = ς
"ς" = ς
greek small letter sigma, U+03C3 ISOgrk3:
"σ" = UTF-8 encoded character
"σ" = σ
"σ" = σ
"σ" = σ
greek small letter tau, U+03C4 ISOgrk3:
"τ" = UTF-8 encoded character
"τ" = τ
"τ" = τ
"τ" = τ
greek small letter upsilon, U+03C5 ISOgrk3:
"υ" = UTF-8 encoded character
"υ" = υ
"υ" = υ
"υ" = υ
greek small letter phi, U+03C6 ISOgrk3:
"φ" = UTF-8 encoded character
"φ" = φ
"φ" = φ
"φ" = φ
greek small letter chi, U+03C7 ISOgrk3:
"χ" = UTF-8 encoded character
"χ" = χ
"χ" = χ
"χ" = χ
greek small letter psi, U+03C8 ISOgrk3:
"ψ" = UTF-8 encoded character
"ψ" = ψ
"ψ" = ψ
"ψ" = ψ
greek small letter omega, U+03C9 ISOgrk3:
"ω" = UTF-8 encoded character
"ω" = ω
"ω" = ω
"ω" = ω
greek small letter theta symbol, U+03D1 NEW:
"ϑ" = UTF-8 encoded character
"ϑ" = ϑ
"ϑ" = ϑ
"ϑ" = ϑ
greek upsilon with hook symbol, U+03D2 NEW:
"ϒ" = UTF-8 encoded character
"ϒ" = ϒ
"ϒ" = ϒ
"ϒ" = ϒ
greek pi symbol, U+03D6 ISOgrk3:
"ϖ" = UTF-8 encoded character
"ϖ" = ϖ
"ϖ" = ϖ
"ϖ" = ϖ
<!-- General Punctuation -->
bullet = black small circle, U+2022 ISOpub:
"•" = UTF-8 encoded character
"•" = •
"•" = •
"•" = •
<!-- bullet is NOT the same as bullet operator, U+2219 -->
horizontal ellipsis = three dot leader, U+2026 ISOpub:
"…" = UTF-8 encoded character
"…" = …
"…" = …
"…" = …
prime = minutes = feet, U+2032 ISOtech:
"′" = UTF-8 encoded character
"′" = ′
"′" = ′
"′" = ′
double prime = seconds = inches, U+2033 ISOtech:
"″" = UTF-8 encoded character
"″" = ″
"″" = ″
"″" = ″
overline = spacing overscore, U+203E NEW:
"‾" = UTF-8 encoded character
"‾" = ‾
"‾" = ‾
"‾" = ‾
fraction slash, U+2044 NEW:
"⁄" = UTF-8 encoded character
"⁄" = ⁄
"⁄" = ⁄
"⁄" = ⁄
<!-- Letterlike Symbols -->
script capital P = power set = Weierstrass p, U+2118 ISOamso:
"℘" = UTF-8 encoded character
"℘" = ℘
"℘" = ℘
"℘" = ℘
blackletter capital I = imaginary part, U+2111 ISOamso:
"ℑ" = UTF-8 encoded character
"ℑ" = ℑ
"ℑ" = ℑ
"ℑ" = ℑ
blackletter capital R = real part symbol, U+211C ISOamso:
"ℜ" = UTF-8 encoded character
"ℜ" = ℜ
"ℜ" = ℜ
"ℜ" = ℜ
trade mark sign, U+2122 ISOnum:
"™" = UTF-8 encoded character
"™" = ™
"™" = ™
"™" = ™
alef symbol = first transfinite cardinal, U+2135 NEW:
"ℵ" = UTF-8 encoded character
"ℵ" = ℵ
"ℵ" = ℵ
"ℵ" = ℵ
<!-- alef symbol is NOT the same as hebrew letter alef,
U+05D0 although the same glyph could be used to depict both characters -->
<!-- Arrows -->
leftwards arrow, U+2190 ISOnum:
"←" = UTF-8 encoded character
"←" = ←
"←" = ←
"←" = ←
upwards arrow, U+2191:
"↑" = UTF-8 encoded character
"↑" = ↑
"↑" = ↑
"↑" = ↑
rightwards arrow, U+2192 ISOnum:
"→" = UTF-8 encoded character
"→" = →
"→" = →
"→" = →
downwards arrow, U+2193 ISOnum:
"↓" = UTF-8 encoded character
"↓" = ↓
"↓" = ↓
"↓" = ↓
left right arrow, U+2194 ISOamsa:
"↔" = UTF-8 encoded character
"↔" = ↔
"↔" = ↔
"↔" = ↔
downwards arrow with corner leftwards = carriage return, U+21B5 NEW:
"↵" = UTF-8 encoded character
"↵" = ↵
"↵" = ↵
"↵" = ↵
leftwards double arrow, U+21D0 ISOtech:
"⇐" = UTF-8 encoded character
"⇐" = ⇐
"⇐" = ⇐
"⇐" = ⇐
<!-- ISO 10646 does not say that lArr is the same as the 'is implied by' arrow
but also does not have any other character for that function. So ? lArr can
be used for 'is implied by' as ISOtech suggests -->
upwards double arrow, U+21D1 ISOamsa:
"⇑" = UTF-8 encoded character
"⇑" = ⇑
"⇑" = ⇑
"⇑" = ⇑
rightwards double arrow, U+21D2 ISOtech:
"⇒" = UTF-8 encoded character
"⇒" = ⇒
"⇒" = ⇒
"⇒" = ⇒
<!-- ISO 10646 does not say this is the 'implies' character but does not have
another character with this function so ?
rArr can be used for 'implies' as ISOtech suggests -->
downwards double arrow, U+21D3 ISOamsa:
"⇓" = UTF-8 encoded character
"⇓" = ⇓
"⇓" = ⇓
"⇓" = ⇓
left right double arrow, U+21D4 ISOamsa:
"⇔" = UTF-8 encoded character
"⇔" = ⇔
"⇔" = ⇔
"⇔" = ⇔
<!-- Mathematical Operators -->
for all, U+2200 ISOtech:
"∀" = UTF-8 encoded character
"∀" = ∀
"∀" = ∀
"∀" = ∀
partial differential, U+2202 ISOtech:
"∂" = UTF-8 encoded character
"∂" = ∂
"∂" = ∂
"∂" = ∂
there exists, U+2203 ISOtech:
"∃" = UTF-8 encoded character
"∃" = ∃
"∃" = ∃
"∃" = ∃
empty set = null set = diameter, U+2205 ISOamso:
"∅" = UTF-8 encoded character
"∅" = ∅
"∅" = ∅
"∅" = ∅
nabla = backward difference, U+2207 ISOtech:
"∇" = UTF-8 encoded character
"∇" = ∇
"∇" = ∇
"∇" = ∇
element of, U+2208 ISOtech:
"∈" = UTF-8 encoded character
"∈" = ∈
"∈" = ∈
"∈" = ∈
not an element of, U+2209 ISOtech:
"∉" = UTF-8 encoded character
"∉" = ∉
"∉" = ∉
"∉" = ∉
contains as member, U+220B ISOtech:
"∋" = UTF-8 encoded character
"∋" = ∋
"∋" = ∋
"∋" = ∋
<!-- should there be a more memorable name than 'ni'? -->
n-ary product = product sign, U+220F ISOamsb:
"∏" = UTF-8 encoded character
"∏" = ∏
"∏" = ∏
"∏" = ∏
<!-- prod is NOT the same character as U+03A0 'greek capital letter pi' though
the same glyph might be used for both -->
n-ary sumation, U+2211 ISOamsb:
"∑" = UTF-8 encoded character
"∑" = ∑
"∑" = ∑
"∑" = ∑
<!-- sum is NOT the same character as U+03A3 'greek capital letter sigma'
though the same glyph might be used for both -->
minus sign, U+2212 ISOtech:
"−" = UTF-8 encoded character
"−" = −
"−" = −
"−" = −
asterisk operator, U+2217 ISOtech:
"∗" = UTF-8 encoded character
"∗" = ∗
"∗" = ∗
"∗" = ∗
square root = radical sign, U+221A ISOtech:
"√" = UTF-8 encoded character
"√" = √
"√" = √
"√" = √
proportional to, U+221D ISOtech:
"∝" = UTF-8 encoded character
"∝" = ∝
"∝" = ∝
"∝" = ∝
infinity, U+221E ISOtech:
"∞" = UTF-8 encoded character
"∞" = ∞
"∞" = ∞
"∞" = ∞
angle, U+2220 ISOamso:
"∠" = UTF-8 encoded character
"∠" = ∠
"∠" = ∠
"∠" = ∠
logical and = wedge, U+2227 ISOtech:
"∧" = UTF-8 encoded character
"∧" = ∧
"∧" = ∧
"∧" = ∧
logical or = vee, U+2228 ISOtech:
"∨" = UTF-8 encoded character
"∨" = ∨
"∨" = ∨
"∨" = ∨
intersection = cap, U+2229 ISOtech:
"∩" = UTF-8 encoded character
"∩" = ∩
"∩" = ∩
"∩" = ∩
union = cup, U+222A ISOtech:
"∪" = UTF-8 encoded character
"∪" = ∪
"∪" = ∪
"∪" = ∪
integral, U+222B ISOtech:
"∫" = UTF-8 encoded character
"∫" = ∫
"∫" = ∫
"∫" = ∫
therefore, U+2234 ISOtech:
"∴" = UTF-8 encoded character
"∴" = ∴
"∴" = ∴
"∴" = ∴
tilde operator = varies with = similar to, U+223C ISOtech:
"∼" = UTF-8 encoded character
"∼" = ∼
"∼" = ∼
"∼" = ∼
<!-- tilde operator is NOT the same character as the tilde, U+007E,
although the same glyph might be used to represent both -->
approximately equal to, U+2245 ISOtech:
"≅" = UTF-8 encoded character
"≅" = ≅
"≅" = ≅
"≅" = ≅
almost equal to = asymptotic to, U+2248 ISOamsr:
"≈" = UTF-8 encoded character
"≈" = ≈
"≈" = ≈
"≈" = ≈
not equal to, U+2260 ISOtech:
"≠" = UTF-8 encoded character
"≠" = ≠
"≠" = ≠
"≠" = ≠
identical to, U+2261 ISOtech:
"≡" = UTF-8 encoded character
"≡" = ≡
"≡" = ≡
"≡" = ≡
less-than or equal to, U+2264 ISOtech:
"≤" = UTF-8 encoded character
"≤" = ≤
"≤" = ≤
"≤" = ≤
greater-than or equal to, U+2265 ISOtech:
"≥" = UTF-8 encoded character
"≥" = ≥
"≥" = ≥
"≥" = ≥
subset of, U+2282 ISOtech:
"⊂" = UTF-8 encoded character
"⊂" = ⊂
"⊂" = ⊂
"⊂" = ⊂
superset of, U+2283 ISOtech:
"⊃" = UTF-8 encoded character
"⊃" = ⊃
"⊃" = ⊃
"⊃" = ⊃
<!-- note that nsup, 'not a superset of, U+2283' is not covered by the Symbol
font encoding and is not included. Should it be, for symmetry?
It is in ISOamsn -->
not a subset of, U+2284 ISOamsn:
"⊄" = UTF-8 encoded character
"⊄" = ⊄
"⊄" = ⊄
"⊄" = ⊄
subset of or equal to, U+2286 ISOtech:
"⊆" = UTF-8 encoded character
"⊆" = ⊆
"⊆" = ⊆
"⊆" = ⊆
superset of or equal to, U+2287 ISOtech:
"⊇" = UTF-8 encoded character
"⊇" = ⊇
"⊇" = ⊇
"⊇" = ⊇
circled plus = direct sum, U+2295 ISOamsb:
"⊕" = UTF-8 encoded character
"⊕" = ⊕
"⊕" = ⊕
"⊕" = ⊕
circled times = vector product, U+2297 ISOamsb:
"⊗" = UTF-8 encoded character
"⊗" = ⊗
"⊗" = ⊗
"⊗" = ⊗
up tack = orthogonal to = perpendicular, U+22A5 ISOtech:
"⊥" = UTF-8 encoded character
"⊥" = ⊥
"⊥" = ⊥
"⊥" = ⊥
dot operator, U+22C5 ISOamsb:
"⋅" = UTF-8 encoded character
"⋅" = ⋅
"⋅" = ⋅
"⋅" = ⋅
<!-- dot operator is NOT the same character as U+00B7 middle dot -->
<!-- Relevant ISO entity set is given unless names are newly introduced.
New names (i.e., not in ISO 8879 list) do not clash with any
existing ISO 8879 entity names. ISO 10646 character numbers
are given for each character, in hex. CDATA values are decimal
conversions of the ISO 10646 values and refer to the document
character set. Names are ISO 10646 names.
-->
<!-- C0 Controls and Basic Latin -->
quotation mark = APL quote, U+0022 ISOnum:
""" = UTF-8 encoded character
""" = "
""" = "
""" = "
ampersand, U+0026 ISOnum:
"&" = UTF-8 encoded character
"&" = &
"&" = &
"&" = &
less-than sign, U+003C ISOnum:
""<" = <
"<" = <
"<" = <
greater-than sign, U+003E ISOnum:
">" = UTF-8 encoded character
">" = >
">" = >
">" = >
<!-- Latin Extended-A -->
latin capital ligature OE, U+0152 ISOlat2:
"Œ" = UTF-8 encoded character
"Œ" = Œ
"Œ" = Œ
"Œ" = Œ
latin small ligature oe, U+0153 ISOlat2:
"œ" = UTF-8 encoded character
"œ" = œ
"œ" = œ
"œ" = œ
<!-- ligature is a misnomer, this is a separate character in some languages -->
latin capital letter S with caron, U+0160 ISOlat2:
"Š" = UTF-8 encoded character
"Š" = Š
"Š" = Š
"Š" = Š
latin small letter s with caron, U+0161 ISOlat2:
"š" = UTF-8 encoded character
"š" = š
"š" = š
"š" = š
latin capital letter Y with diaeresis, U+0178 ISOlat2:
"Ÿ" = UTF-8 encoded character
"Ÿ" = Ÿ
"Ÿ" = Ÿ
"Ÿ" = Ÿ
<!-- Spacing Modifier Letters -->
modifier letter circumflex accent, U+02C6 ISOpub:
"ˆ" = UTF-8 encoded character
"ˆ" = ˆ
"ˆ" = ˆ
"ˆ" = ˆ
small tilde, U+02DC ISOdia:
"˜" = UTF-8 encoded character
"˜" = ˜
"˜" = ˜
"˜" = ˜
<!-- General Punctuation -->
en space, U+2002 ISOpub:
" " = UTF-8 encoded character
" " =  
" " =  
" " =  
em space, U+2003 ISOpub:
" " = UTF-8 encoded character
" " =  
" " =  
" " =  
thin space, U+2009 ISOpub:
" " = UTF-8 encoded character
" " =  
" " =  
" " =  
zero width non-joiner, U+200C NEW RFC 2070:
"" = UTF-8 encoded character
"" = ‌
"" = ‌
"" = ‌
zero width joiner, U+200D NEW RFC 2070:
"" = UTF-8 encoded character
"" = ‍
"" = ‍
"" = ‍
left-to-right mark, U+200E NEW RFC 2070:
"" = UTF-8 encoded character
"" = ‎
"" = ‎
"" = ‎
right-to-left mark, U+200F NEW RFC 2070:
"" = UTF-8 encoded character
"" = ‏
"" = ‏
"" = ‏
en dash, U+2013 ISOpub:
"–" = UTF-8 encoded character
"–" = –
"–" = –
"–" = –
em dash, U+2014 ISOpub:
"—" = UTF-8 encoded character
"—" = —
"—" = —
"—" = —
left single quotation mark, U+2018 ISOnum:
"‘" = UTF-8 encoded character
"‘" = ‘
"‘" = ‘
"‘" = ‘
right single quotation mark, U+2019 ISOnum:
"’" = UTF-8 encoded character
"’" = ’
"’" = ’
"’" = ’
single low-9 quotation mark, U+201A NEW:
"‚" = UTF-8 encoded character
"‚" = ‚
"‚" = ‚
"‚" = ‚
left double quotation mark, U+201C ISOnum:
"“" = UTF-8 encoded character
"“" = “
"“" = “
"“" = “
right double quotation mark, U+201D ISOnum:
"”" = UTF-8 encoded character
"”" = ”
"”" = ”
"”" = ”
double low-9 quotation mark, U+201E NEW:
"„" = UTF-8 encoded character
"„" = „
"„" = „
"„" = „
dagger, U+2020 ISOpub:
"†" = UTF-8 encoded character
"†" = †
"†" = †
"†" = †
double dagger, U+2021 ISOpub:
"‡" = UTF-8 encoded character
"‡" = ‡
"‡" = ‡
"‡" = ‡
per mille sign, U+2030 ISOtech:
"‰" = UTF-8 encoded character
"‰" = ‰
"‰" = ‰
"‰" = ‰
single left-pointing angle quotation mark, U+2039 ISO proposed:
"‹" = UTF-8 encoded character
"‹" = ‹
"‹" = ‹
"‹" = ‹
<!-- lsaquo is proposed but not yet ISO standardized -->
single right-pointing angle quotation mark, U+203A ISO proposed:
"›" = UTF-8 encoded character
"›" = ›
"›" = ›
"›" = ›
<!-- rsaquo is proposed but not yet ISO standardized -->
(Score: 2) by martyb on Thursday June 26 2014, @09:48AM
Submitted as "Plain Old Text"
(Score: 2) by The Mighty Buzzard on Thursday June 26 2014, @10:46AM
Some text here because of the cat got your tongue filter still coming into play.
一 丁 丂 七 丄 丅 丆 万 丈 三 上 下 丌 不 与 丏 丐 丑 丒 专 且 丕 世 丗 丘 丙 业 丛 东 丝 丞 丟 丠 両 丢 丣 两 严 並 丧 丨 丩 个 丫 丬 中 丮 丯 丰 丱 串 丳 临 丵 丶 丷 丸 丹 为 主 丼 丽 举 丿 乀 乁 乂 乃 乄 久 乆 乇 么 义 乊 之 乌 乍 乎 乏 乐 乑 乒 乓 乔 乕 乖 乗 乘 乙 乚 乛 乜 九 乞 也 习 乡 乢 乣 乤 乥 书 乧 乨 乩 乪 乫 乬 乭 乮 乯 买 乱 乲 乳 乴 乵 乶 乷 乸 乹 乺 乻 乼 乽 乾 乿 ...
123
456
789
(Score: 2) by The Mighty Buzzard on Thursday June 26 2014, @10:47AM
Ascii art filter == defeated.
123
456
789
(Score: 2) by paulej72 on Saturday July 26 2014, @10:19AM
test
Team Leader for SN Development Dev Server [soylentnews.org]
(Score: 2) by martyb on Thursday June 26 2014, @11:00AM
This is another try at testing all named character entities that appear in the HTML 4 spec.
Submitted using "Plain Old Text"; NO preview.
(Score: 2) by martyb on Tuesday July 01 2014, @10:03AM
utf8_sequence_0-0xff_assigned_printable.txt: (Plain Old Text; looked okay when I pasted it in; did NOT preview -- immediately clicked Submit)
! " # $ % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; ? @ A B C D E F G H I J K L M
N O P Q R S T U V W X Y Z [ \ ] ^ _ ` a b c d e f g h i j k l m n o p q r s t u v w x y z { | } ~
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯ ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó
Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
(Score: 2) by martyb on Tuesday July 01 2014, @10:07AM
(Score: 2) by martyb on Tuesday July 01 2014, @10:10AM
! " # $ % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ? @ A B C D E F G H I J K L M
N O P Q R S T U V W X Y Z [ \ ] ^ _ ` a b c d e f g h i j k l m n o p q r s t u v w x y z { | } ~
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯ ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó
Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
(Score: 2) by martyb on Tuesday July 01 2014, @10:11AM
! " # $ % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ? @ A B C D E F G H I J K L M
N O P Q R S T U V W X Y Z [ \ ] ^ _ ` a b c d e f g h i j k l m n o p q r s t u v w x y z { | } ~
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯ ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó
Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
(Score: 2) by martyb on Tuesday July 01 2014, @10:22AM
utf8_sequence_0-0xfff_assigned_printable.txt: (Plain Old Text; looked okay when I pasted it in; did NOT preview -- immediately clicked Submit)
Got error message:
Lameness filter encountered.
Your comment violated the "postercomment" compression filter. Try less whitespace and/or less repetition.
So, I removed the fist 0..FF characters I'd used in the earlier tests.
That *still* did not work.
Could be a result of there being too many spaces...
The test data had a space between EACH of the characters (UTF-8_sequence_separated)
Will save this without the UTF-8 characters so as to document this; and then will try again with non-separated characters.
(Score: 2) by martyb on Tuesday July 01 2014, @11:19AM
utf8_sequence_0-0xfff_assigned_printable_unseparated.txt
Chars looked okay when pasted in, Plain old text, no preview, pressed Submit button.
!"#$%&'()*+,-./0123456789:;?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~ ¡¢£¤¥¦§¨©ª«¬®¯°±²³´µ¶·¸¹º»¼½¾¿ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖ×ØÙÚÛÜÝÞßàáâãäåæçèéêëìíîïðñòóôõö÷øùúûüýþÿĀāĂ㥹ĆćĈĉĊċČčĎďĐđĒēĔĕĖėĘęĚěĜĝĞğĠġĢģĤĥĦħĨĩĪīĬĭĮįİıIJijĴĵĶķĸĹĺĻļĽľĿŀŁłŃńŅņŇňʼnŊŋŌōŎŏŐőŒœŔŕŖŗŘřŚśŜŝŞşŠšŢţŤťŦŧŨũŪūŬŭŮůŰűŲųŴŵŶŷŸŹźŻżŽžſƀƁƂƃƄƅƆƇƈƉƊƋƌƍƎƏƐƑƒƓƔƕƖƗƘƙƚƛƜƝƞƟƠơƢƣƤƥƦƧƨƩƪƫƬƭƮƯưƱƲƳƴƵƶƷƸƹƺƻƼƽƾƿǀǁǂǃDŽDždžLJLjljNJNjnjǍǎǏǐǑǒǓǔǕǖǗǘǙǚǛǜǝǞǟǠǡǢǣǤǥǦǧǨǩǪǫǬǭǮǯǰDZDzdzǴǵǶǷǸǹǺǻǼǽǾǿȀȁȂȃȄȅȆȇȈȉȊȋȌȍȎȏȐȑȒȓȔȕȖȗȘșȚțȜȝȞȟȠȡȢȣȤȥȦȧȨȩȪȫȬȭȮȯȰȱȲȳȴȵȶȷȸȹȺȻȼȽȾȿɀɁɐɑɒɓɔɕɖɗɘəɚɛɜɝɞɟɠɡɢɣɤɥɦɧɨɩɪɫɬɭɮɯɰɱɲɳɴɵɶɷɸɹɺɻɼɽɾɿʀʁʂʃʄʅʆʇʈʉʊʋʌʍʎʏʐʑʒʓʔʕʖʗʘʙʚʛʜʝʞʟʠʡʢʣʤʥʦʧʨʩʪʫʬʭʮʯʰʱʲʳʴʵʶʷʸʹʺʻʼʽʾʿˀˁ˂˃˄˅ˆˇˈˉˊˋˌˍˎˏːˑ˒˓˔˕˖˗˘˙˚˛˜˝˞˟ˠˡˢˣˤ˥˦˧˨˩˪˫ˬ˭ˮ˯˰˱˲˳˴˵˶˷˸˹˺˻˼˽˾˿̴̵̶̷̸̡̢̧̨̛̖̗̘̙̜̝̞̟̠̣̤̥̦̩̪̫̬̭̮̯̰̱̲̳̹̺̻̼͇͈͉͍͎̀́̂̃̄̅̆̇̈̉̊̋̌̍̎̏̐̑̒̓̔̽̾̿̀́͂̓̈́͆͊͋͌̕̚ͅ͏͓͔͕͖͙͚͐͑͒͗͛ͣͤͥͦͧͨͩͪͫͬͭͮͯ͘͜͟͢͝͞͠͡ʹ͵ͺ;΄΅Ά·ΈΉΊΌΎΏΐΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩΪΫάέήίΰαβγδεζηθικλμνξοπρςστυφχψωϊϋόύώϐϑϒϓϔϕϖϗϘϙϚϛϜϝϞϟϠϡϢϣϤϥϦϧϨϩϪϫϬϭϮϯϰϱϲϳϴϵ϶ϷϸϹϺϻϼϽϾϿЀЁЂЃЄЅІЇЈЉЊЋЌЍЎЏАБВГДЕЖЗИЙКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯабвгдежзийклмнопрстуфхцчшщъыьэюяѐёђѓєѕіїјљњћќѝўџѠѡѢѣѤѥѦѧѨѩѪѫѬѭѮѯѰѱѲѳѴѵѶѷѸѹѺѻѼѽѾѿҀҁ҂҃҄҅҆҈҉ҊҋҌҍҎҏҐґҒғҔҕҖҗҘҙҚқҜҝҞҟҠҡҢңҤҥҦҧҨҩҪҫҬҭҮүҰұҲҳҴҵҶҷҸҹҺһҼҽҾҿӀӁӂӃӄӅӆӇӈӉӊӋӌӍӎӐӑӒӓӔӕӖӗӘәӚӛӜӝӞӟӠӡӢӣӤӥӦӧӨөӪӫӬӭӮӯӰӱӲӳӴӵӶӷӸӹԀԁԂԃԄԅԆԇԈԉԊԋԌԍԎԏԱԲԳԴԵԶԷԸԹԺԻԼԽԾԿՀՁՂՃՄՅՆՇՈՉՊՋՌՍՎՏՐՑՒՓՔՕՖՙ՚՛՜՝՞՟աբգդեզէըթժիլխծկհձղճմյնշոչպջռսվտրցւփքօֆև։֊ְֱֲֳִֵֶַָֹֻּֽ֑֖֛֢֣֤֥֦֧֪֚֭֮֒֓֔֕֗֘֙֜֝֞֟֠֡֨֩֫֬֯־ֿ׀ׁׂ׃ׅׄ׆ׇאבגדהוזחטיךכלםמןנסעףפץצקרשתװױײ׳״؋،؍؎؏ؐؑؒؓؔؕ؛؞؟ءآأؤإئابةتثجحخدذرزسشصضطظعغـفقكلمنهوىيًٌٍَُِّْٕٖٜٓٔٗ٘ٙٚٛٝٞ٠١٢٣٤٥٦٧٨٩٪٫٬٭ٮٯٰٱٲٳٴٵٶٷٸٹٺٻټٽپٿڀځڂڃڄڅچڇڈډڊڋڌڍڎڏڐڑڒړڔڕږڗژڙښڛڜڝڞڟڠڡڢڣڤڥڦڧڨکڪګڬڭڮگڰڱڲڳڴڵڶڷڸڹںڻڼڽھڿۀہۂۃۄۅۆۇۈۉۊۋیۍێۏېۑےۓ۔ەۖۗۘۙۚۛۜ۞ۣ۟۠ۡۢۤۥۦۧۨ۩۪ۭ۫۬ۮۯ۰۱۲۳۴۵۶۷۸۹ۺۻۼ۽۾ۿ܀܁܂܃܄܅܆܇܈܉܊܋܌܍ܐܑܒܓܔܕܖܗܘܙܚܛܜܝܞܟܠܡܢܣܤܥܦܧܨܩܪܫܬܭܮܯܱܴܷܸܹܻܼܾ݂݄݆݈ܰܲܳܵܶܺܽܿ݀݁݃݅݇݉݊ݍݎݏݐݑݒݓݔݕݖݗݘݙݚݛݜݝݞݟݠݡݢݣݤݥݦݧݨݩݪݫݬݭހށނރބޅކއވމފދތލގޏސޑޒޓޔޕޖޗޘޙޚޛޜޝޞޟޠޡޢޣޤޥަާިީުޫެޭޮޯްޱँंःऄअआइईउऊऋऌऍऎएऐऑऒओऔकखगघङचछजझञटठडढणतथदधनऩपफबभमयरऱलळऴवशषसह़ऽािीुूृॄॅॆेैॉॊोौ्ॐ॒॑॓॔क़ख़ग़ज़ड़ढ़फ़य़ॠॡॢॣ।॥०१२३४५६७८९॰ॽঁংঃঅআইঈউঊঋঌএঐওঔকখগঘঙচছজঝঞটঠডঢণতথদধনপফবভমযরলশষসহ়ঽািীুূৃৄেৈোৌ্ৎৗড়ঢ়য়ৠৡৢৣ০১২৩৪৫৬৭৮৯ৰৱ৲৳৴৵৶৷৸৹৺ਁਂਃਅਆਇਈਉਊਏਐਓਔਕਖਗਘਙਚਛਜਝਞਟਠਡਢਣਤਥਦਧਨਪਫਬਭਮਯਰਲਲ਼ਵਸ਼ਸਹ਼ਾਿੀੁੂੇੈੋੌ੍ਖ਼ਗ਼ਜ਼ੜਫ਼੦੧੨੩੪੫੬੭੮੯ੰੱੲੳੴઁંઃઅઆઇઈઉઊઋઌઍએઐઑઓઔકખગઘઙચછજઝઞટઠડઢણતથદધનપફબભમયરલળવશષસહ઼ઽાિીુૂૃૄૅેૈૉોૌ્ૐૠૡૢૣ૦૧૨૩૪૫૬૭૮૯૱ଁଂଃଅଆଇଈଉଊଋଌଏଐଓଔକଖଗଘଙଚଛଜଝଞଟଠଡଢଣତଥଦଧନପଫବଭମଯରଲଳଵଶଷସହ଼ଽାିୀୁୂୃେୈୋୌ୍ୖୗଡ଼ଢ଼ୟୠୡ୦୧୨୩୪୫୬୭୮୯୰ୱஂஃஅஆஇஈஉஊஎஏஐஒஓஔகஙசஜஞடணதநனபமயரறலளழவஶஷஸஹாிீுூெேைொோௌ்ௗ௦௧௨௩௪௫௬௭௮௯௰௱௲௳௴௵௶௷௸௹௺ఁంఃఅఆఇఈఉఊఋఌఎఏఐఒఓఔకఖగఘఙచఛజఝఞటఠడఢణతథదధనపఫబభమయరఱలళవశషసహాిీుూృౄెేైొోౌ్ౕౖౠౡ౦౧౨౩౪౫౬౭౮౯ಂಃಅಆಇಈಉಊಋಌಎಏಐಒಓಔಕಖಗಘಙಚಛಜಝಞಟಠಡಢಣತಥದಧನಪಫಬಭಮಯರಱಲಳವಶಷಸಹ಼ಽಾಿೀುೂೃೄೆೇೈೊೋೌ್ೕೖೞೠೡ೦೧೨೩೪೫೬೭೮೯ംഃഅആഇഈഉഊഋഌഎഏഐഒഓഔകഖഗഘങചഛജഝഞടഠഡഢണതഥദധനപഫബഭമയരറലളഴവശഷസഹാിീുൂൃെേൈൊോൌ്ൗൠൡ൦൧൨൩൪൫൬൭൮൯ංඃඅආඇඈඉඊඋඌඍඎඏඐඑඒඓඔඕඖකඛගඝඞඟචඡජඣඤඥඦටඨඩඪණඬතථදධනඳපඵබභමඹයරලවශෂසහළෆ්ාැෑිීුූෘෙේෛොෝෞෟෲෳ෴กขฃคฅฆงจฉชซฌญฎฏฐฑฒณดตถทธนบปผฝพฟภมยรฤลฦวศษสหฬอฮฯะัาำิีึืฺุู฿เแโใไๅๆ็่้๊๋์ํ๎๏๐๑๒๓๔๕๖๗๘๙๚๛ກຂຄງຈຊຍດຕຖທນບປຜຝພຟມຢຣລວສຫອຮຯະັາຳິີຶືຸູົຼຽເແໂໃໄໆ່້໊໋໌ໍ໐໑໒໓໔໕໖໗໘໙ໜໝༀ༁༂༃༄༅༆༇༈༉༊་༌།༎༏༐༑༒༓༔༕༖༗༘༙༚༛༜༝༞༟༠༡༢༣༤༥༦༧༨༩༪༫༬༭༮༯༰༱༲༳༴༵༶༷༸༹༺༻༼༽༾༿ཀཁགགྷངཅཆཇཉཊཋཌཌྷཎཏཐདདྷནཔཕབབྷམཙཚཛཛྷཝཞཟའཡརལཤཥསཧཨཀྵཪཱཱཱིིུུྲྀཷླྀཹེཻོཽཾཿ྄ཱྀྀྂྃ྅྆྇ྈྉྊྋྐྑྒྒྷྔྕྖྗྙྚྛྜྜྷྞྟྠྡྡྷྣྤྥྦྦྷྨྩྪྫྫྷྭྮྯྰྱྲླྴྵྶྷྸྐྵྺྻྼ྾྿࿀࿁࿂࿃࿄࿅࿆࿇࿈࿉࿊࿋࿌࿏࿐࿑
(Score: 2) by martyb on Tuesday July 01 2014, @12:02PM
Plain Old Text; no preview
(Score: 2) by The Mighty Buzzard on Wednesday July 02 2014, @10:08AM
How about this?
(Score: 2) by martyb on Tuesday July 01 2014, @02:30PM
http://www.cl.cam.ac.uk/~mgk25/ucs/examples/quickbrown.txt [cam.ac.uk]
Sentences that contain all letters commonly used in a language
--------------------------------------------------------------
Markus Kuhn -- 2012-04-11
This is an example of a plain-text file encoded in UTF-8.
Danish (da)
---------
Quizdeltagerne spiste jordbær med fløde, mens cirkusklovnen
Wolther spillede på xylofon.
(= Quiz contestants were eating strawbery with cream while Wolther
the circus clown played on xylophone.)
German (de)
-----------
Falsches Üben von Xylophonmusik quält jeden größeren Zwerg
(= Wrongful practicing of xylophone music tortures every larger dwarf)
Zwölf Boxkämpfer jagten Eva quer über den Sylter Deich
(= Twelve boxing fighters hunted Eva across the dike of Sylt)
Heizölrückstoßabdämpfung
(= fuel oil recoil absorber)
(jqvwxy missing, but all non-ASCII letters in one word)
Greek (el)
----------
Γαζέες καὶ μυρτιὲς δὲν θὰ βρῶ πιὰ στὸ χρυσαφὶ ξέφωτο
(= No more shall I see acacias or myrtles in the golden clearing)
Ξεσκεπάζω τὴν ψυχοφθόρα βδελυγμία
(= I uncover the soul-destroying abhorrence)
English (en)
------------
The quick brown fox jumps over the lazy dog
Spanish (es)
------------
El pingüino Wenceslao hizo kilómetros bajo exhaustiva lluvia y
frío, añoraba a su querido cachorro.
(Contains every letter and every accent, but not every combination
of vowel + acute.)
French (fr)
-----------
Portez ce vieux whisky au juge blond qui fume sur son île intérieure, à
côté de l'alcôve ovoïde, où les bûches se consument dans l'âtre, ce
qui lui permet de penser à la cænogenèse de l'être dont il est question
dans la cause ambiguë entendue à Moÿ, dans un capharnaüm qui,
pense-t-il, diminue çà et là la qualité de son œuvre.
l'île exiguë
Où l'obèse jury mûr
Fête l'haï volapük,
Âne ex aéquo au whist,
Ôtez ce vœu déçu.
Le cœur déçu mais l'âme plutôt naïve, Louÿs rêva de crapaüter en
canoë au delà des îles, près du mälström où brûlent les novæ.
Irish Gaelic (ga)
-----------------
D'fhuascail Íosa, Úrmhac na hÓighe Beannaithe, pór Éava agus Ádhaimh
Hungarian (hu)
--------------
Árvíztűrő tükörfúrógép
(= flood-proof mirror-drilling machine, only all non-ASCII letters)
Icelandic (is)
--------------
Kæmi ný öxi hér ykist þjófum nú bæði víl og ádrepa
Sævör grét áðan því úlpan var ónýt
(some ASCII letters missing)
Japanese (jp)
-------------
Hiragana: (Iroha)
いろはにほへとちりぬるを
わかよたれそつねならむ
うゐのおくやまけふこえて
あさきゆめみしゑひもせす
Katakana:
イロハニホヘト チリヌルヲ ワカヨタレソ ツネナラム
ウヰノオクヤマ ケフコエテ アサキユメミシ ヱヒモセスン
Hebrew (iw)
-----------
? דג סקרן שט בים מאוכזב ולפתע מצא לו חברה איך הקליטה
Polish (pl)
-----------
Pchnąć w tę łódź jeża lub ośm skrzyń fig
(= To push a hedgehog or eight bins of figs in this boat)
Russian (ru)
------------
В чащах юга жил бы цитрус? Да, но фальшивый экземпляр!
(= Would a citrus live in the bushes of south? Yes, but only a fake one!)
Съешь же ещё этих мягких французских булок да выпей чаю
(= Eat some more of these fresh French loafs and have some tea)
Thai (th)
---------
[--------------------------|------------------------]
๏ เป็นมนุษย์สุดประเสริฐเลิศคุณค่า กว่าบรรดาฝูงสัตว์เดรัจฉาน
จงฝ่าฟันพัฒนาวิชาการ อย่าล้างผลาญฤๅเข่นฆ่าบีฑาใคร
ไม่ถือโทษโกรธแช่งซัดฮึดฮัดด่า หัดอภัยเหมือนกีฬาอัชฌาสัย
ปฏิบัติประพฤติกฎกำหนดใจ พูดจาให้จ๊ะๆ จ๋าๆ น่าฟังเอย ฯ
[The copyright for the Thai example is owned by The Computer
Association of Thailand under the Royal Patronage of His Majesty the
King.]
Turkish (tr)
------------
Pijamalı hasta, yağız şoföre çabucak güvendi.
(=Patient with pajamas, trusted swarthy driver quickly)
Special thanks to the people from all over the world who contributed
these sentences since 1999.
A much larger collection of such pangrams is now available at
http://en.wikipedia.org/wiki/List_of_pangrams [wikipedia.org]
(Score: 2) by paulej72 on Tuesday July 01 2014, @04:24PM
Team Leader for SN Development Dev Server [soylentnews.org]
(Score: 2) by martyb on Tuesday July 01 2014, @03:26PM
<stderr>: Something like this? moc.elgoog.ln
? moc.elgoog.ln [soylentnews.org] this should be interesting.
<stderr> That should have been ln, not nl. :-/
(Score: 2) by The Mighty Buzzard on Wednesday July 02 2014, @09:25AM
Yep, those should have been stripped by bad_numeric but weren't.
123
456
789
(Score: 2) by martyb on Thursday July 03 2014, @05:09PM
!"#$%'()*+,-./0123456789:;?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~
¡¢£¤¥¦§¨©ª«¬®¯°±²³´µ¶·¸¹º»¼½¾¿ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖ×ØÙÚÛÜÝÞßàáâãäåæçèéêëìíîïðñòóôõö÷øùúûüýþÿĀāĂ㥹ĆćĈĉĊċČčĎďĐđĒēĔĕĖėĘęĚěĜĝĞğĠġĢģĤĥĦħĨĩĪīĬĭĮįİıIJijĴĵĶķĸĹĺĻļĽľĿŀŁłŃńŅņŇňʼnŊŋŌōŎŏŐőŒœŔŕŖŗŘřŚśŜŝŞşŠšŢţŤťŦŧŨũŪūŬŭŮůŰűŲųŴŵŶŷŸŹźŻżŽžſƀƁƂƃƄƅƆƇƈƉƊƋƌƍƎƏƐƑƒƓƔƕƖƗƘƙƚƛƜƝƞƟƠơƢƣƤƥƦƧƨƩƪƫƬƭƮƯưƱƲƳƴƵƶƷƸƹƺƻƼƽƾƿǀǁǂǃDŽDždžLJLjljNJNjnjǍǎǏǐǑǒǓǔǕǖǗǘǙǚǛǜǝǞǟǠǡǢǣǤǥǦǧǨǩǪǫǬǭǮǯǰDZDzdzǴǵǶǷǸǹǺǻǼǽǾǿȀȁȂȃȄȅȆȇȈȉȊȋȌȍȎȏȐȑȒȓȔȕȖȗȘșȚțȜȝȞȟȠȡȢȣȤȥȦȧȨȩȪȫȬȭȮȯȰȱȲȳȴȵȶȷȸȹȺȻȼȽȾȿɀɁɂɃɄɅɆɇɈɉɊɋɌɍɎɏɐɑɒɓɔɕɖɗɘəɚɛɜɝɞɟɠɡɢɣɤɥɦɧɨɩɪɫɬɭɮɯɰɱɲɳɴɵɶɷɸɹɺɻɼɽɾɿʀʁʂʃʄʅʆʇʈʉʊʋʌʍʎʏʐʑʒʓʔʕʖʗʘʙʚʛʜʝʞʟʠʡʢʣʤʥʦʧʨʩʪʫʬʭʮʯʰʱʲʳʴʵʶʷʸʹʺʻʼʽʾʿˀˁ˂˃˄˅ˆˇˈˉˊˋˌˍˎˏːˑ˒˓˔˕˖˗˘˙˚˛˜˝˞˟ˠˡˢˣˤ˥˦˧˨˩˪˫ˬ˭ˮ˯˰˱˲˳˴˵˶˷˸˹˺˻˼˽˾˿̴̵̶̷̸̡̢̧̨̛̖̗̘̙̜̝̞̟̠̣̤̥̦̩̪̫̬̭̮̯̰̱̲̳̹̺̻̼͇͈͉͍͎̀́̂̃̄̅̆̇̈̉̊̋̌̍̎̏̐̑̒̓̔̽̾̿̀́͂̓̈́͆͊͋͌̕̚ͅ͏͓͔͕͖͙͚͐͑͒͗͛ͣͤͥͦͧͨͩͪͫͬͭͮͯ͘͜͟͢͝͞͠͡ͰͱͲͳʹ͵Ͷͷͺͻͼͽ;Ϳ΄΅Ά·ΈΉΊΌΎΏΐΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩΪΫάέήίΰαβγδεζηθικλμνξοπρςστυφχψωϊϋόύώϏϐϑϒϓϔϕϖϗϘϙϚϛϜϝϞϟϠϡϢϣϤϥϦϧϨϩϪϫϬϭϮϯϰϱϲϳϴϵ϶ϷϸϹϺϻϼϽϾϿЀЁЂЃЄЅІЇЈЉЊЋЌЍЎЏАБВГДЕЖЗИЙКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯабвгдежзийклмнопрстуфхцчшщъыьэюяѐёђѓєѕіїјљњћќѝўџѠѡѢѣѤѥѦѧѨѩѪѫѬѭѮѯѰѱѲѳѴѵѶѷѸѹѺѻѼѽѾѿҀҁ҂҃҄҅҆҇҈҉ҊҋҌҍҎҏҐґҒғҔҕҖҗҘҙҚқҜҝҞҟҠҡҢңҤҥҦҧҨҩҪҫҬҭҮүҰұҲҳҴҵҶҷҸҹҺһҼҽҾҿӀӁӂӃӄӅӆӇӈӉӊӋӌӍӎӏӐӑӒӓӔӕӖӗӘәӚӛӜӝӞӟӠӡӢӣӤӥӦӧӨөӪӫӬӭӮӯӰӱӲӳӴӵӶӷӸӹӺӻӼӽӾӿԀԁԂԃԄԅԆԇԈԉԊԋԌԍԎԏԐԑԒԓԔԕԖԗԘԙԚԛԜԝԞԟԠԡԢԣԤԥԦԧԨԩԪԫԬԭԮԯԱԲԳԴԵԶԷԸԹԺԻԼԽԾԿՀՁՂՃՄՅՆՇՈՉՊՋՌՍՎՏՐՑՒՓՔՕՖՙ՚՛՜՝՞՟ՠաբգդեզէըթժիլխծկհձղճմյնշոչպջռսվտրցւփքօֆևֈ։֊֍֎֏ְֱֲֳִֵֶַָֹֺֻּֽ֑֖֛֢֣֤֥֦֧֪֚֭֮֒֓֔֕֗֘֙֜֝֞֟֠֡֨֩֫֬֯־ֿ׀ׁׂ׃ׅׄ׆ׇאבגדהוזחטיךכלםמןנסעףפץצקרשתׯװױײ׳״؆؇؈؉؊؋،؍؎؏ؘؙؚؐؑؒؓؔؕؖؗ؛؝؞؟ؠءآأؤإئابةتثجحخدذرزسشصضطظعغػؼؽؾؿـفقكلمنهوىيًٌٍَُِّْٕٖٜٟٓٔٗ٘ٙٚٛٝٞ٠١٢٣٤٥٦٧٨٩٪٫٬٭ٮٯٰٱٲٳٴٵٶٷٸٹٺٻټٽپٿڀځڂڃڄڅچڇڈډڊڋڌڍڎڏڐڑڒړڔڕږڗژڙښڛڜڝڞڟڠڡڢڣڤڥڦڧڨکڪګڬڭڮگڰڱڲڳڴڵڶڷڸڹںڻڼڽھڿۀہۂۃۄۅۆۇۈۉۊۋیۍێۏېۑےۓ۔ەۖۗۘۙۚۛۜ۞ۣ۟۠ۡۢۤۥۦۧۨ۩۪ۭ۫۬ۮۯ۰۱۲۳۴۵۶۷۸۹ۺۻۼ۽۾ۿ܀܁܂܃܄܅܆܇܈܉܊܋܌܍ܐܑܒܓܔܕܖܗܘܙܚܛܜܝܞܟܠܡܢܣܤܥܦܧܨܩܪܫܬܭܮܯܱܴܷܸܹܻܼܾ݂݄݆݈ܰܲܳܵܶܺܽܿ݀݁݃݅݇݉݊ݍݎݏݐݑݒݓݔݕݖݗݘݙݚݛݜݝݞݟݠݡݢݣݤݥݦݧݨݩݪݫݬݭݮݯݰݱݲݳݴݵݶݷݸݹݺݻݼݽݾݿހށނރބޅކއވމފދތލގޏސޑޒޓޔޕޖޗޘޙޚޛޜޝޞޟޠޡޢޣޤޥަާިީުޫެޭޮޯްޱ߀߁߂߃߄߅߆߇߈߉ߊߋߌߍߎߏߐߑߒߓߔߕߖߗߘߙߚߛߜߝߞߟߠߡߢߣߤߥߦߧߨߩߪ߲߫߬߭߮߯߰߱߳ߴߵ߶߷߸߹ߺ߽߾߿
(Score: 2) by martyb on Thursday July 03 2014, @11:24PM
%HTMLlat1;
--
no-break space = non-breaking space, U+00A0 ISOnum:
" " = raw bytes;
" " =
" " =
" " =
inverted exclamation mark, U+00A1 ISOnum:
"¡" = raw bytes;
"¡" =
"¡" =
"¡" =
cent sign, U+00A2 ISOnum:
"¢" = raw bytes;
"¢" =
"¢" =
"¢" =
pound sign, U+00A3 ISOnum:
"£" = raw bytes;
"£" =
"£" =
"£" =
currency sign, U+00A4 ISOnum:
"¤" = raw bytes;
"¤" =
"¤" =
"¤" =
yen sign = yuan sign, U+00A5 ISOnum:
"¥" = raw bytes;
"¥" =
"¥" =
"¥" =
broken bar = broken vertical bar, U+00A6 ISOnum:
"¦" = raw bytes;
"¦" = |
"¦" = |
"¦" = |
section sign, U+00A7 ISOnum:
"§" = raw bytes;
"§" =
"§" =
"§" =
diaeresis = spacing diaeresis, U+00A8 ISOdia:
"¨" = raw bytes;
"¨" =
"¨" =
"¨" =
copyright sign, U+00A9 ISOnum:
"©" = raw bytes;
"©" = (C)
"©" = (C)
"©" = (C)
feminine ordinal indicator, U+00AA ISOnum:
"ª" = raw bytes;
"ª" =
"ª" =
"ª" =
left-pointing double angle quotation mark = left pointing guillemet, U+00AB ISOnum:
"«" = raw bytes;
"«" =
"«" =
"«" =
not sign, U+00AC ISOnum:
"¬" = raw bytes;
"¬" =
"¬" =
"¬" =
soft hyphen = discretionary hyphen, U+00AD ISOnum:
"" = raw bytes;
"" =
"" =
"" =
registered sign = registered trade mark sign, U+00AE ISOnum:
"®" = raw bytes;
"®" = (R)
"®" = (R)
"®" = (R)
macron = spacing macron = overline = APL overbar, U+00AF ISOdia:
"¯" = raw bytes;
"¯" =
"¯" =
"¯" =
degree sign, U+00B0 ISOnum:
"°" = raw bytes;
"°" =
"°" =
"°" =
plus-minus sign = plus-or-minus sign, U+00B1 ISOnum:
"±" = raw bytes;
"±" = +/-
"±" = +/-
"±" = +/-
superscript two = superscript digit two = squared, U+00B2 ISOnum:
"²" = raw bytes;
"²" =
"²" =
"²" =
superscript three = superscript digit three = cubed, U+00B3 ISOnum:
"³" = raw bytes;
"³" =
"³" =
"³" =
acute accent = spacing acute, U+00B4 ISOdia:
"´" = raw bytes;
"´" =
"´" =
"´" =
micro sign, U+00B5 ISOnum:
"µ" = raw bytes;
"µ" =
"µ" =
"µ" =
pilcrow sign = paragraph sign, U+00B6 ISOnum:
"¶" = raw bytes;
"¶" =
"¶" =
"¶" =
middle dot = Georgian comma = Greek middle dot, U+00B7 ISOnum:
"·" = raw bytes;
"·" =
"·" =
"·" =
cedilla = spacing cedilla, U+00B8 ISOdia:
"¸" = raw bytes;
"¸" =
"¸" =
"¸" =
superscript one = superscript digit one, U+00B9 ISOnum:
"¹" = raw bytes;
"¹" =
"¹" =
"¹" =
masculine ordinal indicator, U+00BA ISOnum:
"º" = raw bytes;
"º" =
"º" =
"º" =
right-pointing double angle quotation mark = right pointing guillemet, U+00BB ISOnum:
"»" = raw bytes;
"»" =
"»" =
"»" =
vulgar fraction one quarter = fraction one quarter, U+00BC ISOnum:
"¼" = raw bytes;
"¼" = 1/4
"¼" = 1/4
"¼" = 1/4
vulgar fraction one half = fraction one half, U+00BD ISOnum:
"½" = raw bytes;
"½" = 1/2
"½" = 1/2
"½" = 1/2
vulgar fraction three quarters = fraction three quarters, U+00BE ISOnum:
"¾" = raw bytes;
"¾" = 3/4
"¾" = 3/4
"¾" = 3/4
inverted question mark = turned question mark, U+00BF ISOnum:
"¿" = raw bytes;
"¿" =
"¿" =
"¿" =
latin capital letter A with grave = latin capital letter A grave, U+00C0 ISOlat1:
"À" = raw bytes;
"À" = A
"À" = A
"À" = A
latin capital letter A with acute, U+00C1 ISOlat1:
"Á" = raw bytes;
"Á" = A
"Á" = A
"Á" = A
latin capital letter A with circumflex, U+00C2 ISOlat1:
"Â" = raw bytes;
"Â" = A
"Â" = A
"Â" = A
latin capital letter A with tilde, U+00C3 ISOlat1:
"Ã" = raw bytes;
"Ã" = A
"Ã" = A
"Ã" = A
latin capital letter A with diaeresis, U+00C4 ISOlat1:
"Ä" = raw bytes;
"Ä" = A
"Ä" = A
"Ä" = A
latin capital letter A with ring above = latin capital letter A ring, U+00C5 ISOlat1:
"Å" = raw bytes;
"Å" = A
"Å" = A
"Å" = A
latin capital letter AE = latin capital ligature AE, U+00C6 ISOlat1:
"Æ" = raw bytes;
"Æ" = AE
"Æ" = AE
"Æ" = AE
latin capital letter C with cedilla, U+00C7 ISOlat1:
"Ç" = raw bytes;
"Ç" = C
"Ç" = C
"Ç" = C
latin capital letter E with grave, U+00C8 ISOlat1:
"È" = raw bytes;
"È" = E
"È" = E
"È" = E
latin capital letter E with acute, U+00C9 ISOlat1:
"É" = raw bytes;
"É" = E
"É" = E
"É" = E
latin capital letter E with circumflex, U+00CA ISOlat1:
"Ê" = raw bytes;
"Ê" = E
"Ê" = E
"Ê" = E
latin capital letter E with diaeresis, U+00CB ISOlat1:
"Ë" = raw bytes;
"Ë" = E
"Ë" = E
"Ë" = E
latin capital letter I with grave, U+00CC ISOlat1:
"Ì" = raw bytes;
"Ì" = I
"Ì" = I
"Ì" = I
latin capital letter I with acute, U+00CD ISOlat1:
"Í" = raw bytes;
"Í" = I
"Í" = I
"Í" = I
latin capital letter I with circumflex, U+00CE ISOlat1:
"Î" = raw bytes;
"Î" = I
"Î" = I
"Î" = I
latin capital letter I with diaeresis, U+00CF ISOlat1:
"Ï" = raw bytes;
"Ï" = I
"Ï" = I
"Ï" = I
latin capital letter ETH, U+00D0 ISOlat1:
"Ð" = raw bytes;
"Ð" = D
"Ð" = D
"Ð" = D
latin capital letter N with tilde, U+00D1 ISOlat1:
"Ñ" = raw bytes;
"Ñ" = N
"Ñ" = N
"Ñ" = N
latin capital letter O with grave, U+00D2 ISOlat1:
"Ò" = raw bytes;
"Ò" = O
"Ò" = O
"Ò" = O
latin capital letter O with acute, U+00D3 ISOlat1:
"Ó" = raw bytes;
"Ó" = O
"Ó" = O
"Ó" = O
latin capital letter O with circumflex, U+00D4 ISOlat1:
"Ô" = raw bytes;
"Ô" = O
"Ô" = O
"Ô" = O
latin capital letter O with tilde, U+00D5 ISOlat1:
"Õ" = raw bytes;
"Õ" = O
"Õ" = O
"Õ" = O
latin capital letter O with diaeresis, U+00D6 ISOlat1:
"Ö" = raw bytes;
"Ö" = O
"Ö" = O
"Ö" = O
multiplication sign, U+00D7 ISOnum:
"×" = raw bytes;
"×" = x
"×" = x
"×" = x
latin capital letter O with stroke = latin capital letter O slash, U+00D8 ISOlat1:
"Ø" = raw bytes;
"Ø" = O
"Ø" = O
"Ø" = O
latin capital letter U with grave, U+00D9 ISOlat1:
"Ù" = raw bytes;
"Ù" = U
"Ù" = U
"Ù" = U
latin capital letter U with acute, U+00DA ISOlat1:
"Ú" = raw bytes;
"Ú" = U
"Ú" = U
"Ú" = U
latin capital letter U with circumflex, U+00DB ISOlat1:
"Û" = raw bytes;
"Û" = U
"Û" = U
"Û" = U
latin capital letter U with diaeresis, U+00DC ISOlat1:
"Ü" = raw bytes;
"Ü" = U
"Ü" = U
"Ü" = U
latin capital letter Y with acute, U+00DD ISOlat1:
"Ý" = raw bytes;
"Ý" = Y
"Ý" = Y
"Ý" = Y
latin capital letter THORN, U+00DE ISOlat1:
"Þ" = raw bytes;
"Þ" =
"Þ" =
"Þ" =
latin small letter sharp s = ess-zed, U+00DF ISOlat1:
"ß" = raw bytes;
"ß" = B
"ß" = B
"ß" = B
latin small letter a with grave = latin small letter a grave, U+00E0 ISOlat1:
"à" = raw bytes;
"à" = a
"à" = a
"à" = a
latin small letter a with acute, U+00E1 ISOlat1:
"á" = raw bytes;
"á" = a
"á" = a
"á" = a
latin small letter a with circumflex, U+00E2 ISOlat1:
"â" = raw bytes;
"â" = a
"â" = a
"â" = a
latin small letter a with tilde, U+00E3 ISOlat1:
"ã" = raw bytes;
"ã" = a
"ã" = a
"ã" = a
latin small letter a with diaeresis, U+00E4 ISOlat1:
"ä" = raw bytes;
"ä" = a
"ä" = a
"ä" = a
latin small letter a with ring above = latin small letter a ring, U+00E5 ISOlat1:
"å" = raw bytes;
"å" = a
"å" = a
"å" = a
latin small letter ae = latin small ligature ae, U+00E6 ISOlat1:
"æ" = raw bytes;
"æ" = ae
"æ" = ae
"æ" = ae
latin small letter c with cedilla, U+00E7 ISOlat1:
"ç" = raw bytes;
"ç" = c
"ç" = c
"ç" = c
latin small letter e with grave, U+00E8 ISOlat1:
"è" = raw bytes;
"è" = e
"è" = e
"è" = e
latin small letter e with acute, U+00E9 ISOlat1:
"é" = raw bytes;
"é" = e
"é" = e
"é" = e
latin small letter e with circumflex, U+00EA ISOlat1:
"ê" = raw bytes;
"ê" = e
"ê" = e
"ê" = e
latin small letter e with diaeresis, U+00EB ISOlat1:
"ë" = raw bytes;
"ë" = e
"ë" = e
"ë" = e
latin small letter i with grave, U+00EC ISOlat1:
"ì" = raw bytes;
"ì" = i
"ì" = i
"ì" = i
latin small letter i with acute, U+00ED ISOlat1:
"í" = raw bytes;
"í" = i
"í" = i
"í" = i
latin small letter i with circumflex, U+00EE ISOlat1:
"î" = raw bytes;
"î" = i
"î" = i
"î" = i
latin small letter i with diaeresis, U+00EF ISOlat1:
"ï" = raw bytes;
"ï" = i
"ï" = i
"ï" = i
latin small letter eth, U+00F0 ISOlat1:
"ð" = raw bytes;
"ð" = d
"ð" = d
"ð" = d
latin small letter n with tilde, U+00F1 ISOlat1:
"ñ" = raw bytes;
"ñ" = n
"ñ" = n
"ñ" = n
latin small letter o with grave, U+00F2 ISOlat1:
"ò" = raw bytes;
"ò" = o
"ò" = o
"ò" = o
latin small letter o with acute, U+00F3 ISOlat1:
"ó" = raw bytes;
"ó" = o
"ó" = o
"ó" = o
latin small letter o with circumflex, U+00F4 ISOlat1:
"ô" = raw bytes;
"ô" = o
"ô" = o
"ô" = o
latin small letter o with tilde, U+00F5 ISOlat1:
"õ" = raw bytes;
"õ" = o
"õ" = o
"õ" = o
latin small letter o with diaeresis, U+00F6 ISOlat1:
"ö" = raw bytes;
"ö" = o
"ö" = o
"ö" = o
division sign, U+00F7 ISOnum:
"÷" = raw bytes;
"÷" = /
"÷" = /
"÷" = /
latin small letter o with stroke, = latin small letter o slash, U+00F8 ISOlat1:
"ø" = raw bytes;
"ø" = o
"ø" = o
"ø" = o
latin small letter u with grave, U+00F9 ISOlat1:
"ù" = raw bytes;
"ù" = u
"ù" = u
"ù" = u
latin small letter u with acute, U+00FA ISOlat1:
"ú" = raw bytes;
"ú" = u
"ú" = u
"ú" = u
latin small letter u with circumflex, U+00FB ISOlat1:
"û" = raw bytes;
"û" = u
"û" = u
"û" = u
latin small letter u with diaeresis, U+00FC ISOlat1:
"ü" = raw bytes;
"ü" = u
"ü" = u
"ü" = u
latin small letter y with acute, U+00FD ISOlat1:
"ý" = raw bytes;
"ý" = y
"ý" = y
"ý" = y
latin small letter thorn, U+00FE ISOlat1:
"þ" = raw bytes;
"þ" =
"þ" =
"þ" =
latin small letter y with diaeresis, U+00FF ISOlat1:
"ÿ" = raw bytes;
"ÿ" = y
"ÿ" = y
"ÿ" = y
%HTMLsymbol; --
latin small f with hook = function = florin, U+0192 ISOtech:
"ƒ" = raw bytes;
"ƒ" = ƒ
"ƒ" =
"ƒ" =
greek capital letter alpha, U+0391:
"Α" = raw bytes;
"Α" = Α
"Α" =
"Α" =
greek capital letter beta, U+0392:
"Β" = raw bytes;
"Β" = Β
"Β" =
"Β" =
greek capital letter gamma, U+0393 ISOgrk3:
"Γ" = raw bytes;
"Γ" = Γ
"Γ" =
"Γ" =
greek capital letter delta, U+0394 ISOgrk3:
"Δ" = raw bytes;
"Δ" = Δ
"Δ" =
"Δ" =
greek capital letter epsilon, U+0395:
"Ε" = raw bytes;
"Ε" = Ε
"Ε" =
"Ε" =
greek capital letter zeta, U+0396:
"Ζ" = raw bytes;
"Ζ" = Ζ
"Ζ" =
"Ζ" =
greek capital letter eta, U+0397:
"Η" = raw bytes;
"Η" = Η
"Η" =
"Η" =
greek capital letter theta, U+0398 ISOgrk3:
"Θ" = raw bytes;
"Θ" = Θ
"Θ" =
"Θ" =
greek capital letter iota, U+0399:
"Ι" = raw bytes;
"Ι" = Ι
"Ι" =
"Ι" =
greek capital letter kappa, U+039A:
"Κ" = raw bytes;
"Κ" = Κ
"Κ" =
"Κ" =
greek capital letter lambda, U+039B ISOgrk3:
"Λ" = raw bytes;
"Λ" = Λ
"Λ" =
"Λ" =
greek capital letter mu, U+039C:
"Μ" = raw bytes;
"Μ" = Μ
"Μ" =
"Μ" =
greek capital letter nu, U+039D:
"Ν" = raw bytes;
"Ν" = Ν
"Ν" =
"Ν" =
greek capital letter xi, U+039E ISOgrk3:
"Ξ" = raw bytes;
"Ξ" = Ξ
"Ξ" =
"Ξ" =
greek capital letter omicron, U+039F:
"Ο" = raw bytes;
"Ο" = Ο
"Ο" =
"Ο" =
greek capital letter pi, U+03A0 ISOgrk3:
"Π" = raw bytes;
"Π" = Π
"Π" =
"Π" =
greek capital letter rho, U+03A1:
"Ρ" = raw bytes;
"Ρ" = Ρ
"Ρ" =
"Ρ" =
greek capital letter sigma, U+03A3 ISOgrk3:
"Σ" = raw bytes;
"Σ" = Σ
"Σ" =
"Σ" =
greek capital letter tau, U+03A4:
"Τ" = raw bytes;
"Τ" = Τ
"Τ" =
"Τ" =
greek capital letter upsilon, U+03A5 ISOgrk3:
"Υ" = raw bytes;
"Υ" = Υ
"Υ" =
"Υ" =
greek capital letter phi, U+03A6 ISOgrk3:
"Φ" = raw bytes;
"Φ" = Φ
"Φ" =
"Φ" =
greek capital letter chi, U+03A7:
"Χ" = raw bytes;
"Χ" = Χ
"Χ" =
"Χ" =
greek capital letter psi, U+03A8 ISOgrk3:
"Ψ" = raw bytes;
"Ψ" = Ψ
"Ψ" =
"Ψ" =
greek capital letter omega, U+03A9 ISOgrk3:
"Ω" = raw bytes;
"Ω" = Ω
"Ω" =
"Ω" =
greek small letter alpha, U+03B1 ISOgrk3:
"α" = raw bytes;
"α" = α
"α" =
"α" =
greek small letter beta, U+03B2 ISOgrk3:
"β" = raw bytes;
"β" = β
"β" =
"β" =
greek small letter gamma, U+03B3 ISOgrk3:
"γ" = raw bytes;
"γ" = γ
"γ" =
"γ" =
greek small letter delta, U+03B4 ISOgrk3:
"δ" = raw bytes;
"δ" = δ
"δ" =
"δ" =
greek small letter epsilon, U+03B5 ISOgrk3:
"ε" = raw bytes;
"ε" = ε
"ε" =
"ε" =
greek small letter zeta, U+03B6 ISOgrk3:
"ζ" = raw bytes;
"ζ" = ζ
"ζ" =
"ζ" =
greek small letter eta, U+03B7 ISOgrk3:
"η" = raw bytes;
"η" = η
"η" =
"η" =
greek small letter theta, U+03B8 ISOgrk3:
"θ" = raw bytes;
"θ" = θ
"θ" =
"θ" =
greek small letter iota, U+03B9 ISOgrk3:
"ι" = raw bytes;
"ι" = ι
"ι" =
"ι" =
greek small letter kappa, U+03BA ISOgrk3:
"κ" = raw bytes;
"κ" = κ
"κ" =
"κ" =
greek small letter lambda, U+03BB ISOgrk3:
"λ" = raw bytes;
"λ" = λ
"λ" =
"λ" =
greek small letter mu, U+03BC ISOgrk3:
"μ" = raw bytes;
"μ" = μ
"μ" =
"μ" =
greek small letter nu, U+03BD ISOgrk3:
"ν" = raw bytes;
"ν" = ν
"ν" =
"ν" =
greek small letter xi, U+03BE ISOgrk3:
"ξ" = raw bytes;
"ξ" = ξ
"ξ" =
"ξ" =
greek small letter omicron, U+03BF NEW:
"ο" = raw bytes;
"ο" = ο
"ο" =
"ο" =
greek small letter pi, U+03C0 ISOgrk3:
"π" = raw bytes;
"π" = π
"π" =
"π" =
greek small letter rho, U+03C1 ISOgrk3:
"ρ" = raw bytes;
"ρ" = ρ
"ρ" =
"ρ" =
greek small letter final sigma, U+03C2 ISOgrk3:
"ς" = raw bytes;
"ς" = ς
"ς" =
"ς" =
greek small letter sigma, U+03C3 ISOgrk3:
"σ" = raw bytes;
"σ" = σ
"σ" =
"σ" =
greek small letter tau, U+03C4 ISOgrk3:
"τ" = raw bytes;
"τ" = τ
"τ" =
"τ" =
greek small letter upsilon, U+03C5 ISOgrk3:
"υ" = raw bytes;
"υ" = υ
"υ" =
"υ" =
greek small letter phi, U+03C6 ISOgrk3:
"φ" = raw bytes;
"φ" = φ
"φ" =
"φ" =
greek small letter chi, U+03C7 ISOgrk3:
"χ" = raw bytes;
"χ" = χ
"χ" =
"χ" =
greek small letter psi, U+03C8 ISOgrk3:
"ψ" = raw bytes;
"ψ" = ψ
"ψ" =
"ψ" =
greek small letter omega, U+03C9 ISOgrk3:
"ω" = raw bytes;
"ω" = ω
"ω" =
"ω" =
greek small letter theta symbol, U+03D1 NEW:
"ϑ" = raw bytes;
"ϑ" = ϑ
"ϑ" =
"ϑ" =
greek upsilon with hook symbol, U+03D2 NEW:
"ϒ" = raw bytes;
"ϒ" = ϒ
"ϒ" =
"ϒ" =
greek pi symbol, U+03D6 ISOgrk3:
"ϖ" = raw bytes;
"ϖ" = ϖ
"ϖ" =
"ϖ" =
bullet = black small circle, U+2022 ISOpub:
"•" = raw bytes;
"•" = •
"•" =
"•" =
horizontal ellipsis = three dot leader, U+2026 ISOpub:
"…" = raw bytes;
"…" = …
"…" =
"…" =
prime = minutes = feet, U+2032 ISOtech:
"′" = raw bytes;
"′" = ′
"′" =
"′" =
double prime = seconds = inches, U+2033 ISOtech:
"″" = raw bytes;
"″" = ″
"″" =
"″" =
overline = spacing overscore, U+203E NEW:
"‾" = raw bytes;
"‾" = ‾
"‾" =
"‾" =
fraction slash, U+2044 NEW:
"⁄" = raw bytes;
"⁄" = ⁄
"⁄" =
"⁄" =
script capital P = power set = Weierstrass p, U+2118 ISOamso:
"℘" = raw bytes;
"℘" = ℘
"℘" =
"℘" =
blackletter capital I = imaginary part, U+2111 ISOamso:
"ℑ" = raw bytes;
"ℑ" = ℑ
"ℑ" =
"ℑ" =
blackletter capital R = real part symbol, U+211C ISOamso:
"ℜ" = raw bytes;
"ℜ" = ℜ
"ℜ" =
"ℜ" =
trade mark sign, U+2122 ISOnum:
"™" = raw bytes;
"™" = ™
"™" =
"™" =
alef symbol = first transfinite cardinal, U+2135 NEW:
"ℵ" = raw bytes;
"ℵ" = ℵ
"ℵ" =
"ℵ" =
leftwards arrow, U+2190 ISOnum:
"←" = raw bytes;
"←" = ←
"←" =
"←" =
upwards arrow, U+2191:
"↑" = raw bytes;
"↑" = ↑
"↑" =
"↑" =
rightwards arrow, U+2192 ISOnum:
"→" = raw bytes;
"→" = →
"→" =
"→" =
downwards arrow, U+2193 ISOnum:
"↓" = raw bytes;
"↓" = ↓
"↓" =
"↓" =
left right arrow, U+2194 ISOamsa:
"↔" = raw bytes;
"↔" = ↔
"↔" =
"↔" =
downwards arrow with corner leftwards = carriage return, U+21B5 NEW:
"↵" = raw bytes;
"↵" = ↵
"↵" =
"↵" =
leftwards double arrow, U+21D0 ISOtech:
"⇐" = raw bytes;
"⇐" = ⇐
"⇐" =
"⇐" =
upwards double arrow, U+21D1 ISOamsa:
"⇑" = raw bytes;
"⇑" = ⇑
"⇑" =
"⇑" =
rightwards double arrow, U+21D2 ISOtech:
"⇒" = raw bytes;
"⇒" = ⇒
"⇒" =
"⇒" =
downwards double arrow, U+21D3 ISOamsa:
"⇓" = raw bytes;
"⇓" = ⇓
"⇓" =
"⇓" =
left right double arrow, U+21D4 ISOamsa:
"⇔" = raw bytes;
"⇔" = ⇔
"⇔" =
"⇔" =
for all, U+2200 ISOtech:
"∀" = raw bytes;
"∀" = ∀
"∀" =
"∀" =
partial differential, U+2202 ISOtech:
"∂" = raw bytes;
"∂" = ∂
"∂" =
"∂" =
there exists, U+2203 ISOtech:
"∃" = raw bytes;
"∃" = ∃
"∃" =
"∃" =
empty set = null set = diameter, U+2205 ISOamso:
"∅" = raw bytes;
"∅" = ∅
"∅" =
"∅" =
nabla = backward difference, U+2207 ISOtech:
"∇" = raw bytes;
"∇" = ∇
"∇" =
"∇" =
element of, U+2208 ISOtech:
"∈" = raw bytes;
"∈" = ∈
"∈" =
"∈" =
not an element of, U+2209 ISOtech:
"∉" = raw bytes;
"∉" = ∉
"∉" =
"∉" =
contains as member, U+220B ISOtech:
"∋" = raw bytes;
"∋" = ∋
"∋" =
"∋" =
n-ary product = product sign, U+220F ISOamsb:
"∏" = raw bytes;
"∏" = ∏
"∏" =
"∏" =
n-ary sumation, U+2211 ISOamsb:
"∑" = raw bytes;
"∑" = ∑
"∑" =
"∑" =
minus sign, U+2212 ISOtech:
"−" = raw bytes;
"−" = −
"−" =
"−" =
asterisk operator, U+2217 ISOtech:
"∗" = raw bytes;
"∗" = ∗
"∗" =
"∗" =
square root = radical sign, U+221A ISOtech:
"√" = raw bytes;
"√" = √
"√" =
"√" =
proportional to, U+221D ISOtech:
"∝" = raw bytes;
"∝" = ∝
"∝" =
"∝" =
infinity, U+221E ISOtech:
"∞" = raw bytes;
"∞" = ∞
"∞" =
"∞" =
angle, U+2220 ISOamso:
"∠" = raw bytes;
"∠" = ∠
"∠" =
"∠" =
logical and = wedge, U+2227 ISOtech:
"∧" = raw bytes;
"∧" = ∧
"∧" =
"∧" =
logical or = vee, U+2228 ISOtech:
"∨" = raw bytes;
"∨" = ∨
"∨" =
"∨" =
intersection = cap, U+2229 ISOtech:
"∩" = raw bytes;
"∩" = ∩
"∩" =
"∩" =
union = cup, U+222A ISOtech:
"∪" = raw bytes;
"∪" = ∪
"∪" =
"∪" =
integral, U+222B ISOtech:
"∫" = raw bytes;
"∫" = ∫
"∫" =
"∫" =
therefore, U+2234 ISOtech:
"∴" = raw bytes;
"∴" = ∴
"∴" =
"∴" =
tilde operator = varies with = similar to, U+223C ISOtech:
"∼" = raw bytes;
"∼" = ∼
"∼" =
"∼" =
approximately equal to, U+2245 ISOtech:
"≅" = raw bytes;
"≅" = ≅
"≅" =
"≅" =
almost equal to = asymptotic to, U+2248 ISOamsr:
"≈" = raw bytes;
"≈" = ≈
"≈" =
"≈" =
not equal to, U+2260 ISOtech:
"≠" = raw bytes;
"≠" = ≠
"≠" =
"≠" =
identical to, U+2261 ISOtech:
"≡" = raw bytes;
"≡" = ≡
"≡" =
"≡" =
less-than or equal to, U+2264 ISOtech:
"≤" = raw bytes;
"≤" = ≤
"≤" =
"≤" =
greater-than or equal to, U+2265 ISOtech:
"≥" = raw bytes;
"≥" = ≥
"≥" =
"≥" =
subset of, U+2282 ISOtech:
"⊂" = raw bytes;
"⊂" = ⊂
"⊂" =
"⊂" =
superset of, U+2283 ISOtech:
"⊃" = raw bytes;
"⊃" = ⊃
"⊃" =
"⊃" =
not a subset of, U+2284 ISOamsn:
"⊄" = raw bytes;
"⊄" = ⊄
"⊄" =
"⊄" =
subset of or equal to, U+2286 ISOtech:
"⊆" = raw bytes;
"⊆" = ⊆
"⊆" =
"⊆" =
superset of or equal to, U+2287 ISOtech:
"⊇" = raw bytes;
"⊇" = ⊇
"⊇" =
"⊇" =
circled plus = direct sum, U+2295 ISOamsb:
"⊕" = raw bytes;
"⊕" = ⊕
"⊕" =
"⊕" =
circled times = vector product, U+2297 ISOamsb:
"⊗" = raw bytes;
"⊗" = ⊗
"⊗" =
"⊗" =
up tack = orthogonal to = perpendicular, U+22A5 ISOtech:
"⊥" = raw bytes;
"⊥" = ⊥
"⊥" =
"⊥" =
dot operator, U+22C5 ISOamsb:
"⋅" = raw bytes;
"⋅" = ⋅
"⋅" =
"⋅" =
left ceiling = apl upstile, U+2308 ISOamsc:
"⌈" = raw bytes;
"⌈" = ⌈
"⌈" =
"⌈" =
right ceiling, U+2309 ISOamsc:
"⌉" = raw bytes;
"⌉" = ⌉
"⌉" =
"⌉" =
left floor = apl downstile, U+230A ISOamsc:
"⌊" = raw bytes;
"⌊" = ⌊
"⌊" =
"⌊" =
right floor, U+230B ISOamsc:
"⌋" = raw bytes;
"⌋" = ⌋
"⌋" =
"⌋" =
left-pointing angle bracket = bra, U+2329 ISOtech:
"〈" = raw bytes;
"⟨" = 〈
"〈" =
"〈" =
right-pointing angle bracket = ket, U+232A ISOtech:
"〉" = raw bytes;
"⟩" = 〉
"〉" =
"〉" =
lozenge, U+25CA ISOpub:
"◊" = raw bytes;
"◊" = ◊
"◊" =
"◊" =
black spade suit, U+2660 ISOpub:
"♠" = raw bytes;
"♠" = ♠
"♠" =
"♠" =
black club suit = shamrock, U+2663 ISOpub:
"♣" = raw bytes;
"♣" = ♣
"♣" =
"♣" =
black heart suit = valentine, U+2665 ISOpub:
"♥" = raw bytes;
"♥" = ♥
"♥" =
"♥" =
black diamond suit, U+2666 ISOpub:
"♦" = raw bytes;
"♦" = ♦
"♦" =
"♦" =
%HTMLspecial; --
quotation mark = APL quote, U+0022 ISOnum:
""" = raw bytes;
""" =
""" =
""" =
ampersand, U+0026 ISOnum:
"" = raw bytes;
"" =
"" =
"" =
less-than sign, U+003C ISOnum:
"" = raw bytes;
"" =
"" =
"" =
latin capital ligature OE, U+0152 ISOlat2:
"Œ" = raw bytes;
"Œ" = Œ
"Œ" =
"Œ" =
latin small ligature oe, U+0153 ISOlat2:
"œ" = raw bytes;
"œ" = œ
"œ" =
"œ" =
latin capital letter S with caron, U+0160 ISOlat2:
"Š" = raw bytes;
"Š" = Š
"Š" =
"Š" =
latin small letter s with caron, U+0161 ISOlat2:
"š" = raw bytes;
"š" = š
"š" =
"š" =
latin capital letter Y with diaeresis, U+0178 ISOlat2:
"Ÿ" = raw bytes;
"Ÿ" = Ÿ
"Ÿ" =
"Ÿ" =
modifier letter circumflex accent, U+02C6 ISOpub:
"ˆ" = raw bytes;
"ˆ" = ˆ
"ˆ" =
"ˆ" =
small tilde, U+02DC ISOdia:
"˜" = raw bytes;
"˜" = ˜
"˜" =
"˜" =
en space, U+2002 ISOpub:
" " = raw bytes;
" " =
" " =
" " =
em space, U+2003 ISOpub:
" " = raw bytes;
" " =
" " =
" " =
thin space, U+2009 ISOpub:
" " = raw bytes;
" " =
" " =
" " =
zero width non-joiner, U+200C NEW RFC 2070:
"" = raw bytes;
"" =
"" =
"" =
zero width joiner, U+200D NEW RFC 2070:
"" = raw bytes;
"" =
"" =
"" =
left-to-right mark, U+200E NEW RFC 2070:
"" = raw bytes;
"" =
"" =
"" =
right-to-left mark, U+200F NEW RFC 2070:
"" = raw bytes;
"" =
"" =
"" =
en dash, U+2013 ISOpub:
"–" = raw bytes;
"–" = –
"–" =
"–" =
em dash, U+2014 ISOpub:
"—" = raw bytes;
"—" = —
"—" =
"—" =
left single quotation mark, U+2018 ISOnum:
"‘" = raw bytes;
"‘" = ‘
"‘" =
"‘" =
right single quotation mark, U+2019 ISOnum:
"’" = raw bytes;
"’" = ’
"’" =
"’" =
single low-9 quotation mark, U+201A NEW:
"‚" = raw bytes;
"‚" = ‚
"‚" =
"‚" =
left double quotation mark, U+201C ISOnum:
"“" = raw bytes;
"“" = “
"“" =
"“" =
right double quotation mark, U+201D ISOnum:
"”" = raw bytes;
"”" = ”
"”" =
"”" =
double low-9 quotation mark, U+201E NEW:
"„" = raw bytes;
"„" = „
"„" =
"„" =
dagger, U+2020 ISOpub:
"†" = raw bytes;
"†" = †
"†" =
"†" =
double dagger, U+2021 ISOpub:
"‡" = raw bytes;
"‡" = ‡
"‡" =
"‡" =
per mille sign, U+2030 ISOtech:
"‰" = raw bytes;
"‰" = ‰
"‰" =
"‰" =
single left-pointing angle quotation mark, U+2039 ISO proposed:
"‹" = raw bytes;
"‹" = ‹
"‹" =
"‹" =
single right-pointing angle quotation mark, U+203A ISO proposed:
"›" = raw bytes;
"›" = ›
"›" =
"›" =
euro sign, U+20AC NEW:
"€" = raw bytes;
"€" = €
"€" =
"€" =
(Score: 2) by martyb on Thursday July 03 2014, @11:28PM
!-- Portions (C) International Organization for Standardization 1986
Permission to copy in any form is granted for use with
conforming SGML systems and applications as defined in
ISO 8879, provided this notice is included in all copies.
--
!-- Character entity set. Typical invocation:
!ENTITY % HTMLlat1 PUBLIC
"-//W3C//ENTITIES Latin 1//EN//HTML"
%HTMLlat1;
--
!-- Mathematical, Greek and Symbolic characters for HTML --
!-- Character entity set. Typical invocation:
!ENTITY % HTMLsymbol PUBLIC
"-//W3C//ENTITIES Symbols//EN//HTML"
%HTMLsymbol; --
!-- Portions (C) International Organization for Standardization 1986:
Permission to copy in any form is granted for use with
conforming SGML systems and applications as defined in
ISO 8879, provided this notice is included in all copies.
--
!-- Relevant ISO entity set is given unless names are newly introduced.
New names (i.e., not in ISO 8879 list) do not clash with any
existing ISO 8879 entity names. ISO 10646 character numbers
are given for each character, in hex. CDATA values are decimal
conversions of the ISO 10646 values and refer to the document
character set. Names are ISO 10646 names.
--
!-- Latin Extended-B --
!-- Greek --
!-- there is no Sigmaf, and no U+03A2 character either --
!-- General Punctuation --
!-- bullet is NOT the same as bullet operator, U+2219 --
!-- Letterlike Symbols --
!-- alef symbol is NOT the same as hebrew letter alef,
U+05D0 although the same glyph could be used to depict both characters --
!-- Arrows --
!-- ISO 10646 does not say that lArr is the same as the 'is implied by' arrow
but also does not have any other character for that function. So ? lArr can
be used for 'is implied by' as ISOtech suggests --
!-- ISO 10646 does not say this is the 'implies' character but does not have
another character with this function so ?
rArr can be used for 'implies' as ISOtech suggests --
!-- Mathematical Operators --
!-- should there be a more memorable name than 'ni'? --
!-- prod is NOT the same character as U+03A0 'greek capital letter pi' though
the same glyph might be used for both --
!-- sum is NOT the same character as U+03A3 'greek capital letter sigma'
though the same glyph might be used for both --
!-- tilde operator is NOT the same character as the tilde, U+007E,
although the same glyph might be used to represent both --
!-- note that nsup, 'not a superset of, U+2283' is not covered by the Symbol
font encoding and is not included. Should it be, for symmetry?
It is in ISOamsn --
!-- dot operator is NOT the same character as U+00B7 middle dot --
!-- Miscellaneous Technical --
!-- lang is NOT the same character as U+003C 'less than'
or U+2039 'single left-pointing angle quotation mark' --
!-- rang is NOT the same character as U+003E 'greater than'
or U+203A 'single right-pointing angle quotation mark' --
!-- Geometric Shapes --
!-- Miscellaneous Symbols --
!-- black here seems to mean filled as opposed to hollow --
!-- Special characters for HTML --
!-- Character entity set. Typical invocation:
!ENTITY % HTMLspecial PUBLIC
"-//W3C//ENTITIES Special//EN//HTML"
%HTMLspecial; --
!-- Portions (C) International Organization for Standardization 1986:
Permission to copy in any form is granted for use with
conforming SGML systems and applications as defined in
ISO 8879, provided this notice is included in all copies.
--
!-- Relevant ISO entity set is given unless names are newly introduced.
New names (i.e., not in ISO 8879 list) do not clash with any
existing ISO 8879 entity names. ISO 10646 character numbers
are given for each character, in hex. CDATA values are decimal
conversions of the ISO 10646 values and refer to the document
character set. Names are ISO 10646 names.
--
!-- C0 Controls and Basic Latin --
!-- Latin Extended-A --
!-- ligature is a misnomer, this is a separate character in some languages --
!-- Spacing Modifier Letters --
!-- General Punctuation --
!-- lsaquo is proposed but not yet ISO standardized --
!-- rsaquo is proposed but not yet ISO standardized --
(Score: 2) by martyb on Friday July 04 2014, @09:53AM
Summary:
Tests all named character entities defined in HTML 4.
Each test point contains:
Documents Referenced:
Tests:
!-- Portions (C) International Organization for Standardization 1986
Permission to copy in any form is granted for use with
conforming SGML systems and applications as defined in
ISO 8879, provided this notice is included in all copies.
--
!-- Character entity set. Typical invocation:
!ENTITY % HTMLlat1 PUBLIC
"-//W3C//ENTITIES Latin 1//EN//HTML"
%HTMLlat1;
--
!-- Mathematical, Greek and Symbolic characters for HTML --
!-- Character entity set. Typical invocation:
!ENTITY % HTMLsymbol PUBLIC
"-//W3C//ENTITIES Symbols//EN//HTML"
%HTMLsymbol; --
!-- Portions (C) International Organization for Standardization 1986:
Permission to copy in any form is granted for use with
conforming SGML systems and applications as defined in
ISO 8879, provided this notice is included in all copies.
--
!-- Relevant ISO entity set is given unless names are newly introduced.
New names (i.e., not in ISO 8879 list) do not clash with any
existing ISO 8879 entity names. ISO 10646 character numbers
are given for each character, in hex. CDATA values are decimal
conversions of the ISO 10646 values and refer to the document
character set. Names are ISO 10646 names.
--
!-- Latin Extended-B --
!-- Greek --
!-- there is no Sigmaf, and no U+03A2 character either --
!-- General Punctuation --
!-- bullet is NOT the same as bullet operator, U+2219 --
!-- Letterlike Symbols --
!-- alef symbol is NOT the same as hebrew letter alef,
U+05D0 although the same glyph could be used to depict both characters --
!-- Arrows --
!-- ISO 10646 does not say that lArr is the same as the 'is implied by' arrow
but also does not have any other character for that function. So ? lArr can
be used for 'is implied by' as ISOtech suggests --
!-- ISO 10646 does not say this is the 'implies' character but does not have
another character with this function so ?
rArr can be used for 'implies' as ISOtech suggests --
!-- Mathematical Operators --
!-- should there be a more memorable name than 'ni'? --
!-- prod is NOT the same character as U+03A0 'greek capital letter pi' though
the same glyph might be used for both --
!-- sum is NOT the same character as U+03A3 'greek capital letter sigma'
though the same glyph might be used for both --
!-- tilde operator is NOT the same character as the tilde, U+007E,
although the same glyph might be used to represent both --
!-- note that nsup, 'not a superset of, U+2283' is not covered by the Symbol
font encoding and is not included. Should it be, for symmetry?
It is in ISOamsn --
!-- dot operator is NOT the same character as U+00B7 middle dot --
!-- Miscellaneous Technical --
!-- lang is NOT the same character as U+003C 'less than'
or U+2039 'single left-pointing angle quotation mark' --
!-- rang is NOT the same character as U+003E 'greater than'
or U+203A 'single right-pointing angle quotation mark' --
!-- Geometric Shapes --
!-- Miscellaneous Symbols --
!-- black here seems to mean filled as opposed to hollow --
!-- Special characters for HTML --
!-- Character entity set. Typical invocation:
!ENTITY % HTMLspecial PUBLIC
"-//W3C//ENTITIES Special//EN//HTML"
%HTMLspecial; --
!-- Portions (C) International Organization for Standardization 1986:
Permission to copy in any form is granted for use with
conforming SGML systems and applications as defined in
ISO 8879, provided this notice is included in all copies.
--
!-- Relevant ISO entity set is given unless names are newly introduced.
New names (i.e., not in ISO 8879 list) do not clash with any
existing ISO 8879 entity names. ISO 10646 character numbers
are given for each character, in hex. CDATA values are decimal
conversions of the ISO 10646 values and refer to the document
character set. Names are ISO 10646 names.
--
!-- C0 Controls and Basic Latin --
!-- Latin Extended-A --
!-- ligature is a misnomer, this is a separate character in some languages --
!-- Spacing Modifier Letters --
!-- General Punctuation --
!-- lsaquo is proposed but not yet ISO standardized --
!-- rsaquo is proposed but not yet ISO standardized --
(Score: 2) by The Mighty Buzzard on Friday July 04 2014, @10:19AM
no-break space = non-breaking space, U+00A0 ISOnum:
" " = UTF-8 encoded character
" " =
" " =  
" " =  
inverted exclamation mark, U+00A1 ISOnum:
"¡" = UTF-8 encoded character
"¡" = ¡
"¡" = ¡
"¡" = ¡
cent sign, U+00A2 ISOnum:
"¢" = UTF-8 encoded character
"¢" = ¢
"¢" = ¢
"¢" = ¢
pound sign, U+00A3 ISOnum:
"£" = UTF-8 encoded character
"£" = £
"£" = £
"£" = £
currency sign, U+00A4 ISOnum:
"¤" = UTF-8 encoded character
"¤" = ¤
"¤" = ¤
"¤" = ¤
yen sign = yuan sign, U+00A5 ISOnum:
"¥" = UTF-8 encoded character
"¥" = ¥
"¥" = ¥
"¥" = ¥
broken bar = broken vertical bar, U+00A6 ISOnum:
"¦" = UTF-8 encoded character
"¦" = ¦
"¦" = ¦
"¦" = ¦
section sign, U+00A7 ISOnum:
"§" = UTF-8 encoded character
"§" = §
"§" = §
"§" = §
diaeresis = spacing diaeresis, U+00A8 ISOdia:
"¨" = UTF-8 encoded character
"¨" = ¨
"¨" = ¨
"¨" = ¨
copyright sign, U+00A9 ISOnum:
"©" = UTF-8 encoded character
"©" = ©
"©" = ©
"©" = ©
feminine ordinal indicator, U+00AA ISOnum:
"ª" = UTF-8 encoded character
"ª" = ª
"ª" = ª
"ª" = ª
left-pointing double angle quotation mark = left pointing guillemet, U+00AB ISOnum:
"«" = UTF-8 encoded character
"«" = «
"«" = «
"«" = «
not sign, U+00AC ISOnum:
"¬" = UTF-8 encoded character
"¬" = ¬
"¬" = ¬
"¬" = ¬
soft hyphen = discretionary hyphen, U+00AD ISOnum:
"" = UTF-8 encoded character
"" = ­
"" = ­
"" = ­
registered sign = registered trade mark sign, U+00AE ISOnum:
"®" = UTF-8 encoded character
"®" = ®
"®" = ®
"®" = ®
macron = spacing macron = overline = APL overbar, U+00AF ISOdia:
"¯" = UTF-8 encoded character
"¯" = ¯
"¯" = ¯
"¯" = ¯
degree sign, U+00B0 ISOnum:
"°" = UTF-8 encoded character
"°" = °
"°" = °
"°" = °
plus-minus sign = plus-or-minus sign, U+00B1 ISOnum:
"±" = UTF-8 encoded character
"±" = ±
"±" = ±
"±" = ±
superscript two = superscript digit two = squared, U+00B2 ISOnum:
"²" = UTF-8 encoded character
"²" = ²
"²" = ²
"²" = ²
superscript three = superscript digit three = cubed, U+00B3 ISOnum:
"³" = UTF-8 encoded character
"³" = ³
"³" = ³
"³" = ³
acute accent = spacing acute, U+00B4 ISOdia:
"´" = UTF-8 encoded character
"´" = ´
"´" = ´
"´" = ´
micro sign, U+00B5 ISOnum:
"µ" = UTF-8 encoded character
"µ" = µ
"µ" = µ
"µ" = µ
pilcrow sign = paragraph sign, U+00B6 ISOnum:
"¶" = UTF-8 encoded character
"¶" = ¶
"¶" = ¶
"¶" = ¶
middle dot = Georgian comma = Greek middle dot, U+00B7 ISOnum:
"·" = UTF-8 encoded character
"·" = ·
"·" = ·
"·" = ·
cedilla = spacing cedilla, U+00B8 ISOdia:
"¸" = UTF-8 encoded character
"¸" = ¸
"¸" = ¸
"¸" = ¸
superscript one = superscript digit one, U+00B9 ISOnum:
"¹" = UTF-8 encoded character
"¹" = ¹
"¹" = ¹
"¹" = ¹
masculine ordinal indicator, U+00BA ISOnum:
"º" = UTF-8 encoded character
"º" = º
"º" = º
"º" = º
right-pointing double angle quotation mark = right pointing guillemet, U+00BB ISOnum:
"»" = UTF-8 encoded character
"»" = »
"»" = »
"»" = »
vulgar fraction one quarter = fraction one quarter, U+00BC ISOnum:
"¼" = UTF-8 encoded character
"¼" = ¼
"¼" = ¼
"¼" = ¼
vulgar fraction one half = fraction one half, U+00BD ISOnum:
"½" = UTF-8 encoded character
"½" = ½
"½" = ½
"½" = ½
vulgar fraction three quarters = fraction three quarters, U+00BE ISOnum:
"¾" = UTF-8 encoded character
"¾" = ¾
"¾" = ¾
"¾" = ¾
inverted question mark = turned question mark, U+00BF ISOnum:
"¿" = UTF-8 encoded character
"¿" = ¿
"¿" = ¿
"¿" = ¿
latin capital letter A with grave = latin capital letter A grave, U+00C0 ISOlat1:
"À" = UTF-8 encoded character
"À" = À
"À" = À
"À" = À
latin capital letter A with acute, U+00C1 ISOlat1:
"Á" = UTF-8 encoded character
"Á" = Á
"Á" = Á
"Á" = Á
latin capital letter A with circumflex, U+00C2 ISOlat1:
"Â" = UTF-8 encoded character
"Â" = Â
"Â" = Â
"Â" = Â
latin capital letter A with tilde, U+00C3 ISOlat1:
"Ã" = UTF-8 encoded character
"Ã" = Ã
"Ã" = Ã
"Ã" = Ã
latin capital letter A with diaeresis, U+00C4 ISOlat1:
"Ä" = UTF-8 encoded character
"Ä" = Ä
"Ä" = Ä
"Ä" = Ä
latin capital letter A with ring above = latin capital letter A ring, U+00C5 ISOlat1:
"Å" = UTF-8 encoded character
"Å" = Å
"Å" = Å
"Å" = Å
latin capital letter AE = latin capital ligature AE, U+00C6 ISOlat1:
"Æ" = UTF-8 encoded character
"Æ" = Æ
"Æ" = Æ
"Æ" = Æ
latin capital letter C with cedilla, U+00C7 ISOlat1:
"Ç" = UTF-8 encoded character
"Ç" = Ç
"Ç" = Ç
"Ç" = Ç
latin capital letter E with grave, U+00C8 ISOlat1:
"È" = UTF-8 encoded character
"È" = È
"È" = È
"È" = È
latin capital letter E with acute, U+00C9 ISOlat1:
"É" = UTF-8 encoded character
"É" = É
"É" = É
"É" = É
latin capital letter E with circumflex, U+00CA ISOlat1:
"Ê" = UTF-8 encoded character
"Ê" = Ê
"Ê" = Ê
"Ê" = Ê
latin capital letter E with diaeresis, U+00CB ISOlat1:
"Ë" = UTF-8 encoded character
"Ë" = Ë
"Ë" = Ë
"Ë" = Ë
latin capital letter I with grave, U+00CC ISOlat1:
"Ì" = UTF-8 encoded character
"Ì" = Ì
"Ì" = Ì
"Ì" = Ì
latin capital letter I with acute, U+00CD ISOlat1:
"Í" = UTF-8 encoded character
"Í" = Í
"Í" = Í
"Í" = Í
latin capital letter I with circumflex, U+00CE ISOlat1:
"Î" = UTF-8 encoded character
"Î" = Î
"Î" = Î
"Î" = Î
latin capital letter I with diaeresis, U+00CF ISOlat1:
"Ï" = UTF-8 encoded character
"Ï" = Ï
"Ï" = Ï
"Ï" = Ï
latin capital letter ETH, U+00D0 ISOlat1:
"Ð" = UTF-8 encoded character
"Ð" = Ð
"Ð" = Ð
"Ð" = Ð
latin capital letter N with tilde, U+00D1 ISOlat1:
"Ñ" = UTF-8 encoded character
"Ñ" = Ñ
"Ñ" = Ñ
"Ñ" = Ñ
latin capital letter O with grave, U+00D2 ISOlat1:
"Ò" = UTF-8 encoded character
"Ò" = Ò
"Ò" = Ò
"Ò" = Ò
latin capital letter O with acute, U+00D3 ISOlat1:
"Ó" = UTF-8 encoded character
"Ó" = Ó
"Ó" = Ó
"Ó" = Ó
latin capital letter O with circumflex, U+00D4 ISOlat1:
"Ô" = UTF-8 encoded character
"Ô" = Ô
"Ô" = Ô
"Ô" = Ô
latin capital letter O with tilde, U+00D5 ISOlat1:
"Õ" = UTF-8 encoded character
"Õ" = Õ
"Õ" = Õ
"Õ" = Õ
latin capital letter O with diaeresis, U+00D6 ISOlat1:
"Ö" = UTF-8 encoded character
"Ö" = Ö
"Ö" = Ö
"Ö" = Ö
multiplication sign, U+00D7 ISOnum:
"×" = UTF-8 encoded character
"×" = ×
"×" = ×
"×" = ×
latin capital letter O with stroke = latin capital letter O slash, U+00D8 ISOlat1:
"Ø" = UTF-8 encoded character
"Ø" = Ø
"Ø" = Ø
"Ø" = Ø
latin capital letter U with grave, U+00D9 ISOlat1:
"Ù" = UTF-8 encoded character
"Ù" = Ù
"Ù" = Ù
"Ù" = Ù
latin capital letter U with acute, U+00DA ISOlat1:
"Ú" = UTF-8 encoded character
"Ú" = Ú
"Ú" = Ú
"Ú" = Ú
latin capital letter U with circumflex, U+00DB ISOlat1:
"Û" = UTF-8 encoded character
"Û" = Û
"Û" = Û
"Û" = Û
latin capital letter U with diaeresis, U+00DC ISOlat1:
"Ü" = UTF-8 encoded character
"Ü" = Ü
"Ü" = Ü
"Ü" = Ü
latin capital letter Y with acute, U+00DD ISOlat1:
"Ý" = UTF-8 encoded character
"Ý" = Ý
"Ý" = Ý
"Ý" = Ý
latin capital letter THORN, U+00DE ISOlat1:
"Þ" = UTF-8 encoded character
"Þ" = Þ
"Þ" = Þ
"Þ" = Þ
latin small letter sharp s = ess-zed, U+00DF ISOlat1:
"ß" = UTF-8 encoded character
"ß" = ß
"ß" = ß
"ß" = ß
latin small letter a with grave = latin small letter a grave, U+00E0 ISOlat1:
"à" = UTF-8 encoded character
"à" = à
"à" = à
"à" = à
latin small letter a with acute, U+00E1 ISOlat1:
"á" = UTF-8 encoded character
"á" = á
"á" = á
"á" = á
latin small letter a with circumflex, U+00E2 ISOlat1:
"â" = UTF-8 encoded character
"â" = â
"â" = â
"â" = â
latin small letter a with tilde, U+00E3 ISOlat1:
"ã" = UTF-8 encoded character
"ã" = ã
"ã" = ã
"ã" = ã
latin small letter a with diaeresis, U+00E4 ISOlat1:
"ä" = UTF-8 encoded character
"ä" = ä
"ä" = ä
"ä" = ä
latin small letter a with ring above = latin small letter a ring, U+00E5 ISOlat1:
"å" = UTF-8 encoded character
"å" = å
"å" = å
"å" = å
latin small letter ae = latin small ligature ae, U+00E6 ISOlat1:
"æ" = UTF-8 encoded character
"æ" = æ
"æ" = æ
"æ" = æ
latin small letter c with cedilla, U+00E7 ISOlat1:
"ç" = UTF-8 encoded character
"ç" = ç
"ç" = ç
"ç" = ç
latin small letter e with grave, U+00E8 ISOlat1:
"è" = UTF-8 encoded character
"è" = è
"è" = è
"è" = è
latin small letter e with acute, U+00E9 ISOlat1:
"é" = UTF-8 encoded character
"é" = é
"é" = é
"é" = é
latin small letter e with circumflex, U+00EA ISOlat1:
"ê" = UTF-8 encoded character
"ê" = ê
"ê" = ê
"ê" = ê
latin small letter e with diaeresis, U+00EB ISOlat1:
"ë" = UTF-8 encoded character
"ë" = ë
"ë" = ë
"ë" = ë
latin small letter i with grave, U+00EC ISOlat1:
"ì" = UTF-8 encoded character
"ì" = ì
"ì" = ì
"ì" = ì
latin small letter i with acute, U+00ED ISOlat1:
"í" = UTF-8 encoded character
"í" = í
"í" = í
"í" = í
latin small letter i with circumflex, U+00EE ISOlat1:
"î" = UTF-8 encoded character
"î" = î
"î" = î
"î" = î
latin small letter i with diaeresis, U+00EF ISOlat1:
"ï" = UTF-8 encoded character
"ï" = ï
"ï" = ï
"ï" = ï
latin small letter eth, U+00F0 ISOlat1:
"ð" = UTF-8 encoded character
"ð" = ð
"ð" = ð
"ð" = ð
latin small letter n with tilde, U+00F1 ISOlat1:
"ñ" = UTF-8 encoded character
"ñ" = ñ
"ñ" = ñ
"ñ" = ñ
latin small letter o with grave, U+00F2 ISOlat1:
"ò" = UTF-8 encoded character
"ò" = ò
"ò" = ò
"ò" = ò
latin small letter o with acute, U+00F3 ISOlat1:
"ó" = UTF-8 encoded character
"ó" = ó
"ó" = ó
"ó" = ó
latin small letter o with circumflex, U+00F4 ISOlat1:
"ô" = UTF-8 encoded character
"ô" = ô
"ô" = ô
"ô" = ô
latin small letter o with tilde, U+00F5 ISOlat1:
"õ" = UTF-8 encoded character
"õ" = õ
"õ" = õ
"õ" = õ
latin small letter o with diaeresis, U+00F6 ISOlat1:
"ö" = UTF-8 encoded character
"ö" = ö
"ö" = ö
"ö" = ö
division sign, U+00F7 ISOnum:
"÷" = UTF-8 encoded character
"÷" = ÷
"÷" = ÷
"÷" = ÷
latin small letter o with stroke, = latin small letter o slash, U+00F8 ISOlat1:
"ø" = UTF-8 encoded character
"ø" = ø
"ø" = ø
"ø" = ø
latin small letter u with grave, U+00F9 ISOlat1:
"ù" = UTF-8 encoded character
"ù" = ù
"ù" = ù
"ù" = ù
latin small letter u with acute, U+00FA ISOlat1:
"ú" = UTF-8 encoded character
"ú" = ú
"ú" = ú
"ú" = ú
latin small letter u with circumflex, U+00FB ISOlat1:
"û" = UTF-8 encoded character
"û" = û
"û" = û
"û" = û
latin small letter u with diaeresis, U+00FC ISOlat1:
"ü" = UTF-8 encoded character
"ü" = ü
"ü" = ü
"ü" = ü
latin small letter y with acute, U+00FD ISOlat1:
"ý" = UTF-8 encoded character
"ý" = ý
"ý" = ý
"ý" = ý
latin small letter thorn, U+00FE ISOlat1:
"þ" = UTF-8 encoded character
"þ" = þ
"þ" = þ
"þ" = þ
latin small letter y with diaeresis, U+00FF ISOlat1:
"ÿ" = UTF-8 encoded character
"ÿ" = ÿ
"ÿ" = ÿ
"ÿ" = ÿ
<!-- Mathematical, Greek and Symbolic characters for HTML -->
<!-- Character entity set. Typical invocation:
<!ENTITY % HTMLsymbol PUBLIC
"-//W3C//ENTITIES Symbols//EN//HTML">
%HTMLsymbol; -->
<!-- Portions © International Organization for Standardization 1986:
Permission to copy in any form is granted for use with
conforming SGML systems and applications as defined in
ISO 8879, provided this notice is included in all copies.
-->
<!-- Relevant ISO entity set is given unless names are newly introduced.
New names (i.e., not in ISO 8879 list) do not clash with any
existing ISO 8879 entity names. ISO 10646 character numbers
are given for each character, in hex. CDATA values are decimal
conversions of the ISO 10646 values and refer to the document
character set. Names are ISO 10646 names.
-->
<!-- Latin Extended-B -->
latin small f with hook = function = florin, U+0192 ISOtech:
"ƒ" = UTF-8 encoded character
"ƒ" = ƒ
"ƒ" = ƒ
"ƒ" = ƒ
<!-- Greek -->
greek capital letter alpha, U+0391:
"Α" = UTF-8 encoded character
"Α" = Α
"Α" = Α
"Α" = Α
greek capital letter beta, U+0392:
"Β" = UTF-8 encoded character
"Β" = Β
"Β" = Β
"Β" = Β
greek capital letter gamma, U+0393 ISOgrk3:
"Γ" = UTF-8 encoded character
"Γ" = Γ
"Γ" = Γ
"Γ" = Γ
greek capital letter delta, U+0394 ISOgrk3:
"Δ" = UTF-8 encoded character
"Δ" = Δ
"Δ" = Δ
"Δ" = Δ
greek capital letter epsilon, U+0395:
"Ε" = UTF-8 encoded character
"Ε" = Ε
"Ε" = Ε
"Ε" = Ε
greek capital letter zeta, U+0396:
"Ζ" = UTF-8 encoded character
"Ζ" = Ζ
"Ζ" = Ζ
"Ζ" = Ζ
greek capital letter eta, U+0397:
"Η" = UTF-8 encoded character
"Η" = Η
"Η" = Η
"Η" = Η
greek capital letter theta, U+0398 ISOgrk3:
"Θ" = UTF-8 encoded character
"Θ" = Θ
"Θ" = Θ
"Θ" = Θ
greek capital letter iota, U+0399:
"Ι" = UTF-8 encoded character
"Ι" = Ι
"Ι" = Ι
"Ι" = Ι
greek capital letter kappa, U+039A:
"Κ" = UTF-8 encoded character
"Κ" = Κ
"Κ" = Κ
"Κ" = Κ
greek capital letter lambda, U+039B ISOgrk3:
"Λ" = UTF-8 encoded character
"Λ" = Λ
"Λ" = Λ
"Λ" = Λ
greek capital letter mu, U+039C:
"Μ" = UTF-8 encoded character
"Μ" = Μ
"Μ" = Μ
"Μ" = Μ
greek capital letter nu, U+039D:
"Ν" = UTF-8 encoded character
"Ν" = Ν
"Ν" = Ν
"Ν" = Ν
greek capital letter xi, U+039E ISOgrk3:
"Ξ" = UTF-8 encoded character
"Ξ" = Ξ
"Ξ" = Ξ
"Ξ" = Ξ
greek capital letter omicron, U+039F:
"Ο" = UTF-8 encoded character
"Ο" = Ο
"Ο" = Ο
"Ο" = Ο
greek capital letter pi, U+03A0 ISOgrk3:
"Π" = UTF-8 encoded character
"Π" = Π
"Π" = Π
"Π" = Π
greek capital letter rho, U+03A1:
"Ρ" = UTF-8 encoded character
"Ρ" = Ρ
"Ρ" = Ρ
"Ρ" = Ρ
<!-- there is no Sigmaf, and no U+03A2 character either -->
greek capital letter sigma, U+03A3 ISOgrk3:
"Σ" = UTF-8 encoded character
"Σ" = Σ
"Σ" = Σ
"Σ" = Σ
greek capital letter tau, U+03A4:
"Τ" = UTF-8 encoded character
"Τ" = Τ
"Τ" = Τ
"Τ" = Τ
greek capital letter upsilon, U+03A5 ISOgrk3:
"Υ" = UTF-8 encoded character
"Υ" = Υ
"Υ" = Υ
"Υ" = Υ
greek capital letter phi, U+03A6 ISOgrk3:
"Φ" = UTF-8 encoded character
"Φ" = Φ
"Φ" = Φ
"Φ" = Φ
greek capital letter chi, U+03A7:
"Χ" = UTF-8 encoded character
"Χ" = Χ
"Χ" = Χ
"Χ" = Χ
greek capital letter psi, U+03A8 ISOgrk3:
"Ψ" = UTF-8 encoded character
"Ψ" = Ψ
"Ψ" = Ψ
"Ψ" = Ψ
greek capital letter omega, U+03A9 ISOgrk3:
"Ω" = UTF-8 encoded character
"Ω" = Ω
"Ω" = Ω
"Ω" = Ω
greek small letter alpha, U+03B1 ISOgrk3:
"α" = UTF-8 encoded character
"α" = α
"α" = α
"α" = α
greek small letter beta, U+03B2 ISOgrk3:
"β" = UTF-8 encoded character
"β" = β
"β" = β
"β" = β
greek small letter gamma, U+03B3 ISOgrk3:
"γ" = UTF-8 encoded character
"γ" = γ
"γ" = γ
"γ" = γ
greek small letter delta, U+03B4 ISOgrk3:
"δ" = UTF-8 encoded character
"δ" = δ
"δ" = δ
"δ" = δ
greek small letter epsilon, U+03B5 ISOgrk3:
"ε" = UTF-8 encoded character
"ε" = ε
"ε" = ε
"ε" = ε
greek small letter zeta, U+03B6 ISOgrk3:
"ζ" = UTF-8 encoded character
"ζ" = ζ
"ζ" = ζ
"ζ" = ζ
greek small letter eta, U+03B7 ISOgrk3:
"η" = UTF-8 encoded character
"η" = η
"η" = η
"η" = η
greek small letter theta, U+03B8 ISOgrk3:
"θ" = UTF-8 encoded character
"θ" = θ
"θ" = θ
"θ" = θ
greek small letter iota, U+03B9 ISOgrk3:
"ι" = UTF-8 encoded character
"ι" = ι
"ι" = ι
"ι" = ι
greek small letter kappa, U+03BA ISOgrk3:
"κ" = UTF-8 encoded character
"κ" = κ
"κ" = κ
"κ" = κ
greek small letter lambda, U+03BB ISOgrk3:
"λ" = UTF-8 encoded character
"λ" = λ
"λ" = λ
"λ" = λ
greek small letter mu, U+03BC ISOgrk3:
"μ" = UTF-8 encoded character
"μ" = μ
"μ" = μ
"μ" = μ
greek small letter nu, U+03BD ISOgrk3:
"ν" = UTF-8 encoded character
"ν" = ν
"ν" = ν
"ν" = ν
greek small letter xi, U+03BE ISOgrk3:
"ξ" = UTF-8 encoded character
"ξ" = ξ
"ξ" = ξ
"ξ" = ξ
greek small letter omicron, U+03BF NEW:
"ο" = UTF-8 encoded character
"ο" = ο
"ο" = ο
"ο" = ο
greek small letter pi, U+03C0 ISOgrk3:
"π" = UTF-8 encoded character
"π" = π
"π" = π
"π" = π
greek small letter rho, U+03C1 ISOgrk3:
"ρ" = UTF-8 encoded character
"ρ" = ρ
"ρ" = ρ
"ρ" = ρ
greek small letter final sigma, U+03C2 ISOgrk3:
"ς" = UTF-8 encoded character
"ς" = ς
"ς" = ς
"ς" = ς
greek small letter sigma, U+03C3 ISOgrk3:
"σ" = UTF-8 encoded character
"σ" = σ
"σ" = σ
"σ" = σ
greek small letter tau, U+03C4 ISOgrk3:
"τ" = UTF-8 encoded character
"τ" = τ
"τ" = τ
"τ" = τ
greek small letter upsilon, U+03C5 ISOgrk3:
"υ" = UTF-8 encoded character
"υ" = υ
"υ" = υ
"υ" = υ
greek small letter phi, U+03C6 ISOgrk3:
"φ" = UTF-8 encoded character
"φ" = φ
"φ" = φ
"φ" = φ
greek small letter chi, U+03C7 ISOgrk3:
"χ" = UTF-8 encoded character
"χ" = χ
"χ" = χ
"χ" = χ
greek small letter psi, U+03C8 ISOgrk3:
"ψ" = UTF-8 encoded character
"ψ" = ψ
"ψ" = ψ
"ψ" = ψ
greek small letter omega, U+03C9 ISOgrk3:
"ω" = UTF-8 encoded character
"ω" = ω
"ω" = ω
"ω" = ω
greek small letter theta symbol, U+03D1 NEW:
"ϑ" = UTF-8 encoded character
"ϑ" = ϑ
"ϑ" = ϑ
"ϑ" = ϑ
greek upsilon with hook symbol, U+03D2 NEW:
"ϒ" = UTF-8 encoded character
"ϒ" = ϒ
"ϒ" = ϒ
"ϒ" = ϒ
greek pi symbol, U+03D6 ISOgrk3:
"ϖ" = UTF-8 encoded character
"ϖ" = ϖ
"ϖ" = ϖ
"ϖ" = ϖ
<!-- General Punctuation -->
bullet = black small circle, U+2022 ISOpub:
"•" = UTF-8 encoded character
"•" = •
"•" = •
"•" = •
<!-- bullet is NOT the same as bullet operator, U+2219 -->
horizontal ellipsis = three dot leader, U+2026 ISOpub:
"…" = UTF-8 encoded character
"…" = …
"…" = …
"…" = …
prime = minutes = feet, U+2032 ISOtech:
"′" = UTF-8 encoded character
"′" = ′
"′" = ′
"′" = ′
double prime = seconds = inches, U+2033 ISOtech:
"″" = UTF-8 encoded character
"″" = ″
"″" = ″
"″" = ″
overline = spacing overscore, U+203E NEW:
"‾" = UTF-8 encoded character
"‾" = ‾
"‾" = ‾
"‾" = ‾
fraction slash, U+2044 NEW:
"⁄" = UTF-8 encoded character
"⁄" = ⁄
"⁄" = ⁄
"⁄" = ⁄
<!-- Letterlike Symbols -->
script capital P = power set = Weierstrass p, U+2118 ISOamso:
"℘" = UTF-8 encoded character
"℘" = ℘
"℘" = ℘
"℘" = ℘
blackletter capital I = imaginary part, U+2111 ISOamso:
"ℑ" = UTF-8 encoded character
"ℑ" = ℑ
"ℑ" = ℑ
"ℑ" = ℑ
blackletter capital R = real part symbol, U+211C ISOamso:
"ℜ" = UTF-8 encoded character
"ℜ" = ℜ
"ℜ" = ℜ
"ℜ" = ℜ
trade mark sign, U+2122 ISOnum:
"™" = UTF-8 encoded character
"™" = ™
"™" = ™
"™" = ™
alef symbol = first transfinite cardinal, U+2135 NEW:
"ℵ" = UTF-8 encoded character
"ℵ" = ℵ
"ℵ" = ℵ
"ℵ" = ℵ
<!-- alef symbol is NOT the same as hebrew letter alef,
U+05D0 although the same glyph could be used to depict both characters -->
<!-- Arrows -->
leftwards arrow, U+2190 ISOnum:
"←" = UTF-8 encoded character
"←" = ←
"←" = ←
"←" = ←
upwards arrow, U+2191:
"↑" = UTF-8 encoded character
"↑" = ↑
"↑" = ↑
"↑" = ↑
rightwards arrow, U+2192 ISOnum:
"→" = UTF-8 encoded character
"→" = →
"→" = →
"→" = →
downwards arrow, U+2193 ISOnum:
"↓" = UTF-8 encoded character
"↓" = ↓
"↓" = ↓
"↓" = ↓
left right arrow, U+2194 ISOamsa:
"↔" = UTF-8 encoded character
"↔" = ↔
"↔" = ↔
"↔" = ↔
downwards arrow with corner leftwards = carriage return, U+21B5 NEW:
"↵" = UTF-8 encoded character
"↵" = ↵
"↵" = ↵
"↵" = ↵
leftwards double arrow, U+21D0 ISOtech:
"⇐" = UTF-8 encoded character
"⇐" = ⇐
"⇐" = ⇐
"⇐" = ⇐
<!-- ISO 10646 does not say that lArr is the same as the 'is implied by' arrow
but also does not have any other character for that function. So ? lArr can
be used for 'is implied by' as ISOtech suggests -->
upwards double arrow, U+21D1 ISOamsa:
"⇑" = UTF-8 encoded character
"⇑" = ⇑
"⇑" = ⇑
"⇑" = ⇑
rightwards double arrow, U+21D2 ISOtech:
"⇒" = UTF-8 encoded character
"⇒" = ⇒
"⇒" = ⇒
"⇒" = ⇒
<!-- ISO 10646 does not say this is the 'implies' character but does not have
another character with this function so ?
rArr can be used for 'implies' as ISOtech suggests -->
downwards double arrow, U+21D3 ISOamsa:
"⇓" = UTF-8 encoded character
"⇓" = ⇓
"⇓" = ⇓
"⇓" = ⇓
left right double arrow, U+21D4 ISOamsa:
"⇔" = UTF-8 encoded character
"⇔" = ⇔
"⇔" = ⇔
"⇔" = ⇔
<!-- Mathematical Operators -->
for all, U+2200 ISOtech:
"∀" = UTF-8 encoded character
"∀" = ∀
"∀" = ∀
"∀" = ∀
partial differential, U+2202 ISOtech:
"∂" = UTF-8 encoded character
"∂" = ∂
"∂" = ∂
"∂" = ∂
there exists, U+2203 ISOtech:
"∃" = UTF-8 encoded character
"∃" = ∃
"∃" = ∃
"∃" = ∃
empty set = null set = diameter, U+2205 ISOamso:
"∅" = UTF-8 encoded character
"∅" = ∅
"∅" = ∅
"∅" = ∅
nabla = backward difference, U+2207 ISOtech:
"∇" = UTF-8 encoded character
"∇" = ∇
"∇" = ∇
"∇" = ∇
element of, U+2208 ISOtech:
"∈" = UTF-8 encoded character
"∈" = ∈
"∈" = ∈
"∈" = ∈
not an element of, U+2209 ISOtech:
"∉" = UTF-8 encoded character
"∉" = ∉
"∉" = ∉
"∉" = ∉
contains as member, U+220B ISOtech:
"∋" = UTF-8 encoded character
"∋" = ∋
"∋" = ∋
"∋" = ∋
<!-- should there be a more memorable name than 'ni'? -->
n-ary product = product sign, U+220F ISOamsb:
"∏" = UTF-8 encoded character
"∏" = ∏
"∏" = ∏
"∏" = ∏
<!-- prod is NOT the same character as U+03A0 'greek capital letter pi' though
the same glyph might be used for both -->
n-ary sumation, U+2211 ISOamsb:
"∑" = UTF-8 encoded character
"∑" = ∑
"∑" = ∑
"∑" = ∑
<!-- sum is NOT the same character as U+03A3 'greek capital letter sigma'
though the same glyph might be used for both -->
minus sign, U+2212 ISOtech:
"−" = UTF-8 encoded character
"−" = −
"−" = −
"−" = −
asterisk operator, U+2217 ISOtech:
"∗" = UTF-8 encoded character
"∗" = ∗
"∗" = ∗
"∗" = ∗
square root = radical sign, U+221A ISOtech:
"√" = UTF-8 encoded character
"√" = √
"√" = √
"√" = √
proportional to, U+221D ISOtech:
"∝" = UTF-8 encoded character
"∝" = ∝
"∝" = ∝
"∝" = ∝
infinity, U+221E ISOtech:
"∞" = UTF-8 encoded character
"∞" = ∞
"∞" = ∞
"∞" = ∞
angle, U+2220 ISOamso:
"∠" = UTF-8 encoded character
"∠" = ∠
"∠" = ∠
"∠" = ∠
logical and = wedge, U+2227 ISOtech:
"∧" = UTF-8 encoded character
"∧" = ∧
"∧" = ∧
"∧" = ∧
logical or = vee, U+2228 ISOtech:
"∨" = UTF-8 encoded character
"∨" = ∨
"∨" = ∨
"∨" = ∨
intersection = cap, U+2229 ISOtech:
"∩" = UTF-8 encoded character
"∩" = ∩
"∩" = ∩
"∩" = ∩
union = cup, U+222A ISOtech:
"∪" = UTF-8 encoded character
"∪" = ∪
"∪" = ∪
"∪" = ∪
integral, U+222B ISOtech:
"∫" = UTF-8 encoded character
"∫" = ∫
"∫" = ∫
"∫" = ∫
therefore, U+2234 ISOtech:
"∴" = UTF-8 encoded character
"∴" = ∴
"∴" = ∴
"∴" = ∴
tilde operator = varies with = similar to, U+223C ISOtech:
"∼" = UTF-8 encoded character
"∼" = ∼
"∼" = ∼
"∼" = ∼
<!-- tilde operator is NOT the same character as the tilde, U+007E,
although the same glyph might be used to represent both -->
approximately equal to, U+2245 ISOtech:
"≅" = UTF-8 encoded character
"≅" = ≅
"≅" = ≅
"≅" = ≅
almost equal to = asymptotic to, U+2248 ISOamsr:
"≈" = UTF-8 encoded character
"≈" = ≈
"≈" = ≈
"≈" = ≈
not equal to, U+2260 ISOtech:
"≠" = UTF-8 encoded character
"≠" = ≠
"≠" = ≠
"≠" = ≠
identical to, U+2261 ISOtech:
"≡" = UTF-8 encoded character
"≡" = ≡
"≡" = ≡
"≡" = ≡
less-than or equal to, U+2264 ISOtech:
"≤" = UTF-8 encoded character
"≤" = ≤
"≤" = ≤
"≤" = ≤
greater-than or equal to, U+2265 ISOtech:
"≥" = UTF-8 encoded character
"≥" = ≥
"≥" = ≥
"≥" = ≥
subset of, U+2282 ISOtech:
"⊂" = UTF-8 encoded character
"⊂" = ⊂
"⊂" = ⊂
"⊂" = ⊂
superset of, U+2283 ISOtech:
"⊃" = UTF-8 encoded character
"⊃" = ⊃
"⊃" = ⊃
"⊃" = ⊃
<!-- note that nsup, 'not a superset of, U+2283' is not covered by the Symbol
font encoding and is not included. Should it be, for symmetry?
It is in ISOamsn -->
not a subset of, U+2284 ISOamsn:
"⊄" = UTF-8 encoded character
"⊄" = ⊄
"⊄" = ⊄
"⊄" = ⊄
subset of or equal to, U+2286 ISOtech:
"⊆" = UTF-8 encoded character
"⊆" = ⊆
"⊆" = ⊆
"⊆" = ⊆
superset of or equal to, U+2287 ISOtech:
"⊇" = UTF-8 encoded character
"⊇" = ⊇
"⊇" = ⊇
"⊇" = ⊇
circled plus = direct sum, U+2295 ISOamsb:
"⊕" = UTF-8 encoded character
"⊕" = ⊕
"⊕" = ⊕
"⊕" = ⊕
circled times = vector product, U+2297 ISOamsb:
"⊗" = UTF-8 encoded character
"⊗" = ⊗
"⊗" = ⊗
"⊗" = ⊗
up tack = orthogonal to = perpendicular, U+22A5 ISOtech:
"⊥" = UTF-8 encoded character
"⊥" = ⊥
"⊥" = ⊥
"⊥" = ⊥
dot operator, U+22C5 ISOamsb:
"⋅" = UTF-8 encoded character
"⋅" = ⋅
"⋅" = ⋅
"⋅" = ⋅
<!-- dot operator is NOT the same character as U+00B7 middle dot -->
<!-- Miscellaneous Technical -->
left ceiling = apl upstile, U+2308 ISOamsc:
"⌈" = UTF-8 encoded character
"⌈" = ⌈
"⌈" = ⌈
"⌈" = ⌈
right ceiling, U+2309 ISOamsc:
"⌉" = UTF-8 encoded character
"⌉" = ⌉
"⌉" = ⌉
"⌉" = ⌉
left floor = apl downstile, U+230A ISOamsc:
"⌊" = UTF-8 encoded character
"⌊" = ⌊
"⌊" = ⌊
"⌊" = ⌊
right floor, U+230B ISOamsc:
"⌋" = UTF-8 encoded character
"⌋" = ⌋
"⌋" = ⌋
"⌋" = ⌋
left-pointing angle bracket = bra, U+2329 ISOtech:
"〈" = UTF-8 encoded character
"〈" = ⟨
"〈" = 〈
"〈" = 〈
<!-- lang is NOT the same character as U+003C 'less than'
or U+2039 'single left-pointing angle quotation mark' -->
right-pointing angle bracket = ket, U+232A ISOtech:
"〉" = UTF-8 encoded character
"〉" = ⟩
"〉" = 〉
"〉" = 〉
<!-- rang is NOT the same character as U+003E 'greater than'
or U+203A 'single right-pointing angle quotation mark' -->
<!-- Geometric Shapes -->
lozenge, U+25CA ISOpub:
"◊" = UTF-8 encoded character
"◊" = ◊
"◊" = ◊
"◊" = ◊
<!-- Miscellaneous Symbols -->
black spade suit, U+2660 ISOpub:
"♠" = UTF-8 encoded character
"♠" = ♠
"♠" = ♠
"♠" = ♠
<!-- black here seems to mean filled as opposed to hollow -->
black club suit = shamrock, U+2663 ISOpub:
"♣" = UTF-8 encoded character
"♣" = ♣
"♣" = ♣
"♣" = ♣
black heart suit = valentine, U+2665 ISOpub:
"♥" = UTF-8 encoded character
"♥" = ♥
"♥" = ♥
"♥" = ♥
black diamond suit, U+2666 ISOpub:
"♦" = UTF-8 encoded character
"♦" = ♦
"♦" = ♦
"♦" = ♦
<!-- Special characters for HTML -->
<!-- Character entity set. Typical invocation:
<!ENTITY % HTMLspecial PUBLIC
"-//W3C//ENTITIES Special//EN//HTML">
%HTMLspecial; -->
<!-- Portions © International Organization for Standardization 1986:
Permission to copy in any form is granted for use with
conforming SGML systems and applications as defined in
ISO 8879, provided this notice is included in all copies.
-->
<!-- Relevant ISO entity set is given unless names are newly introduced.
New names (i.e., not in ISO 8879 list) do not clash with any
existing ISO 8879 entity names. ISO 10646 character numbers
are given for each character, in hex. CDATA values are decimal
conversions of the ISO 10646 values and refer to the document
character set. Names are ISO 10646 names.
-->
<!-- C0 Controls and Basic Latin -->
quotation mark = APL quote, U+0022 ISOnum:
""" = UTF-8 encoded character
""" = "
""" = "
""" = "
ampersand, U+0026 ISOnum:
"&" = UTF-8 encoded character
"&" = &
"&" = &
"&" = &
less-than sign, U+003C ISOnum:
""<" = <
"<" = <
"<" = <
greater-than sign, U+003E ISOnum:
">" = UTF-8 encoded character
">" = >
">" = >
">" = >
<!-- Latin Extended-A -->
latin capital ligature OE, U+0152 ISOlat2:
"Œ" = UTF-8 encoded character
"Œ" = Œ
"Œ" = Œ
"Œ" = Œ
latin small ligature oe, U+0153 ISOlat2:
"œ" = UTF-8 encoded character
"œ" = œ
"œ" = œ
"œ" = œ
<!-- ligature is a misnomer, this is a separate character in some languages -->
latin capital letter S with caron, U+0160 ISOlat2:
"Š" = UTF-8 encoded character
"Š" = Š
"Š" = Š
"Š" = Š
latin small letter s with caron, U+0161 ISOlat2:
"š" = UTF-8 encoded character
"š" = š
"š" = š
"š" = š
latin capital letter Y with diaeresis, U+0178 ISOlat2:
"Ÿ" = UTF-8 encoded character
"Ÿ" = Ÿ
"Ÿ" = Ÿ
"Ÿ" = Ÿ
<!-- Spacing Modifier Letters -->
modifier letter circumflex accent, U+02C6 ISOpub:
"ˆ" = UTF-8 encoded character
"ˆ" = ˆ
"ˆ" = ˆ
"ˆ" = ˆ
small tilde, U+02DC ISOdia:
"˜" = UTF-8 encoded character
"˜" = ˜
"˜" = ˜
"˜" = ˜
<!-- General Punctuation -->
en space, U+2002 ISOpub:
" " = UTF-8 encoded character
" " =  
" " =  
" " =  
em space, U+2003 ISOpub:
" " = UTF-8 encoded character
" " =  
" " =  
" " =  
thin space, U+2009 ISOpub:
" " = UTF-8 encoded character
" " =  
" " =  
" " =  
zero width non-joiner, U+200C NEW RFC 2070:
"" = UTF-8 encoded character
"" = ‌
"" = ‌
"" = ‌
zero width joiner, U+200D NEW RFC 2070:
"" = UTF-8 encoded character
"" = ‍
"" = ‍
"" = ‍
left-to-right mark, U+200E NEW RFC 2070:
"" = UTF-8 encoded character
"" = ‎
"" = ‎
"" = ‎
right-to-left mark, U+200F NEW RFC 2070:
"" = UTF-8 encoded character
"" = ‏
"" = ‏
"" = ‏
en dash, U+2013 ISOpub:
"–" = UTF-8 encoded character
"–" = –
"–" = –
"–" = –
em dash, U+2014 ISOpub:
"—" = UTF-8 encoded character
"—" = —
"—" = —
"—" = —
left single quotation mark, U+2018 ISOnum:
"‘" = UTF-8 encoded character
"‘" = ‘
"‘" = ‘
"‘" = ‘
right single quotation mark, U+2019 ISOnum:
"’" = UTF-8 encoded character
"’" = ’
"’" = ’
"’" = ’
single low-9 quotation mark, U+201A NEW:
"‚" = UTF-8 encoded character
"‚" = ‚
"‚" = ‚
"‚" = ‚
left double quotation mark, U+201C ISOnum:
"“" = UTF-8 encoded character
"“" = “
"“" = “
"“" = “
right double quotation mark, U+201D ISOnum:
"”" = UTF-8 encoded character
"”" = ”
"”" = ”
"”" = ”
double low-9 quotation mark, U+201E NEW:
"„" = UTF-8 encoded character
"„" = „
"„" = „
"„" = „
dagger, U+2020 ISOpub:
"†" = UTF-8 encoded character
"†" = †
"†" = †
"†" = †
double dagger, U+2021 ISOpub:
"‡" = UTF-8 encoded character
"‡" = ‡
"‡" = ‡
"‡" = ‡
per mille sign, U+2030 ISOtech:
"‰" = UTF-8 encoded character
"‰" = ‰
"‰" = ‰
"‰" = ‰
single left-pointing angle quotation mark, U+2039 ISO proposed:
"‹" = UTF-8 encoded character
"‹" = ‹
"‹" = ‹
"‹" = ‹
<!-- lsaquo is proposed but not yet ISO standardized -->
single right-pointing angle quotation mark, U+203A ISO proposed:
"›" = UTF-8 encoded character
"›" = ›
"›" = ›
"›" = ›
<!-- rsaquo is proposed but not yet ISO standardized -->
euro sign, U+20AC NEW:
"€" = UTF-8 encoded character
"€" = €
"€" = €
"€" = €