Windows-1253
Windows code page 1253 ("Greek - ANSI"),[1] commonly known by its IANA-registered name Windows-1253[2] or abbreviated as cp1253,[3][4] is a Microsoft Windows code page used to write modern Greek. It is not capable of supporting the older polytonic Greek.
MIME / IANA | Windows-1253 |
---|---|
Alias(es) | cp1253 |
Language(s) | Greek, English |
Standard | WHATWG Encoding Standard |
Classification | Extended ASCII, Windows-125x |
Based on | ISO/IEC 8859-7, Windows-1252 |
It is not fully compatible with ISO 8859-7 because a few characters, including the letter Ά, are located at different byte values:
µ
[lower-alpha 1] and¶
are added at their locations from Windows-1252 and ISO 8859-1 (0xB5 and 0xB6). This collides with the locations of΅
andΆ
, respectively, in ISO 8859-7.‘
and’
are moved from their ISO 8859-7 locations (0xA1 and 0xA2) to their Windows-1252 locations (0x91 and 0x92). The displaced΅
andΆ
are moved to the vacated space at 0xA1 and 0xA2 respectively.¤
and¥
are added at their locations from Windows-1252 and ISO 8859-1 (0xA4 and 0xA5). This collides with additions made to ISO 8859-7 in 2003, when€
and₯
respectively were added to the same locations. The€
was added to Windows-1253 at 0x80, the same location which it was added to in Windows-1252. An iota subscript (ͺ) was also added to ISO 8859-7 at 0xAA; this remains unallocated in Windows-1253.- Several further characters are added at their Windows-1252 locations, although the rest do not collide with ISO 8859-7.
IBM uses code page 1253 (CCSID 1253 and euro sign extended CCSID 5349) for Windows-1253.[5][6][7]
Unicode is preferred for Greek in modern applications, especially as UTF-8 encoding on the Internet. Unicode provides many more glyphs for complete coverage, see Greek alphabet in Unicode and Ancient Greek Musical Notation for tables.
Character set
The following table shows Windows-1253. Each character is shown with its Unicode equivalent.
_0 | _1 | _2 | _3 | _4 | _5 | _6 | _7 | _8 | _9 | _A | _B | _C | _D | _E | _F | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0_ 0 |
NUL 0000 |
SOH 0001 |
STX 0002 |
ETX 0003 |
EOT 0004 |
ENQ 0005 |
ACK 0006 |
BEL 0007 |
BS 0008 |
HT 0009 |
LF 000A |
VT 000B |
FF 000C |
CR 000D |
SO 000E |
SI 000F |
1_ 16 |
DLE 0010 |
DC1 0011 |
DC2 0012 |
DC3 0013 |
DC4 0014 |
NAK 0015 |
SYN 0016 |
ETB 0017 |
CAN 0018 |
EM 0019 |
SUB 001A |
ESC 001B |
FS 001C |
GS 001D |
RS 001E |
US 001F |
2_ 32 |
SP 0020 |
! 0021 |
" 0022 |
# 0023 |
$ 0024 |
% 0025 |
& 0026 |
' 0027 |
( 0028 |
) 0029 |
* 002A |
+ 002B |
, 002C |
- 002D |
. 002E |
/ 002F |
3_ 48 |
0 0030 |
1 0031 |
2 0032 |
3 0033 |
4 0034 |
5 0035 |
6 0036 |
7 0037 |
8 0038 |
9 0039 |
: 003A |
; 003B |
< 003C |
= 003D |
> 003E |
? 003F |
4_ 64 |
@ 0040 |
A 0041 |
B 0042 |
C 0043 |
D 0044 |
E 0045 |
F 0046 |
G 0047 |
H 0048 |
I 0049 |
J 004A |
K 004B |
L 004C |
M 004D |
N 004E |
O 004F |
5_ 80 |
P 0050 |
Q 0051 |
R 0052 |
S 0053 |
T 0054 |
U 0055 |
V 0056 |
W 0057 |
X 0058 |
Y 0059 |
Z 005A |
[ 005B |
\ 005C |
] 005D |
^ 005E |
_ 005F |
6_ 96 |
` 0060 |
a 0061 |
b 0062 |
c 0063 |
d 0064 |
e 0065 |
f 0066 |
g 0067 |
h 0068 |
i 0069 |
j 006A |
k 006B |
l 006C |
m 006D |
n 006E |
o 006F |
7_ 112 |
p 0070 |
q 0071 |
r 0072 |
s 0073 |
t 0074 |
u 0075 |
v 0076 |
w 0077 |
x 0078 |
y 0079 |
z 007A |
{ 007B |
| 007C |
} 007D |
~ 007E |
DEL 007F |
8_ 128 |
€ 20AC |
‚ 201A |
ƒ 0192 |
„ 201E |
… 2026 |
† 2020 |
‡ 2021 |
‰ 2030 |
‹ 2039 |
|||||||
9_ 144 |
‘ 2018 |
’ 2019 |
“ 201C |
” 201D |
• 2022 |
– 2013 |
— 2014 |
™ 2122 |
› 203A |
|||||||
A_ 160 |
NBSP 00A0 |
΅ 0385 |
Ά 0386 |
£ 00A3 |
¤ 00A4 |
¥ 00A5 |
¦ 00A6 |
§ 00A7 |
¨ 00A8 |
© 00A9 |
« 00AB |
¬ 00AC |
SHY 00AD |
® 00AE |
― 2015 | |
B_ 176 |
° 00B0 |
± 00B1 |
² 00B2 |
³ 00B3 |
΄ 0384 |
µ 00B5 |
¶ 00B6 |
· 00B7 |
Έ 0388 |
Ή 0389 |
Ί 038A |
» 00BB |
Ό 038C |
½ 00BD |
Ύ 038E |
Ώ 038F |
C_ 192 |
ΐ 0390 |
Α 0391 |
Β 0392 |
Γ 0393 |
Δ 0394 |
Ε 0395 |
Ζ 0396 |
Η 0397 |
Θ 0398 |
Ι 0399 |
Κ 039A |
Λ 039B |
Μ 039C |
Ν 039D |
Ξ 039E |
Ο 039F |
D_ 208 |
Π 03A0 |
Ρ 03A1 |
Σ 03A3 |
Τ 03A4 |
Υ 03A5 |
Φ 03A6 |
Χ 03A7 |
Ψ 03A8 |
Ω 03A9 |
Ϊ 03AA |
Ϋ 03AB |
ά 03AC |
έ 03AD |
ή 03AE |
ί 03AF | |
E_ 224 |
ΰ 03B0 |
α 03B1 |
β 03B2 |
γ 03B3 |
δ 03B4 |
ε 03B5 |
ζ 03B6 |
η 03B7 |
θ 03B8 |
ι 03B9 |
κ 03BA |
λ 03BB |
μ 03BC |
ν 03BD |
ξ 03BE |
ο 03BF |
F_ 240 |
π 03C0 |
ρ 03C1 |
ς 03C2 |
σ 03C3 |
τ 03C4 |
υ 03C5 |
φ 03C6 |
χ 03C7 |
ψ 03C8 |
ω 03C9 |
ϊ 03CA |
ϋ 03CB |
ό 03CC |
ύ 03CD |
ώ 03CE |
Letter Number Punctuation Symbol Other Undefined
Differences from ISO 8859-7 are shown with darker shading on top of their legend color.
See also
- ISO 8859-7
- ISO 5428
Footnotes
- This is in addition to the existing
μ
at 0xEC, which remains in place. Unicode calls the one at 0xB5 "micro sign" (U+00B5) and the one at 0xEC "Greek small letter Mu" (U+03BC), although the former is mapped to the latter by NFKC (although not NFC) Unicode normalization. See also Duplicate characters in Unicode § Duplicate vs. derived character.
References
- Microsoft. "Codepage 1253: Greek - ANSI". Unicode Consortium.
- Lazhintseva, Katya (1996-05-03). "Registration of new MIME charset: Windows-1253". IANA.
- Steele, Shawn (1998-04-15). "CP1253.TXT: cp1253 to Unicode table, version 2.01". Unicode Consortium.
- "7.2.3. Standard Encodings". Python 3.6 Documentation. Python Software Foundation.
- "Code page 1253 information document". Archived from the original on 2016-03-03.
- "CCSID 1253 information document". Archived from the original on 2016-03-27.
- "CCSID 5349 information document". Archived from the original on 2014-11-29.
- Code Page CPGID 01253 (pdf) (PDF), IBM
- Code Page CPGID 01253 (txt), IBM
- International Components for Unicode (ICU), ibm-1253_P100-1995.ucm, 2002-12-03
- International Components for Unicode (ICU), ibm-5349_P100-1998.ucm, 2002-12-03