ANSEL
ANSEL, The American National Standard for Extended Latin Alphabet Coded Character Set for Bibliographic Use, was a character set used in text encoding. It provided a table of coded values for the representation of characters of the extended Latin alphabet in machine-readable form for thirty-five languages written in the Latin alphabet and for fifty-one romanized languages. The standard was reaffirmed in 2003 although it has been administratively withdrawn by ANSI effective 14 February 2013.[1] It is registered as Registration # 231 in the ISO International Register of Coded Character Sets to be Used with Escape Sequences.[2][3]
ANSEL is composed of a set of 63 graphic characters intended for use with ASCII, the American National Standard Code for Information Interchange, ANSI X3.4-1986,[3] including 29 combining diacritic characters. A combining diacritic character precedes the spacing character on which it should be superimposed.[1]
Code page layout
The following table shows ANSI/NISO Z39.47-1993 (R2003).[1] Each character is shown with its Unicode equivalent.
_0 | _1 | _2 | _3 | _4 | _5 | _6 | _7 | _8 | _9 | _A | _B | _C | _D | _E | _F | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0_ |
NUL 0000 0 |
SOH 0001 1 |
STX 0002 2 |
ETX 0003 3 |
EOT 0004 4 |
ENQ 0005 5 |
ACK 0006 6 |
BEL 0007 7 |
BS 0008 8 |
HT 0009 9 |
LF 000A 10 |
VT 000B 11 |
FF 000C 12 |
CR 000D 13 |
SO 000E 14 |
SI 000F 15 |
1_ |
DLE 0010 16 |
DC1 0011 17 |
DC2 0012 18 |
DC3 0013 19 |
DC4 0014 20 |
NAK 0015 21 |
SYN 0016 22 |
ETB 0017 23 |
CAN 0018 24 |
EM 0019 25 |
SUB 001A 26 |
ESC 001B 27 |
FS 001C 28 |
GS 001D 29 |
RS 001E 30 |
US 001F 31 |
2_ |
SP 0020 32 |
! 0021 33 |
" 0022 34 |
# 0023 35 |
$ 0024 36 |
% 0025 37 |
& 0026 38 |
' 0027 39 |
( 0028 40 |
) 0029 41 |
* 002A 42 |
+ 002B 43 |
, 002C 44 |
- 002D 45 |
. 002E 46 |
/ 002F 47 |
3_ |
0 0030 48 |
1 0031 49 |
2 0032 50 |
3 0033 51 |
4 0034 52 |
5 0035 53 |
6 0036 54 |
7 0037 55 |
8 0038 56 |
9 0039 57 |
: 003A 58 |
; 003B 59 |
< 003C 60 |
= 003D 61 |
> 003E 62 |
? 003F 63 |
4_ |
@ 0040 64 |
A 0041 65 |
B 0042 66 |
C 0043 67 |
D 0044 68 |
E 0045 69 |
F 0046 70 |
G 0047 71 |
H 0048 72 |
I 0049 73 |
J 004A 74 |
K 004B 75 |
L 004C 76 |
M 004D 77 |
N 004E 78 |
O 004F 79 |
5_ |
P 0050 80 |
Q 0051 81 |
R 0052 82 |
S 0053 83 |
T 0054 84 |
U 0055 85 |
V 0056 86 |
W 0057 87 |
X 0058 88 |
Y 0059 89 |
Z 005A 90 |
[ 005B 91 |
\ 005C 92 |
] 005D 93 |
^ 005E 94 |
_ 005F 95 |
6_ |
` 0060 96 |
a 0061 97 |
b 0062 98 |
c 0063 99 |
d 0064 100 |
e 0065 101 |
f 0066 102 |
g 0067 103 |
h 0068 104 |
i 0069 105 |
j 006A 106 |
k 006B 107 |
l 006C 108 |
m 006D 109 |
n 006E 110 |
o 006F 111 |
7_ |
p 0070 112 |
q 0071 113 |
r 0072 114 |
s 0073 115 |
t 0074 116 |
u 0075 117 |
v 0076 118 |
w 0077 119 |
x 0078 120 |
y 0079 121 |
z 007A 122 |
{ 007B 123 |
| 007C 124 |
} 007D 125 |
~ 007E 126 |
DEL 007F 127 |
8_ |
128 |
129 |
130 |
131 |
132 |
133 |
134 |
135 |
136 |
137 |
138 |
139 |
140 |
141 |
142 |
143 |
9_ |
144 |
145 |
146 |
147 |
148 |
149 |
150 |
151 |
152 |
153 |
154 |
155 |
156 |
157 |
158 |
159 |
A_ |
160 |
Ł 0141 161 |
Ø 00D8 162 |
Đ 0110 163 |
Þ 00DE 164 |
Æ 00C6 165 |
Œ 0152 166 |
ʹ 02B9 167 |
· 00B7 168 |
♭ 266D 169 |
® 00AE 170 |
± 00B1 171 |
Ơ 01A0 172 |
Ư 01AF 173 |
ʼ 02BC 174 |
175 |
B_ |
ʻ 02BB 176 |
ł 0142 177 |
ø 00F8 178 |
đ 0111 179 |
þ 00FE 180 |
æ 00E6 181 |
œ 0153 182 |
ʺ 02BA 183 |
ı 0131 184 |
£ 00A3 185 |
ð 00F0 186 |
187 |
ơ 01A1 188 |
ư 01B0 189 |
190 |
191 |
C_ |
° 00B0 192 |
ℓ 2113 193 |
℗ 2117 194 |
© 00A9 195 |
♯ 266F 196 |
¿ 00BF 197 |
¡ 00A1 198 |
199 |
200 |
201 |
202 |
203 |
204 |
205 |
206 |
207 |
D_ |
208 |
209 |
210 |
211 |
212 |
213 |
214 |
215 |
216 |
217 |
218 |
219 |
220 |
221 |
222 |
223 |
E_ |
̉ 0309 224 |
̀ 0300 225 |
́ 0301 226 |
̂ 0302 227 |
̃ 0303 228 |
̄ 0304 229 |
̆ 0306 230 |
̇ 0307 231 |
̈ 0308 232 |
̌ 030C 233 |
̊ 030A 234 |
︠ FE20 235 |
︡ FE21 236 |
̕ 0315 237 |
̋ 030B 238 |
̐ 0310 239 |
F_ |
̧ 0327 240 |
̨ 0328 241 |
̣ 0323 242 |
̤ 0324 243 |
̥ 0325 244 |
̳ 0333 245 |
̲ 0332 246 |
̦ 0326 247 |
̜ 031C 248 |
̮ 032E 249 |
︢ FE22 250 |
︣ FE23 251 |
252 |
253 |
̓ 0313 254 |
255 |
_0 | _1 | _2 | _3 | _4 | _5 | _6 | _7 | _8 | _9 | _A | _B | _C | _D | _E | _F |
Use
GEDCOM
The GEDCOM specification for exchanging genealogical data refers to ANSEL (ANSI/NISO Z39.47-1985) as a valid text encoding for GEDCOM files and extends it with additional characters which are shown in the following table.[4][5]
Hex | Unicode | Glyph | Description |
---|---|---|---|
0xBE | 25A1 | □ | empty box |
0xBF | 25A0 | ■ | black box |
0xCD | 0065 | e | midline e |
0xCE | 006F | o | midline o |
0xCF | 00DF | ß | es zet |
0xFC | 0338 | ̸ | diacritic slash through char |
MARC21
The MARC 21 Extended Latin character set is synchronized with ANSEL[3] but additionally supports the euro sign (€) and eszett (ß) characters.[6]
References
<templatestyles src="Reflist/styles.css" />
Cite error: Invalid <references>
tag; parameter "group" is allowed only.
<references />
, or <references group="..." />
External links
- National Information Standards Organization (NISO)
- American National Standards Institute (ANSI)
- ANSI/NISO Z39.47-1985
- ANSI/NISO Z39.47-1993 (R2003)
- ISO-IR 231
- MARC 21 Code Table Extended Latin (ANSEL)
- ↑ 1.0 1.1 1.2 Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ 3.0 3.1 3.2 Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.