-
:
. Unicode
-
/ , !!
-
:
-
( . behaviour behavior
( )
( v )
( /)
-
(coded character set): , bits. (glyph): . (, ) (font): (glyphs) .
-
ASCII (American Standard Code for Information Interchange): 7-bit 8-bit , EBCDIC (Extended Binary Coded Decimal Interchange Code): 8-bit IBM ISO 8859 ASCII Kanji: 6.000 GuoBiao (GB): 13.000
-
transliteration: ( , , )
: ( browsers , .
-
: ( , , ) :
-
Unicode 1980 Joseph Becker, Lee Collins Mark Davis
: Universality () Uniqueness () Uniformity ()
1991 Unicode Consortium 1991
-
Unicode , , , , ,
-
16-bit 65.536 256 ASCII 8.192 4.096 , 4.096 CJK (China, Japan, Korea) 20.000 .
-
16-bit . Unicode 1.000.000 1.114.112 . , Unicode, UTF8, UTF16 UTF32.
-
UTF (Universal Multiple-Octet Character Set Transformation Format):
UTF8: 8-bitsUTF16: 16-bitsUTF32: 32 bits
-
Unicode
ASCII Unicode
-
Unicode
-
Unicode Hindi , 16 . Unicode , 3.0.0. 10 , ,
-
Vidyanidhi 281 , 25.000-30.000 20%-25% : , , , . 22.000 19.000 , 2.200 Hindi 640 Kannada.
-
. , , . , , matras conjuncts. .
-
Vidyanidhi . , Hindi Kannada. . , , . , ThesisID. Dublin Core.
-
Unicode
-
.
-
Much remains to be done before linguistic barriers can be surmounted as effectively as geographic ones (Oard 1997)