Network Working Group                              KOI8-U Working Group
Request for Comments: 2319                                   April 1998
Category: Informational


                    Ukrainian Character Set KOI8-U

Status of this Memo

  This memo provides information for the Internet community.  It does
  not specify an Internet standard of any kind.  Distribution of this
  memo is unlimited.

Copyright Notice

  Copyright (C) The Internet Society (1998).  All Rights Reserved.

Abstract

  This document provides information about character encoding KOI8-U
  (KOI8 Ukrainian) wich is a de-facto standard in Ukrainian Internet
  community.  KOI8-U is compatible with KOI8-R (RFC 1489) in all
  Russian letters and extends it with four Ukrainian letters which
  locations are compliant with ISO-IR-111.  The official site of KOI8-U
  Working Group is http://www.net.ua.

Introduction

  This document provides information about character encoding KOI8-U
  (KOI8 Ukrainian) widely used in Ukrainian Internet community for mail
  and news exchange as well as for presentation of WWW information
  resources in Ukrainian language.

  Originally, specification of proposed standard koi8-u was officially
  adopted by the conference of Postmasters of Ukrainian Internet
  Service Providers in Slavsk in Autumn of 1992 presented by Igor
  Sviridov from Kiev and Stas Vorony from Kharkiv. Later in June 1995
  this specification was completed with UKRAINIAN GHE WITH UPTURN.

  KOI8-U (KOI8 Ukrainian) is a de-facto standard supported in many
  operation systems and Internet user applications including encoding
  tables, fonts, locale support for many operating systems and
  environments.

  MIME character set name: koi8-u






KOI8-U Working Group         Informational                      [Page 1]

RFC 2319             Ukrainian Character Set KOI8-U           April 1998


Relation to other RFCs

  This standard is based on several published standards:  RFC1489 (it
  is fully compatible in all Russian letters), RFC-1345, ISO-IR-111,
  ISO 10646.

Compatibility with other character sets

  The lower part of the KOI8-U Ukrainian Character Set is a complete
  copy of ASCII, just as it's used in KOI8-R and other non-ASCII
  codepages.

  The upper part of the KOI8-U Character Set contains all Russian
  letters defined in KOI8-R and four Ukrainian letters (#164, #180 -
  ukr. ie, #166, #182 - ukr. i, #167, #183 - ukr. yi, #173, #189 - ukr.
  ghe  with upturn) which locations are compliant with ISO-IR-111.

  BOX DRAWINGS elements in the other positions (that are not used by
  Ukrainian letters) are the same as in KOI8-R character set. List of
  all differences between KOI8-R and proposed KOI8-U is given in
  APPENDIX A.

Specification of the upper part of KOI8-U codepage

  The description of all characters of upper half part of KOI8-U
  codepage is given according to ISO 10646 Unicode Character Set (UCS).

  KOI8-U charset table in RFC1345 format is given in Appendix B.

   <decimal>  <hex-code>  <UCS>     <description>

  128       80      U2500      BOX DRAWINGS  LIGHT HORIZONTAL
  129       81      U2502      BOX DRAWINGS  LIGHT VERTICAL
  130       82      U250C      BOX DRAWINGS  LIGHT DOWN AND RIGHT
  131       83      U2510      BOX DRAWINGS  LIGHT DOWN AND LEFT
  132       84      U2514      BOX DRAWINGS  LIGHT UP AND RIGHT
  133       85      U2518      BOX DRAWINGS  LIGHT UP AND LEFT
  134       86      U251C      BOX DRAWINGS  LIGHT VERTICAL AND RIGHT
  135       87      U2524      BOX DRAWINGS  LIGHT VERTICAL AND LEFT
  136       88      U252C      BOX DRAWINGS  LIGHT DOWN AND HORIZONTAL
  137       89      U2534      BOX DRAWINGS  LIGHT UP AND HORIZONTAL
  138       8A      U253C      BOX DRAWINGS  LIGHT VERTICAL AND
                                HORIZONTAL
  139       8B      U2580      UPPER HALF BLOCK
  140       8C      U2584      LOWER HALF BLOCK
  141       8D      U2588      FULL BLOCK
  142       8E      U258C      LEFT HALF BLOCK
  143       8F      U2590      RIGHT HALF BLOCK



KOI8-U Working Group         Informational                      [Page 2]

RFC 2319             Ukrainian Character Set KOI8-U           April 1998


  144       90      U2591      LIGHT SHADE
  145       91      U2592      MEDIUM SHADE
  146       92      U2593      DARK SHADE
  147       93      U2320      TOP HALF INTEGRAL
  148       94      U25A0      BLACK SQUARE
  149       95      U2219      BULLET OPERATOR
  150       96      U221A      SQUARE ROOT
  151       97      U2248      ALMOST EQUAL TO
  152       98      U2264      LESS THAN OR EQUAL TO
  153       99      U2265      GREATER THAN OR EQUAL TO
  154       9A      U00A0      NO-BREAK SPACE
  155       9B      U2321      BOTTOM HALF INTEGRAL
  156       9C      U00B0      DEGREE SIGN
  157       9D      U00B2      SUPERSCRIPT TWO
  158       9E      U00B7      MIDDLE DOT
  159       9F      U00F7      DIVISION SIGN
  160       A0      U2550      BOX DRAWINGS  DOUBLE HORIZONTAL
  161       A1      U2551      BOX DRAWINGS  DOUBLE VERTICAL
  162       A2      U2552      BOX DRAWINGS  DOWN SINGLE AND RIGHT
                                DOUBLE
  163       A3      U0451      CYRILLIC SMALL LETTER IO
  164       A4      U0454      CYRILLIC SMALL LETTER UKRAINIAN IE
  165       A5      U2554      BOX DRAWINGS  DOUBLE DOWN AND RIGHT
  166       A6      U0456      CYRILLIC SMALL LETTER BYELORUSSIAN-
                                UKRAINIAN I
  167       A7      U0457      CYRILLIC SMALL LETTER YI (UKRAINIAN)
  168       A8      U2557      BOX DRAWINGS  DOUBLE DOWN AND LEFT
  169       A9      U2558      BOX DRAWINGS  UP SINGLE AND RIGHT DOUBLE
  170       AA      U2559      BOX DRAWINGS  UP DOUBLE AND RIGHT SINGLE
  171       AB      U255A      BOX DRAWINGS  DOUBLE UP AND RIGHT
  172       AC      U255B      BOX DRAWINGS  UP SINGLE AND LEFT DOUBLE
  173       AD      U0491      CYRILLIC SMALL LETTER GHE WITH UPTURN
  174       AE      U255D      BOX DRAWINGS  DOUBLE UP AND LEFT
  175       AF      U255E      BOX DRAWINGS  VERTICAL SINGLE AND
                                RIGHT DOUBLE
  176       B0      U255F      BOX DRAWINGS  VERTICAL DOUBLE AND
                                RIGHT SINGLE
  177       B1      U2560      BOX DRAWINGS  DOUBLE VERTICAL AND RIGHT
  178       B2      U2561      BOX DRAWINGS  VERTICAL SINGLE AND
                                LEFT DOUBLE
  179       B3      U0401      CYRILLIC CAPITAL LETTER IO
  180       B4      U0404      CYRILLIC CAPITAL LETTER UKRAINIAN IE
  181       B5      U2563      BOX DRAWINGS DOUBLE VERTICAL AND LEFT
  182       B6      U0406      CYRILLIC CAPITAL LETTER
                                BYELORUSSIAN-UKRAINIAN I
  183       B7      U0407      CYRILLIC CAPITAL LETTER YI (UKRAINIAN)
  184       B8      U2566      BOX DRAWINGS  DOUBLE DOWN AND HORIZONTAL
  185       B9      U2567      BOX DRAWINGS  UP SINGLE AND



KOI8-U Working Group         Informational                      [Page 3]

RFC 2319             Ukrainian Character Set KOI8-U           April 1998


                                HORIZONTAL DOUBLE
  186       BA      U2568      BOX DRAWINGS  UP DOUBLE AND
                                HORIZONTAL SINGLE
  187       BB      U2569      BOX DRAWINGS  DOUBLE UP AND HORIZONTAL
  188       BC      U256A      BOX DRAWINGS  VERTICAL SINGLE AND
                                HORIZONTAL DOUBLE
  189       BD      U0490      CYRILLIC CAPITAL LETTER GHE WITH UPTURN
  190       BE      U256C      BOX DRAWINGS  DOUBLE VERTICAL AND
                                HORIZONTAL
  191       BF      U00A9      COPYRIGHT SIGN
  192       C0      U044E      CYRILLIC SMALL LETTER YU
  193       C1      U0430      CYRILLIC SMALL LETTER A
  194       C2      U0431      CYRILLIC SMALL LETTER BE
  195       C3      U0446      CYRILLIC SMALL LETTER TSE
  196       C4      U0434      CYRILLIC SMALL LETTER DE
  197       C5      U0435      CYRILLIC SMALL LETTER IE
  198       C6      U0444      CYRILLIC SMALL LETTER EF
  199       C7      U0433      CYRILLIC SMALL LETTER GHE
  200       C8      U0445      CYRILLIC SMALL LETTER KHA
  201       C9      U0438      CYRILLIC SMALL LETTER I
  202       CA      U0439      CYRILLIC SMALL LETTER SHORT I
  203       CB      U043A      CYRILLIC SMALL LETTER KA
  204       CC      U043B      CYRILLIC SMALL LETTER EL
  205       CD      U043C      CYRILLIC SMALL LETTER EM
  206       CE      U043D      CYRILLIC SMALL LETTER EN
  207       CF      U043E      CYRILLIC SMALL LETTER O
  208       D0      U043F      CYRILLIC SMALL LETTER PE
  209       D1      U044F      CYRILLIC SMALL LETTER YA
  210       D2      U0440      CYRILLIC SMALL LETTER ER
  211       D3      U0441      CYRILLIC SMALL LETTER ES
  212       D4      U0442      CYRILLIC SMALL LETTER TE
  213       D5      U0443      CYRILLIC SMALL LETTER U
  214       D6      U0436      CYRILLIC SMALL LETTER ZHE
  215       D7      U0432      CYRILLIC SMALL LETTER VE
  216       D8      U044C      CYRILLIC SMALL LETTER SOFT SIGN
  217       D9      U044B      CYRILLIC SMALL LETTER YERU
  218       DA      U0437      CYRILLIC SMALL LETTER ZE
  219       DB      U0448      CYRILLIC SMALL LETTER SHA
  220       DC      U044D      CYRILLIC SMALL LETTER E
  221       DD      U0449      CYRILLIC SMALL LETTER SHCHA
  222       DE      U0447      CYRILLIC SMALL LETTER CHE
  223       DF      U044A      CYRILLIC SMALL LETTER HARD SIGN
  224       E0      U042E      CYRILLIC CAPITAL LETTER YU
  225       E1      U0410      CYRILLIC CAPITAL LETTER A
  226       E2      U0411      CYRILLIC CAPITAL LETTER BE
  227       E3      U0426      CYRILLIC CAPITAL LETTER TSE
  228       E4      U0414      CYRILLIC CAPITAL LETTER DE
  229       E5      U0415      CYRILLIC CAPITAL LETTER IE



KOI8-U Working Group         Informational                      [Page 4]

RFC 2319             Ukrainian Character Set KOI8-U           April 1998


  230       E6      U0424      CYRILLIC CAPITAL LETTER EF
  231       E7      U0413      CYRILLIC CAPITAL LETTER GHE
  232       E8      U0425      CYRILLIC CAPITAL LETTER KHA
  233       E9      U0418      CYRILLIC CAPITAL LETTER I
  234       EA      U0419      CYRILLIC CAPITAL LETTER SHORT I
  235       EB      U041A      CYRILLIC CAPITAL LETTER KA
  236       EC      U041B      CYRILLIC CAPITAL LETTER EL
  237       ED      U041C      CYRILLIC CAPITAL LETTER EM
  238       EE      U041D      CYRILLIC CAPITAL LETTER EN
  239       EF      U041E      CYRILLIC CAPITAL LETTER O
  240       F0      U041F      CYRILLIC CAPITAL LETTER PE
  241       F1      U042F      CYRILLIC CAPITAL LETTER YA
  242       F2      U0420      CYRILLIC CAPITAL LETTER ER
  243       F3      U0421      CYRILLIC CAPITAL LETTER ES
  244       F4      U0422      CYRILLIC CAPITAL LETTER TE
  245       F5      U0423      CYRILLIC CAPITAL LETTER U
  246       F6      U0416      CYRILLIC CAPITAL LETTER ZHE
  247       F7      U0412      CYRILLIC CAPITAL LETTER VE
  248       F8      U042C      CYRILLIC CAPITAL LETTER SOFT SIGN
  249       F9      U042B      CYRILLIC CAPITAL LETTER YERU
  250       FA      U0417      CYRILLIC CAPITAL LETTER ZE
  251       FB      U0428      CYRILLIC CAPITAL LETTER SHA
  252       FC      U042D      CYRILLIC CAPITAL LETTER E
  253       FD      U0429      CYRILLIC CAPITAL LETTER SHCHA
  254       FE      U0427      CYRILLIC CAPITAL LETTER CHE
  255       FF      U042A      CYRILLIC CAPITAL LETTER HARD SIGN

Security Considerations

  This memo raises no known security issues.

Acknowledgments

  The present edition of this document was prepared by joint activity
  of KOI8-U Working group and is a result of wide discussion in
  Ukrainian USENET newsgroup ukr.nodes and consensus reached among
  majority of Ukrainian ISPs.

  Special acknowledges to:

  Andrew Chernov <[email protected]>, author of the first on Internet
  RFC 1489 describing KOI8-R Russian character set;

  Igor Sviridov <[email protected]> for the initial work on establishing and
  support of KOI8-U character set and it's implementation in first e-
  mail products.





KOI8-U Working Group         Informational                      [Page 5]

RFC 2319             Ukrainian Character Set KOI8-U           April 1998


  Many people have contributed to the early work on koi8-u encoding:

         Stanislav V. Voronyi <[email protected]>
         Serge Vakulenko <[email protected]>
         Lena Savchenko <[email protected]>
         Igor Romanenko <[email protected]>
         Ruslan Belkin <[email protected]>
         Andrey Blohintsev <[email protected]>

References

  [1]  Chernov, A., "Registration of a Cyrillic Character Set", RFC
       1489, July 1993.

  [2]  UNICODE 2.0 CHARACTER DATABASE. - ftp://unicode.org/pub/2.0-
       Update/UnicodeData-2.0.14.txt

  [3]  Ukrainian letters in koi8-u and other character sets
       ftp://ftp.ua.net/pub/info/encodings/koi8-u/ukr_chars_in_koi8-
       u_and_others.txt, June 1995.

  [4]  ECMA-CYRILLIC. - ftp://dkuug.dk/i18n/charmaps.all/ECMA-
       CYRILLIC

  [5]  Simonsen, K., "Character Mnemonics & Character Sets" RFC 1345,
       June 1992.

KOI8-U Working Group List

  Coordinator:
  Alexander Yeremenko <[email protected]>

  Yuri Demchenko <[email protected]>
  Victor Forsyuk <[email protected]>
  Taras Heychenko <[email protected]>
  Pavel Gulchuk <[email protected]>
  Dmitry Kohmanyuk <[email protected]>
  Boris Mostovoy <[email protected]>
  Helen Panchenko <[email protected]>
  Igor Romanenko <[email protected]>
  Eugene Sherstobitov <[email protected]>
  Andrew Stesin <[email protected]>
  Igor Sviridov <[email protected]>
  Roman A. Tkachuk <[email protected]>







KOI8-U Working Group         Informational                      [Page 6]

RFC 2319             Ukrainian Character Set KOI8-U           April 1998


APPENDIX A

DIFFERENCE OF KOI8-U from KOI8-R (RFC 1489)

  KOI8-U is compatible with KOI8-R in all Cyrillic Letters and
  completes it with four Ukrainian letters UKRAINIAN IE #164, #180,
  CYRILLIC SMALL LETTER BELORUSSIAN-UKRAINIAN I #166, #182, UKRAINIAN
  YI #167, #183, UKRAINIAN GHE WITH UPTURN #173, #189.

     <decimal> <hex-code>  <UCS>    <description>

   164       A4      U0454      CYRILLIC SMALL LETTER UKRAINIAN IE
   166       A6      U0456      CYRILLIC SMALL LETTER
                                BELORUSSIAN-UKRAINIAN I
   167       A7      U0457      CYRILLIC SMALL LETTER YI (UKRAINIAN)
   173       AD      U0491      CYRILLIC SMALL LETTER UKRAINIAN GHE
                                WITH UPTURN
   180       B4      U0403      CYRILLIC CAPITAL LETTER UKRAINIAN IE
   182       B6      U0406      CYRILLIC CAPITAL LETTER
                                BELORUSSIAN-UKRAINIAN I
   183       B7      U0407      CYRILLIC CAPITAL LETTER YI (UKRAINIAN)
   189       BD      U0490      CYRILLIC CAPITAL LETTER UKRAINIAN GHE
                                WITH UPTURN




























KOI8-U Working Group         Informational                      [Page 7]

RFC 2319             Ukrainian Character Set KOI8-U           April 1998


APPENDIX B

KOI8-U charset table in RFC1345 format

  &charset KOI8-U
  &rem source: RFC 2319
  &rem Mibenum: 2088
  &rem source: http://www.net.ua/KOI8-U/
  &bits 8
  &code 0
  NU SH SX EX ET EQ AK BL BS HT LF VT FF CR SO SI
  DL D1 D2 D3 D4 NK SY EB CN EM SB EC FS GS RS US
  SP ! " Nb DO % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
  At A B C D E F G H I J K L M N O P Q R S T U V W X Y Z <( // )> '> _
  '! a b c d e f g h i j k l m n o p q r s t u v w x y z (! !! !) '? DT
  hh vv dr dl ur ul vr vl dh uh vh TB LB FB lB RB
  .S :S ?S Iu fS Sb RT ?2 =< >= NS Il DG 2S .M -:
  HH VV dR io ie DR ii yi LD uR Ur UR uL g3 UL vR
  Vr VR vL IO IE VL II YI DH uH Uh UH vH G3 VH Co
  ju a= b= c= d= e= f= g= h= i= j= k= l= m= n= o=
  p= ja r= s= t= u= z% v= %' y= z= s% je sc c% ='
  JU A= B= C= D= E= F= G= H= I= J= K= L= M= N= O=
  P= JA R= S= T= U= Z% V= %" Y= Z= S% JE Sc C% ="




























KOI8-U Working Group         Informational                      [Page 8]

RFC 2319             Ukrainian Character Set KOI8-U           April 1998


Full Copyright Statement

  Copyright (C) The Internet Society (1998).  All Rights Reserved.

  This document and translations of it may be copied and furnished to
  others, and derivative works that comment on or otherwise explain it
  or assist in its implementation may be prepared, copied, published
  and distributed, in whole or in part, without restriction of any
  kind, provided that the above copyright notice and this paragraph are
  included on all such copies and derivative works.  However, this
  document itself may not be modified in any way, such as by removing
  the copyright notice or references to the Internet Society or other
  Internet organizations, except as needed for the purpose of
  developing Internet standards in which case the procedures for
  copyrights defined in the Internet Standards process must be
  followed, or as required to translate it into languages other than
  English.

  The limited permissions granted above are perpetual and will not be
  revoked by the Internet Society or its successors or assigns.

  This document and the information contained herein is provided on an
  "AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING
  TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING
  BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION
  HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF
  MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
























KOI8-U Working Group         Informational                      [Page 9]