unixdev.net


Switch to SpeakEasy.net DSL

The Modular Manual Browser

Home Page
Manual: (OSF1-V5.1-alpha)
Page:
Section:
Apropos / Subsearch:
optional field



jiskanji(5)							  jiskanji(5)



NAME

  jiskanji, jiskanji7, JIS7 - A	character encoding system (codeset) for
  Japanese

DESCRIPTION

  JIS Kanji is a codeset that uses the JIS X0202 symbol	extension method for
  encoding the JIS X0208 and JIS X0201 character sets. There are two types of
  JIS Kanji encoding: 7-bit JIS	Kanji code and 8-bit JIS Kanji code.

  7-bit	JIS Kanji Code


  In 7-bit JIS Kanji encoding, all character values are	7-bit bytes. Charac-
  ters are interpreted according to preceding in and out sequences as fol-
  lows:

    +  Kanji in	sequence (ESC $	B)

       The code	values following the Kanji in sequence (ESC $ B) are treated
       as characters in	the JIS	X0208 Kanji character set.

    +  Kanji out sequence (ESC ( B)

       The code	values following the Kanji out sequence	(ESC ( B) are treated
       as ASCII	characters.

    +  Supplementary Kanji in sequence (ESC $ (	D)

       The code	values following the supplementary Kanji in sequence (ESC $ (
       D) are treated as characters in the JIS X0212 supplementary Kanji
       character set.

    +  User-Defined Character (UDC) in sequence	(ESC $ ( 0)

       The code	values following the UDC in sequence (ESC $ ( 0) are treated
       as characters in	the vendor-defined or user-defined character set.

    +  Kana in (SO) and	Kana out (SI) sequences

       The code	values following SO(0x0e) and preceding	SI(0x0f) are treated
       as characters in	the JIS	X0201 Katakana character set.

    +  Katakana	in sequence (ESC ( I)

       Code values following the Katakana in sequence (ESC ( I)	are treated
       as characters in	the JIS	X0201 Katakana character set. In this case,
       the Kanji out sequence is used to switch	back to	ASCII code.

       The Katakana in and Kanji out sequences are an alternative to using
       the Kana	in and out sequences (SO/SI).


  8-bit	JIS Kanji Code


  In 8-bit JIS Kanji encoding, the JIS X0201 Katakana characters are
  represented as 8-bit bytes. Using this form of encoding, in and out
  sequences have the following effect:

    +  Kanji in	sequence (ESC $	B)

       Code values following the Kanji in sequence (ESC	$ B) are treated as
       characters in the JIS X0208 Kanji character set.

    +  Supplementary Kanji in sequence (ESC $ (	D)

       Code values following the supplementary Kanji in	sequence (ESC $	( D)
       are treated as characters in the	JIS X0212 supplementary	Kanji charac-
       ter set.

    +  User-Defined Character (UDC) in sequence	(ESC $ ( 0)

       Code values following the UDC in	sequence (ESC $	( 0) are treated as
       vendor-defined or user-defined characters.

    +  Kanji out sequence (ESC ( B) Code values	following the Kanji out
       sequence	(ESC ( B) are treated as ASCII characters.

    +  Kana in and out sequences (SI/SO)

       These sequences are ignored.

  Codeset Conversion


  The following	codeset	converter pairs	are available for converting Japanese
  characters between jiskanji7 or JIS7 and other encoding formats.  The	RES-
  TRICTIONS section discusses some conversion limitations that apply to	these
  converters.

  Refer	to iconv_intro(5) for an introduction to codeset conversion. For more
  information about the	other codeset for which	jiskanji7 or JIS7 is the
  input	or output, see the reference page specified in the list	item.

    +  deckanji_jiskanji7 or deckanji_JIS7, jiskanji7_deckanji or
       JIS7_deckanji

       Converting from and to the DEC Kanji codeset: deckanji(5).

    +  eucJP_jiskanji7 or eucJP_JIS7, jiskanji7_eucJP or JIS7_eucJP

       Converting from and to Japanese Extended	UNIX Code: eucJP(5).

    +  eucTW_jiskanji7 or eucTW_JIS7, jiskanji7_eucTW or JIS7_eucTW

       Converting from and to Taiwanese	Extended UNIX Code: eucTW(5).

    +  sdeckanji_jiskanji7 or sdeckanji_JIS7, jiskanji7_sdeckanji or
       JIS7_sdeckanji

       Converting from and to the Super	DEC Kanji codeset: sdeckanji(5).

    +  SJIS_jiskanji7 or SJIS_JIS7, jiskanji7_SJIS or JIS7_SJIS

       Converting from and to Shift JIS	format:	SJIS(5).

       Shift JIS encoding format is identical to encoding in Microsoft code-
       pages used on PC	systems. Therefore, you	can use	these converters to
       convert Japanese	characters between JIS Kanji and PC code-page format.
       For general information on how the operating system supports PC code
       pages, see code_page(5).



RESTRICTIONS

  The JIS Kanji	codeset	is not supported directly by a locale but through
  code conversion (through the iconv utility, Japanese terminal	(tty) code
  conversion, and so forth).

  In the codeset naming	conventions used by the	iconv utility, the string
  JIS7 indicates 7-bit JIS Kanji code that follows a Katakana in sequence and
  the string jiskanji7 indicates 7-bit JIS Kanji code entered between Kana in
  and out sequences.  The following sequences are valid	for input to the
  iconv	utility	but are	not generated when code	is converted to	jiskanji7:

    +  Kanji in	(ESC $ @)

    +  Kanji in	(ESC && @ ESC $ B)

    +  Kanji in	(ESC $ ( B)

    +  Kanji in	(ESC $ ( @)

    +  Supplementary Kanji in (ESC $ D)

    +  Kana in (ESC ( J)

    +  Kana in (ESC ( H)

  In the code naming conventions of the	Japanese terminal, the string jis7
  indicates 7-bit JIS Kanji code and the string	jis8 indicates 8-bit JIS
  Kanji	code. When the terminal	code is	set to jis7, the Kana in and out
  sequences (SI/SO) are	used for JIS X0201 Katakana character representation.

SEE ALSO

  Commands: locale(1)

  Others: ascii(5), code_page(5), deckanji(5), eucJP(5), i18n_intro(5),
  i18n_printing(5), iconv_intro(5), iso2022jp(5), Japanese(5), l10n_intro(5),
  sdeckanji(5),	shiftjis(5)