unixdev.net


Switch to SpeakEasy.net DSL

The Modular Manual Browser

Home Page
Manual: (OSF1-V5.1-alpha)
Page:
Section:
Apropos / Subsearch:
optional field



shiftjis(5)							  shiftjis(5)



NAME

  shiftjis, SJIS - A character encoding	system (codeset) for Japanese

DESCRIPTION

  The Shift JIS	(SJIS) codeset consists	of the following character sets:

    +  JIS X0201

    +  JIS X0208

    +  User-Defined Characters (UDC)

  Shift	JIS Encoding


  Shift	JIS character codes use	a combination of single-byte data and 2-byte
  data to represent characters defined in the JIS X0201	and JIS	X0208 stan-
  dards	and in the UDC area.

  All JIS X0201	characters are represented in the form of single-byte data.
  The Roman letters in JIS X0201 are encoded by	setting	the most significant
  bit (MSB) of each byte to off, while the Katakana characters are encoded by
  setting the most significant bit (MSB) of each byte to on. For more infor-
  mation on JIS	X0201 characters, refer	to deckanji(5).	In the Super DEC
  Kanji	codeset, the code ranges for JIS X0201 characters are as follows:

    +  For Roman letters, 00 to	7F

    +  For Katakana characters,	A1 to DF

  JIS X0208 characters are encoded in 2-byte values.  The values for the
  first	bytes are encoded so that they fall outside the	range of byte values
  for JIS X0201	characters (in other words, the	JIS X0208 first	byte ranges
  are from 81 to 9F and	from E0	to FC).	In this	manner,	characters from	the
  two different	standards can be supported by the same codeset.	The range for
  the second byte of a JIS X0208 character is 40 to FC (except for 7F).	 For
  more information on JIS X0208	characters, refer to deckanji(5).

  The Shift JIS	codeset	provides for 2444 UDC characters. These	are encoded
  as 2-byte values whose code range is F040 to FCFC.

  Font Support for Super DEC Kanji


  For display devices, the operating system supports Super DEC Kanji encoding
  by conversion	to DEC Kanji encoding and then using fonts available for DEC
  Kanji.  Refer	to the iconv_intro(5) reference	page for information on
  codeset conversion.

  For printers,	the operating system supports only printer-resident fonts;
  therefore, Super DEC Kanji fonts cannot be dynamically loaded	to a printer.
  For general information on printing non-English text,	refer to
  i18n_printing(5).





SEE ALSO

  Commands: locale(1)

  Others: ascii(5), deckanji(5), eucJP(5), i18n_intro(5), i18n_printing(5),
  iconv_intro(5), iso2022jp(5),	Japanese(5), jiskanji(5), l10n_intro(5),
  sdeckanji(5)