Class CharsetCompoundText

All Implemented Interfaces:
Comparable<Charset>

class CharsetCompoundText extends CharsetICU
  • Field Details

  • Constructor Details

    • CharsetCompoundText

      public CharsetCompoundText(String icuCanonicalName, String javaCanonicalName, String[] aliases)
  • Method Details

    • isASCIIRange

      private static boolean isASCIIRange(int codepoint)
    • isIBM915

      private static boolean isIBM915(int codepoint)
    • isIBM916

      private static boolean isIBM916(int codepoint)
    • isCompoundS3

      private static boolean isCompoundS3(int codepoint)
    • isCompoundS2

      private static boolean isCompoundS2(int codepoint)
    • isIBM914

      private static boolean isIBM914(int codepoint)
    • isIBM874

      private static boolean isIBM874(int codepoint)
    • isIBM912

      private static boolean isIBM912(int codepoint)
    • isIBM913

      private static boolean isIBM913(int codepoint)
    • isCompoundS1

      private static boolean isCompoundS1(int codepoint)
    • isISO8859_14

      private static boolean isISO8859_14(int codepoint)
    • isIBM923

      private static boolean isIBM923(int codepoint)
    • findNextEsc

      private static int findNextEsc(ByteBuffer source)
    • getState

      private static byte getState(int codepoint)
    • findStateFromEscSeq

      private static byte findStateFromEscSeq(ByteBuffer source, byte[] toUBytes, int toUBytesLength)
    • LoadConverters

      private void LoadConverters()
    • newDecoder

      public CharsetDecoder newDecoder()
      Specified by:
      newDecoder in class Charset
    • newEncoder

      public CharsetEncoder newEncoder()
      Specified by:
      newEncoder in class Charset
    • getUnicodeSetImpl

      void getUnicodeSetImpl(UnicodeSet setFillIn, int which)
      Description copied from class: CharsetICU
      This follows ucnv.c method ucnv_detectUnicodeSignature() to detect the start of the stream for example U+FEFF (the Unicode BOM/signature character) that can be ignored. Detects Unicode signature byte sequences at the start of the byte stream and returns number of bytes of the BOM of the indicated Unicode charset. 0 is returned when no Unicode signature is recognized.
      Specified by:
      getUnicodeSetImpl in class CharsetICU