Class PDFDocEncoding


  • public final class PDFDocEncoding
    extends Object
    • Method Detail

      • toAcrobatString

        public static String toAcrobatString​(byte[] bytes)
        Converts byte array to same string format as Acrobat does.
      • toAcrobatString

        public static String toAcrobatString​(byte[] bytes,
                                             int start,
                                             int length)
        Converts byte array to same string format as Acrobat does.
      • toEscapeString

        public static String toEscapeString​(byte[] bytes,
                                            int start,
                                            int length)
        Converts the byte array to escaped string.
      • fromEscapeString

        public static byte[] fromEscapeString​(String str)
        Converts escaped string to a byte array.
      • fromUnicodeString

        public static byte[] fromUnicodeString​(String uniStr)
        Converts a unicode string to byte array.
      • fromUnicodeString

        public static byte[] fromUnicodeString​(String uniStr,
                                               boolean utf16)
      • decode

        public static char[] decode​(byte[] bytes,
                                    int start,
                                    int length)
                             throws CharacterCodingException
        PDFDocEncoding-encoded bytes to string conversion.
        Throws:
        CharacterCodingException - the byte stream is not a legal encoding.
      • isUnicode

        public static boolean isUnicode​(String uniStr)
        Returns true if passed string represents unicode, else false.
      • isPDFDocEncoding

        public static boolean isPDFDocEncoding​(String uniStr)
      • convertUTF16ToPDFDocEncoding

        public static void convertUTF16ToPDFDocEncoding​(byte[] input,
                                                        EncodedData<char[]> encInfo)
        Tries to convert the UTF-16 bytes passed to PDF Doc encoding. This method copies the logic of ASTextGetPDTextCopy method in ASText.cpp file of Acrobat.
        Parameters:
        input -
        encInfo - This is the encoding information of the output.
      • encToUCSStrInternal

        public static void encToUCSStrInternal​(EncodedData<char[]> inputEncInfo,
                                               EncodedData<byte[]> outputEncInfo)
        This method converts PDF Doc encoded data to UCS format. Logic is copied from EncToUCSStrInternal method in PDConvrt.cpp file of Acrobat.
        Parameters:
        inputEncInfo -
        outputEncInfo -