String

From GNUpdf

PDF string types
Enlarge
PDF string types

Text strings

The text string type is used to store encoded text using one of the following CCS:

CCS description
PDFDocEncoding Unibyte encoding of a character set defined in the PDF standard. The character set contain the ISO Latin 1 characters.
UTF-16BE Multibyte encoding of the unicode/ISO-10646 character set. A big-endian UTF16 mark precedes the data in the encoded string (U+FEFF or 254d, 255d).

ASCII strings

An ASCII string contain text encoded using a unibyte encoding of the ASCII (ISO 646) character set. Byte strings

Byte strings are string objects whose content is interpreted as a sequence of unsigned chars (octects). The values encoded in the octects may not be characters.

If the values encoded in the octects are in fact characters, the used encoding is unknown.