Changelogs & Release Notes for SetaPDF-Extractor 2.3.0.817

switch product

Show changelog from version to

6 Bugfixes

4 Features

9 Tweaks

Added new extraction strategy which extract plain text based on found glyphs (instead of fragments).

Added possibility to add custom glyph names and callbacks which are used by SetaPDF_Core_Font_Glyph_List::byName().
Added more logic from the Adobe Glyph List specification to the Glyph List class.
Added new pdf parser class to parse tokens without object initiation (a kind of lightweight parser for content streams).

Fixed resolving of Differences array in simple fonts (if it is referenced by an indirect object).
Fixed resolving of pages in page tree structure in SetaPDF_Core_Document_Pages::_ensureAllPageObjects().
Fixed faulty encoding value in ZapfDingbats and Symbol font.
Fixed handling of Differences array in fonts encoding dictionary.
Fixed missing parameter in SetaPDF_Core_Document_Page_Annotation_BorderStyle::getDictionary() calls.
Fixed observing in __wakeup() method of indirect objects.

Use sys_get_temp_dir() as default directory path for temporary files in SetaPDF_Core_Writer_TempFile.
Added static method createTempFile() to SetaPDF_Core_Writer_TempFile to allow creation of temporary files with specified content.
Optimized pdf parser classes to reduce reader/tokenizer method calls.
Increased default $length parameter in SetaPDF_Core_Reader_String::increaseLength() to reduce method calls (performance gain).
Optimized tokenizer class to reduce reader method calls.