Changelogs & Release Notes

Show changelog from version to

6 Bugfixes
4 Features
9 Tweaks
SetaPDF-Extractor Component
Feature
  • Added new extraction strategy which extract plain text based on found glyphs (instead of fragments).
Tweak
  • Refactored sorter classes/logics (performance gain).
  • Optimized word strategy to ignore character codes below 32.
  • Optimized recognizing of word breaks in "numeric words".
  • Optimized resolving of undefined space width value.
SetaPDF-Core Component
Feature
  • Added possibility to add custom glyph names and callbacks which are used by SetaPDF_Core_Font_Glyph_List::byName().
  • Added more logic from the Adobe Glyph List specification to the Glyph List class.
  • Added new pdf parser class to parse tokens without object initiation (a kind of lightweight parser for content streams).
Bugfix
  • Fixed resolving of Differences array in simple fonts (if it is referenced by an indirect object).
  • Fixed resolving of pages in page tree structure in SetaPDF_Core_Document_Pages::_ensureAllPageObjects().
  • Fixed faulty encoding value in ZapfDingbats and Symbol font.
  • Fixed handling of Differences array in fonts encoding dictionary.
  • Fixed missing parameter in SetaPDF_Core_Document_Page_Annotation_BorderStyle::getDictionary() calls.
  • Fixed observing in __wakeup() method of indirect objects.
Tweak
  • Use sys_get_temp_dir() as default directory path for temporary files in SetaPDF_Core_Writer_TempFile.
  • Added static method createTempFile() to SetaPDF_Core_Writer_TempFile to allow creation of temporary files with specified content.
  • Optimized pdf parser classes to reduce reader/tokenizer method calls.
  • Increased default $length parameter in SetaPDF_Core_Reader_String::increaseLength() to reduce method calls (performance gain).
  • Optimized tokenizer class to reduce reader method calls.