Changelogs & Release Notes

Show changelog from version to

4 Bugfixes
9 Features
12 Tweaks
SetaPDF-Extractor Component
Feature
  • Added $_delimiterType property and related get- and set-methods to word classes.
  • Added SetaPDF_Extractor_Result_WordInterface interface and refactored code to use it instead of the word classes.
  • Added getString() and __toString() methods to SetaPDF_Extractor_Result_Words to be able to recreate a string by the words.
  • Added SetaPDF_Extractor_Result_Words::getStringAndOffsets() to access the string version and offset positions of words in the string result.
  • Added SetaPDF_Extractor_Result_Words::search() method to support searching for words by regular expressions.
  • Added _ignore() method to all strategy classes to allow individual logic to ignore found items.
  • Added 'Soft Hyphen (SHY)' (U+00AD) as a default hyphen character in the WordGroup strategy.
Bugfix
  • Fixed itemsJoining() method in Sorter class to support independent order or arguments.
  • Fixed behavior of FlexLine strategy if lines were merged together because of an item that acts as a kind of connector.
Tweak
  • Replaced various SetaPDF_Extractor_Result_CompareableInterface parameter types by SetaPDF_Extractor_TextItem to be consistent by its internal usage.
  • Moved $_ignoreSpaceCharacter property and related methods from SetaPDF_Extractor_Strategy_ExactPlain to SetaPDF_Extractor_Strategy_Glyph.
  • Fixed typo in SetaPDF_Extractor_Strategy_Plain::getKeepIntersectingSpaces() method name (was "getKeepIntersecingSpaces()").
SetaPDF-Core Component
Feature
  • Added SetaPDF_Core_Type_AbstractType::is() method which let you check an object for a specific type while direct or indirect objects are resolved automatically.
  • Added SetaPDF_Core_Document_Page::getDictionary() method.
Bugfix
  • Fixed count value in Outlines::appendChild() if items where added in an uncommon order.
  • Fixed SetaPDF_Core_Document_Page::setLastModified() method.
Tweak
  • Harden data types by using SetaPDF_Core_Type_*::ensureType() calls.
  • Code style, doc-block optimizations and cleanup.
  • Optimized additional actions class to use indirect references (if possible) instead of direct action values.
  • Fixed class/interface inheritance of SetaPDF_Core_Writer_FileInterface.
  • Optimized SetaPDF_Core_Type_AbstractType::_ensureType() method to catch exceptions thrown by invalid indirect references.
  • Removed iterator cache instance in Outlines class.
  • Allow to remove ViewerPreferences entries when false is passed as a value.
  • Refactored encoding classes and interface.
  • Added various cached version of CMAP files.