Extract text, glyphs, words and metrics from PDF documents with PHP

SetaPDF-Extractor

Extract text, glyphs, words and metrics from PDF documents with PHP

Demos

On this pages we want to demonstrate the SetaPDF-Extractor component in action. All demos will show you the PHP code that was used to create the output.

Extract Plain Text

Extract plain text from a PDF document.

Get Words

Get words and their bounding boxes from PDF documents.

Mark Words

Mark or highlight all words on a specific PDF page.

Extract Words By a Specific Location

Use a rectangle filter to limit the result to a specific area.

Phrase Search

Create a phrase search with the SetaPDF-Extractor component.

Count Words

Count words in a PDF document with PHP.

Get Word Groups

Get words grouped by visible entities.

Mark Word Groups

Mark word groups of visible entities.

...more demos

See more live demos in our demo package, which is shipped with the products.