Extract text, glyphs, words and metrics from PDF documents with PHP

SetaPDF-Extractor

Extract text, glyphs, words and metrics from PDF documents with PHP

Demos

On this pages we want to demonstrate the SetaPDF-Extractor component in action. All demos will show you the PHP code that was used to create the output.

Extract Plain Text

Extract plain text from a PDF document.

Get Words

Get words and their bounding boxes from PDF documents.

Mark Words

Mark or highlight all words on a specific PDF page.

Extract Words By a Specific Location

Use a rectangle filter to limit the result to a specific area.

Phrase Search

Create a phrase search with the SetaPDF-Extractor component.

Count Words

Count words in a PDF document with PHP.