Text Layout

TEXTfromPDF can format the extracted text in three different ways:

1) Simple Formatting
2) Original Layout
3) Reading Order

Depending on how you intend to use the extracted text, the formatting may or may not be important to you. For example, if the text will only be indexed by a search engine or a database, it may not need to be formatted. If, on the other hand, a person will work with the contents of the text file by hand, it should probably be formatted. Formatting organizes the text visually so that it is easier for a human to work with.

Simple Formatting

Select this option if you do not need the extracted text to be formatted precisely. This is the fastest extraction method, because the PDF's formatting requires less attention during extraction and the output text needs only minimal formatting.

Original Layout

Select this option if you want the extracted text to retain the same formatting as the original document. For example, if the original document has multiple columns of text, the text file will also present the text in columns.

Reading Order

Select this option if you want the extracted text to be formatted in standard left-to-right, top-to-bottom "reading order".
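
The difference between Original Layout and Reading Order is easiest to see on a two-column page. The Python sketch below is only an illustration and is not how TEXTfromPDF works internally. The fragment positions, the function names, and the column_boundary parameter are all hypothetical, and it assumes that "reading order" means each column is read top to bottom before moving on to the next column.

```python
from collections import defaultdict

# Hypothetical positioned fragments: (column, row, text).
# Positions are in character cells to keep the example small;
# a real PDF stores positions in points.
FRAGMENTS = [
    (0, 0, "Left one"),
    (20, 0, "Right one"),
    (0, 1, "Left two"),
    (20, 1, "Right two"),
]


def original_layout(fragments):
    """Keep each fragment at its horizontal position, so columns stay side by side."""
    rows = defaultdict(list)
    for x, y, text in fragments:
        rows[y].append((x, text))
    lines = []
    for y in sorted(rows):
        line = ""
        for x, text in sorted(rows[y]):
            # Pad with spaces up to the fragment's starting column.
            line = line.ljust(x) + text
        lines.append(line)
    return "\n".join(lines)


def reading_order(fragments, column_boundary):
    """Emit the left column top to bottom, then the right column (an assumed definition)."""
    left = [(y, text) for x, y, text in fragments if x < column_boundary]
    right = [(y, text) for x, y, text in fragments if x >= column_boundary]
    return "\n".join(text for _, text in sorted(left) + sorted(right))


if __name__ == "__main__":
    print(original_layout(FRAGMENTS))
    print()
    print(reading_order(FRAGMENTS, column_boundary=20))
```

With this sample data, the first function keeps "Left one" and "Right one" on the same line, separated by spaces, while the second lists both left-column lines before either right-column line.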