is a text extraction tool for WinXP/2000 that automates the conversion
of Adobe PDF documents to text files. The PDFs may be on
local drives, network drives, or on the Internet.
files are great for exchanging formatted documents between people who don't
use the same software. But sometimes we need to be able to take the text
out of a PDF file and use it in Web pages, word processing documents, PowerPoint
presentations, desktop publishing software, search and indexing applications,
or in content management systems.
access to the text content in PDF documents without requiring any Adobe
product. The extracted content is saved to text files where it
can be easily searched, archived, repurposed, and managed. A console
version is included with the application for script or batch file execution.
An ActiveX DLL version is available here
for application developers.
TEXTfromPDF is not a print driver or Optical Character Recognition (OCR)
is not able to process graphics or scanned documents.
matters most to you? Speed or Accuracy?
TEXTfromPDF offers both using multiple extraction engines optimized
for different purposes.
the fastest text extractor available on the market, TEXTfromPDF's "Simple
Formatting" option can process hundreds of PDFs in seconds. This
option utilizes an extraction engine written in Assembly Language.
Programming code written in this language executes at blazing speeds not
normally attainable by code written in other languages.
here to see how TEXTfromPDF and our competitors fared in a head
to head performance test.
faithful reproduction of the original PDF layout is required, TEXTfromPDF
can provide amazingly accurate conversion results. This extraction engine
has been refined over many years to produce text files that are as close
to the original PDF layout as possible.