PDF to Text : Batch Extract Text from PDF files 4+

RootRise Technologies Pvt. Ltd.

    • 5.0 • 1 Rating
    • Free

Screenshots

Description

PDF to Text is a fantastic utility to batch convert PDF documents into text formats. PDF to Text extracts text contents of PDF document into Plain Text UTF8 and UTF16 format (.txt), Microsoft Word format (.doc and .docx), Open Document format (.odt), Rich Text format (.rtf). PDF to Text preserves accurate layouts of original PDF files and also preserves text formatting for rich text conversion.

FEATURES:

◆ Batch processing.

◆ Converts password-protected PDF documents.

◆ Converts PDF documents into Text files, Unicode, RTF, Microsoft® Word and Open Document Text formats.

◆ Add PDF files into conversion list recursively from subfolders.

◆ An advance option to maintain source directory hierarchy at destination with converted text files in respective folders. While conversion PDF to Text automatically creates intermediate directories of input PDF path in target folder with converted text files.

◆ Very flexible options to add files. Simply Drag & Drop into list or Right click Open With in Finder or Drop on "PDF to Text" application to add files for convert beside direct Add File/Folder buttons.


NOTE : PDF to Text do not support Optical Character Recognition (OCR) to process PDF documents containing scanned or faxed raster images. PDF to Text do not convert vector graphics and raster images.

What’s New

Version 1.1

- Fine tuning with upcoming release of OS X
- Added support to Finder like sorting of PDF files.
- Other improvements

Ratings and Reviews

5.0 out of 5
1 Rating

1 Rating

simbasounds ,

Wow.. fast and accurate

I had a large number of pdf documents for conversion to text file format.
There were around 1000 ranging in length from article to book
and roughly split evenly along those lines, assuming a threshold of 40 000 words between article and book.

I had tried Acrobat Pro, first with a straight conversion, which failed in many cases.
Then I tried running full accessibility checks and fixes (including OCR).
Then I attempted the conversion again which also failed in many cases.

Acrobat - from Adobe, the creator of the pdf format - was the slowest.
On a single large document it could take up to an hour to process.
It was also the least accurate, and most prone to failure.
I did try tweaking various settings, but to no avail.

From there I tried Calibre, first on the unprocessed docs, then on the accessibility-processed ones.
I normally use the “Heuristic Processing” setting to clean up pdf conversions.
Calibre was a bit faster, a bit more accurate, and more successful.

Still, out of the original 1000 there were 70 documents that wouldn’t convert.
They were also a evenly splt between articles and books.
I don’t know how many were accessibility-processed by Acrobat, but I think most were un-processed.

So I dropped them into PDF to Text.app, and it did all of them in 10 minutes!
I haven’t gone over each one, but the 20 or so I looked at were more accurate and neater by far than either Acrobat or Calibre.

App Privacy

The developer, RootRise Technologies Pvt. Ltd., indicated that the app’s privacy practices may include handling of data as described below.

Data Not Collected

The developer does not collect any data from this app.

Privacy practices may vary based on, for example, the features you use or your age. Learn More

More By This Developer

Winmail Reader Lite
Productivity
PDFOptim
Productivity
Winmail Reader
Business
XPSView
Productivity
PDF to JPG - Converter
Productivity
JPG to PDF
Productivity