SILVERCODERS DocToText 4.0 Build 1512

A cross-platform and Open Source CLI utility for extracting plain text from many documents
SILVERCODERS DocToText is a free, open source, multi-platform command-line utility that converts one or more documents, in a variety of file formats, to plain text.

Supports numerous file formats

The application supports numerous file formats, including Microsoft Word (DOC, DOCX), Microsoft Excel (XLS, XLSX, XLSB), Microsoft PowerPoint (PPT, PPTX), Rich Text Format (RTF), OpenDocument, OASIS text documents (ODT), MSOOXML or OpenXML, OpenOffice.org XML (OOXML), OASIS spreadsheets (ODS), OASIS presentations (ODP).

In addition, OASIS graphics (ODG), iWork formats (NUMBERS, PAGES, KEYNOTE), OpenDocument Flat XML formats (FODS, FODP, FODT), email files (EML), HyperText Markup Language (HTML) and Portable Document Format (PDF) are also supported by SILVERCODERS DocToText.
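
When processing a whole folder, it can help to pick out only the files whose extensions appear in the lists above. The following is a minimal Python sketch, assuming the file extension reliably reflects the format. The names SUPPORTED_EXTENSIONS and is_supported are hypothetical helpers for illustration and are not part of DocToText.

```python
from pathlib import Path

# Extensions taken from the format lists in this review (lower-cased).
# This set is an illustration, not an official list from the developer.
SUPPORTED_EXTENSIONS = {
    ".doc", ".docx", ".xls", ".xlsx", ".xlsb", ".ppt", ".pptx", ".rtf",
    ".odt", ".ods", ".odp", ".odg", ".numbers", ".pages", ".keynote",
    ".fods", ".fodp", ".fodt", ".eml", ".html", ".pdf",
}


def is_supported(path):
    """Return True if the file extension is one of the listed formats."""
    # Compare case-insensitively so "REPORT.DOCX" matches ".docx".
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS
```

For example, is_supported("report.DOCX") is true, while is_supported("photo.png") is false.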

Command-line options

As mentioned, this is a command-line utility, which means that you can’t interact with it through a graphical user interface (GUI), only from a terminal or command prompt. After you’ve extracted the binary archive that corresponds to your computer’s hardware architecture, type the “sh doctotext.sh” command to view its command-line options.

From there, you can tell the utility to try parsing a file as an RTF, ODF, OOXML, XLS, XLSB, iWork, PPT, DOC, HTML, PDF, EML or ODFXML document first. Other options fix corrupted XML files, strip XML tags instead of parsing them, use a specific command to unzip files from archives instead of the built-in decompression utility, and write logs to a specified file.
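
As a rough illustration, the launcher can also be driven from a script. The Python sketch below assumes that doctotext.sh accepts the document path as its final argument and prints the extracted text to standard output. That calling convention, the launcher path and the helper names build_command and extract_text are assumptions made for illustration, so check the tool's own help output before relying on them. No option flags are shown because their exact spelling is not given in this review.

```python
import subprocess


def build_command(document, launcher="doctotext.sh"):
    """Build the argument list for one conversion (assumed calling convention)."""
    # Mirrors the "sh doctotext.sh" invocation mentioned in the review, with
    # the document path appended as the final argument (an assumption).
    return ["sh", launcher, str(document)]


def extract_text(document, launcher="doctotext.sh"):
    """Run the launcher and return whatever it writes to standard output."""
    result = subprocess.run(
        build_command(document, launcher),
        capture_output=True,  # collect stdout and stderr instead of printing
        text=True,            # decode the output as text rather than bytes
        check=True,           # raise CalledProcessError on a non-zero exit
    )
    return result.stdout
```

Keeping the command construction in its own function makes it easy to inspect or log the exact invocation before anything is executed.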

Supported operating systems and platforms

SILVERCODERS DocToText has been designed from the outset as cross-platform software, and its binary archives are started through a shell launcher script (doctotext.sh). It has been successfully tested with some of the most popular GNU/Linux distributions, as well as with the Microsoft Windows and Mac OS X operating systems. Both 64-bit and 32-bit hardware platforms are supported at this time.

Last updated on October 10th, 2014


price: FREE!
developed by: SILVERCODERS
homepage: silvercoders.com
license type: GPL (GNU General Public License)
category: ROOT \ Text Editing&Processing \ Others

softpedia rating: 4.0/5
user rating (24): 4.0/5
What's New in version 0.14.0
  • DocToText version 0.14.0 was officially released today.
  • HyperText Markup Language (HTML) format support was introduced in this version.
  • The possibility to retrieve metadata like document author, last modification date or number of pages was added.
  • The important new feature is text extraction from annotations (comments) embedded in ODT, DOC, DOCX or RTF files. Some malfunctions were also fixed.