SILVERCODERS DocToText 4.0 Build 1512

A cross-platform and Open Source CLI utility for extracting plain text from many documents

  Add it to your Download Basket!

 Add it to your Watch List!

0/5

Rate it!

What's new in SILVERCODERS DocToText 0.14.0:

  • DocToText version 0.14.0 was oficially released today.
  • HyperText Markup Language (HTML) format support was introduced in this version.
  • The possibility to retrieve metadata like document author, last modification date or number of pages was added.
  • The new important feature is extracting text from annotations (comments) embedded in odt, doc, docx or rtf files. Some malfunctions were also fixed.
Read full changelog
send us
an update
LICENSE TYPE:
GPL (GNU General Public License) 
OUR RATING:
4.0/5
DEVELOPED BY:
SILVERCODERS
USER RATING:
4.0/5 24
HOMEPAGE:
silvercoders.com
CATEGORY:
ROOT \ Text Editing&Processing \ Others
1 SILVERCODERS DocToText Screenshot:
SILVERCODERS DocToText - Usage example of SILVERCODERS DocToText from the command-line
SILVERCODERS DocToText is an open source, multi-platform, free and powerful command-line utility that allows you to effortlessly convert a single or multiple documents, in different file formats, to the Plain Text format.

Supports numerous file formats

The application supports numerous file formats, including Microsoft Word (DOC, DOCX), Microsoft Excel (XLS, XLSX, XLSB), Microsoft PowerPoint (PPT, PPTX), Rich Text Format (RTF), OpenDocument, OASIS text documents (ODT), MSOOXML or OpenXML, OpenOffice.org XML (OOXML), OASIS spreadsheets (ODS), OASIS presentations (ODP).

In addition, the OASIS graphics (ODG), iWork formats (NUMBERS, PAGES, KEYNOTE), OpenDocument Flat XML formats (FODS, FODP, FODT), Email files (EML), HyperText Markup Language (HTML) and Portable Document Format (PDF) are also supported by SILVERCODERS DocToText.

Command-line options

As mentioned, this is a command-line utility, which means that you can’t interact with it through a pretty graphical user interface (GUI), but only via an X11 terminal emulator. Type the “sh doctotext.sh” command, after you’ve extracted the binary archive that corresponds to your computer’s hardware architecture, to view its command-line options.

From there, the user can try to parse the file that he/she tries to convert as RTF, ODF, OOXML, XLS, XLSB, iWork, PPT, DOC, HTML, PDF, EML or ODFXML documents first, fix corrupted XML files, strip XML tags instead of parsing them, use a specific command to unzip files from archives, instead of using the built-in decompression utility, as well as to write logs to a specified file.

Supported operating systems and platforms

SILVERCODERS DocToText has been designed from the offset as a cross-platform software written in the UNIX Shell programming language, which means that it has been successfully tested with some of the most popular GNU/Linux distributions, as well as with the Microsoft Windows and Mac OS X operating systems. Both 64-bit and 32-bit hardware platforms are supported at this time.

SILVERCODERS DocToText was reviewed by , last updated on October 10th, 2014

#doc converter #doc to text #text converter #Plain Text #extractor #document #DOC

Add your review!

SUBMIT