A cross-platform and Open Source CLI utility for extracting plain text from many documents
SILVERCODERS DocToText is an open source and powerful command-line software that allows users to effortlessly convert various documents, in different formats, to plain text.

SILVERCODERS DocToText supports numerous Microsoft Word (DOC, DOCX), Microsoft Excel (XLS, XLSX, XLSB), Microsoft PowerPoint (PPT, PPTX), Rich Text Format (RTF), OpenDocument, OASIS text documents (ODT), OASIS spreadsheets (ODS), OASIS presentations (ODP), OASIS graphics (ODG), Office Open XML (OOXML, MSOOXML or OpenXML), iWork formats (NUMBERS, PAGES, KEYNOTE), OpenDocument Flat XML formats (FODS, FODP, FODT), Email files (EML), HyperText Markup Language (HTML), and Portable Document Format (PDF).

It is a cross-platform software that has been successfully tested under Linux, Microsoft Windows and Mac OS X operating systems. Both 32-bit and 64-bit architectures are supported at this time.

last updated on:
January 8th, 2014, 8:50 GMT
developed by:
license type:
GPL (GNU General Public License) 
ROOT \ Text Editing&Processing \ Others


In a hurry? Add it to your Download Basket!

user rating 22



1 Screenshot
SILVERCODERS DocToText - Usage example of SILVERCODERS DocToText from the command-line
What's New in version 0.14.0
  • DocToText version 0.14.0 was oficially released today.
  • HyperText Markup Language (HTML) format support was introduced in this version.
  • The possibility to retrieve metadata like document author, last modification date or number of pages was added.
  • The new important feature is extracting text from annotations (comments) embedded in odt, doc, docx or rtf files. Some malfunctions were also fixed.
read full changelog

Add your review!