SILVERCODERS DocToText 4.0 Build 1512
Supports numerous file formats
The application supports numerous file formats, including Microsoft Word (DOC, DOCX), Microsoft Excel (XLS, XLSX, XLSB), Microsoft PowerPoint (PPT, PPTX), Rich Text Format (RTF), OpenDocument, OASIS text documents (ODT), MSOOXML or OpenXML, OpenOffice.org XML (OOXML), OASIS spreadsheets (ODS), OASIS presentations (ODP).
In addition, the OASIS graphics (ODG), iWork formats (NUMBERS, PAGES, KEYNOTE), OpenDocument Flat XML formats (FODS, FODP, FODT), Email files (EML), HyperText Markup Language (HTML) and Portable Document Format (PDF) are also supported by SILVERCODERS DocToText.
As mentioned, this is a command-line utility, which means that you can’t interact with it through a pretty graphical user interface (GUI), but only via an X11 terminal emulator. Type the “sh doctotext.sh” command, after you’ve extracted the binary archive that corresponds to your computer’s hardware architecture, to view its command-line options.
From there, the user can try to parse the file that he/she tries to convert as RTF, ODF, OOXML, XLS, XLSB, iWork, PPT, DOC, HTML, PDF, EML or ODFXML documents first, fix corrupted XML files, strip XML tags instead of parsing them, use a specific command to unzip files from archives, instead of using the built-in decompression utility, as well as to write logs to a specified file.
Supported operating systems and platforms
SILVERCODERS DocToText has been designed from the offset as a cross-platform software written in the UNIX Shell programming language, which means that it has been successfully tested with some of the most popular GNU/Linux distributions, as well as with the Microsoft Windows and Mac OS X operating systems. Both 64-bit and 32-bit hardware platforms are supported at this time.
Reviewed by Marius Nestor, last updated on October 10th, 2014
In a hurry? Add it to your Download Basket!
- DocToText version 0.14.0 was oficially released today.
- HyperText Markup Language (HTML) format support was introduced in this version.
- The possibility to retrieve metadata like document author, last modification date or number of pages was added.
- The new important feature is extracting text from annotations (comments) embedded in odt, doc, docx or rtf files. Some malfunctions were also fixed.
Application descriptionSILVERCODERS DocToText is a freely distributed, universal, portable and open-source command-line software that allows ...