catdoc icon

catdoc For Linux

2.6/5 18
GPL    

catdoc is program which reads one or more Microsoft word files and outputs text.. #Microsoft Word file  #Output text  #Document reader  #Catdoc  #Word  #File  

Description

Free Download

catdoc is program which reads one or more Microsoft word files and outputs text, contained insinde them to standard output. Therefore it does same work for .doc files, as unix cat command for plain ASCII files.

catdoc project is now accompanied by xls2csv - program which converts Excel spreadsheet into comma-separated value file. Newest addition to catdoc suite is catppt - program, which extracts readable text from the PowerPoint files.

Optionaly, catdoc is able to translate some non-ASCII chars into correspoindig TeX escape sequences and convert charsets from Windows ANSI codepage or unicode to local codepage of target machine.

It also have database of substitution sequences which are used for symbols which are not present in the target encoding. So if you are trying to read Russian word file under C locale, you'll get a transliteration.

Under Unix it uses nl_langinfo function to find out which output encoding to use, under DOS it uses appropriate DOS function, which gets codepage value from the COUNTRY statement in config.sys.

catdoc is also able to read RTF files and even plain text, so it can be used as general-purpose encoding convertor. (Because catdoc is russian program, by default it converts cp1251 to koi8-r, when running under UNIX and to cp866 when running under DOS.

Catdoc has rudimentary table handling. In TeX mode it inserts & when encounters field delimiter and when encounters end of table row. No table headers are produced although.

Catdoc doesn't even try to preserver MS-Word character formatting. It's goal is to extract plain text and allow you to read it and, probably, reformat with TeX, according to TeXnical rules, most Word users haven't even heard about.

xls2csv does roughly same for Excel files. It extracts data and leaves out any formatting info and formulas. Concept is that you want to see data, not the way it was created.

There is tcl/tk GUI script wordview which provides GUI for viewing Word and RTF files using catdoc. Since internal representation of Tcl string is utf-8 and most systems now have unicode fonts, you'll probably be able to read document in any language using this script.

catdoc 0.94.2

add to watchlist add to download basket send us an update REPORT
  runs on:
Linux
  filename:
catdoc-0.94.tar.gz
  main category:
Utilities
  developer:
  visit homepage

7-Zip 23.01 / 24.04 Beta

An intuitive application with a very good compression ratio that can help you not only create and extract archives, but also test them for errors
7-Zip

paint.net 5.0.13 (5.13.8830.42291)

Packed with an array of options and an intuitive interface, this application enables you to create professional-looking photographs
paint.net

Bitdefender Antivirus Free 27.0.35.146

Feather-light and free antivirus solution from renowned developer that keeps the PC protected at all times from malware without requiring user configuration
Bitdefender Antivirus Free

ShareX 16.0.1

Capture your screen, create GIFs, and record videos through this versatile solution that includes various other amenities: an OCR scanner, image uploader, URL shortener, and much more
ShareX

Windows Sandbox Launcher 1.0.0

Set up the Windows Sandbox parameters to your specific requirements, with this dedicated launcher that features advanced parametrization
Windows Sandbox Launcher

Zoom Client 6.0.0.37205

The official desktop client for Zoom, the popular video conferencing and collaboration tool used by millions of people worldwide
Zoom Client

4k Video Downloader 1.5.3.0080 Plus / 4.30.0.5655

Export your favorite YouTube videos and playlists with this intuitive, lightweight program, built to facilitate downloading clips from the popular website
4k Video Downloader

Microsoft Teams 24060.3102.2733.5911 Home / 1.7.00.7956 Work

Effortlessly chat, collaborate on projects, and transfer files within a business-like environment by employing this Microsoft-vetted application
Microsoft Teams

IrfanView 4.67

With support for a long list of plugins, this minimalistic utility helps you view images, as well as edit and convert them using a built-in batch mode
IrfanView

calibre 7.9.0

Effortlessly keep your e-book library thoroughly organized with the help of the numerous features offered by this efficient and capable manager
calibre

% discount
Microsoft Teams
  • Microsoft Teams
  • IrfanView
  • calibre
  • 7-Zip
  • paint.net
  • Bitdefender Antivirus Free
  • ShareX
  • Windows Sandbox Launcher
  • Zoom Client
  • 4k Video Downloader
essentials


User Comments
This enables Disqus, Inc. to process some of your data. Disqus privacy policy