Docvert web service software takes multiple word processor files (typically .doc) and converts them to Oasis OpenDocument v1.0 format, and then optionally runs them through an XML pipeline. The result is returned in a .zip file.
Docvert builds upon OpenOffice.org because it has the best chance of dealing with the vagaries of the MS Word format.
The resulting XML is themable with XML/XSLT Pipelines, which supports breaking up content over headings or sections, multiple serializations and pipeline branching.
Docvert is easy to integrate as it uses a simple REST-style interface, and it's released under the LGPL so although it's open source there's no legal problems developing proprietary software ontop of it.
Right now I'm looking to bundle themes, so if you're an XSLT programmer send in your themes and I'll include them in the next release. Oh, and any patches or suggestions would be appreciated!
- OpenOffice 1.9.122 or later. The more current your version of OpenOffice.org the more likely it'll be compatible with MS Word.
- PHP 5.0 or later with Zlib and XSLT.
- And if you're running a *nix, OOo will need Xvfb (already bundled with most distros).
What's New in This Release: [ read full changelog ]
· This second release of the port to Python now has feature parity with the earlier PHP version, but includes superior list aggregation, improved DocBook XML generation, a simpler UI, and the beginnings of HTML5 document generation.