Description of freeware (Last Update: 27 May 2009:
docx2txt is a perl based utility to convert Microsoft Office(Tm) Docx documents to equivalent Text documents. Current version supports following features during text extraction.
- horizontal ruler, line breaks, paragraphs separation, tabs
- capitalisation of text blocks.
- character conversions (" ' < & > - ... etc.). Euro character is converted to E, however you can change this behaviour by comment/uncomment in perl script.
- naive nested list formatting - assumed 8 level nesting, however you can handle even deeper nesting by commenting/uncommenting appropriate lines in perl script.
- center and right justification of text fitting in a line of (adjustible) 80 columns.
- indicating hyperlinked text along with the hyperlink.