Tim's PHP Scripts
Tim's PHP scripts
I have developed a number of PHP scripts for processing Microsoft Word 'DOCX' files. So far the following scripts are available.
If anyone has any questions about these PHP scripts or has found a bug or discovered a missing feature that should be added for any of these scripts, please send me a message.
wordtohtml
This will convert a Word DOCX file to html and display the resultant code (including images) in a web page. It will cope with virtually all word formating except tabs and Word Art.
There is also a test/demonstration pages where you can try out the script with one of your own Word documents.
docxtoimg
This will extract all the images from a Word DOCX document and save them.
docxtoxml
The DOCX files is actually a zip file which contains a number of xml files. This will extract and save separately all the XML files from a Word DOCX document
docxtotext
This class will extract all the text from a Word DOCX document and save it a text file.