|
Savvy Repair for Microsoft Office
-
Version
1.0
Savvy Repair for Microsoft Office tries four methods
of repair of corrupt of Word DOCX, Excel XLSX and
PowerPoint PPTX files. DOCX, XLSX and PPTX files are
collections of mostly XML sub-files. All four methods
first try to repair the zip structure. The first 3
methods involve attempts to repair all the xml sub-
files. The repair is done by finding the first XML
error, truncating the file there and then using
xmllint to add the right tags to the end of the now
shorter file.
The first method is done with strict xml validation,
the 2nd with lax, the 3rd adding missing subfiles from
a blank file of the appropriate extension. The last
method is a data/text extraction one using
SilverCoder's DocxToText. No formatting is recovered.
|