How can I verify the integrity of common document types?

4 votes
1 answer
447 views
Are there any utilities to verify the integrity of common document types used in an office environment (docx, pptx, xlsx, pdf, common image, audio and video files) without using the actual applications?

The contents of some of our documents turned out to be damaged due to storage issues, so I need to sweep the whole filesystem, verify all the files, and report which of them are actually affected. The damage can be anywhere in a file, so just verifying file headers (as the *file* command does) or checking file properties (length, dates, etc.) won't be good enough. I need a tool which opens each file, verifies the integrity of its internal structure, and reports the result either on stdout or through an exit code. A solution which does the whole job and produces a final report would of course be fine as well, if one already exists. It would save me a lot of scripting work. ;-)

I assume random technical faults, not deliberate tampering, so I do not need to verify the actual contents of the documents. What I need is to identify the files which would probably fail to open in their respective apps.

Looking forward to receiving your suggestions.
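To illustrate the kind of check meant here: docx, pptx and xlsx files are ZIP containers, so Python's standard `zipfile` module can read every member and verify its CRC-32. The following is only a minimal sketch, assuming that a clean read of the container is a reasonable proxy for "would open in the app"; the names `check_ooxml` and `report` are made up for this example, and it does not cover pdf, image, audio or video files.

```python
import os
import zipfile

# ZIP-based Office formats handled by this sketch (assumption: nothing else is checked).
OOXML_EXTENSIONS = {".docx", ".pptx", ".xlsx"}


def check_ooxml(path):
    """Return None if the ZIP container reads cleanly, else a short error string."""
    try:
        with zipfile.ZipFile(path) as archive:
            # testzip() reads every member and checks its CRC-32;
            # it returns the name of the first bad member, or None.
            bad_member = archive.testzip()
    except Exception as exc:
        # Corruption can surface as BadZipFile, zlib.error, OSError, etc.
        return f"unreadable container: {exc}"
    if bad_member is not None:
        return f"corrupt member: {bad_member}"
    return None


def report(root):
    """Walk `root`, print one line per damaged file, return the number found."""
    damaged = 0
    for folder, _dirs, files in os.walk(root):
        for name in files:
            if os.path.splitext(name)[1].lower() not in OOXML_EXTENSIONS:
                continue
            path = os.path.join(folder, name)
            error = check_ooxml(path)
            if error is not None:
                damaged += 1
                print(f"DAMAGED\t{path}\t{error}")
    return damaged
```

Calling `report()` on the root of the share prints one tab-separated line per damaged file on stdout, and its return value can be turned into an exit code, e.g. `sys.exit(1 if report(root) else 0)`. A file that passes this check may still contain malformed XML, so this covers only the container level.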
Asked by Jasio (634 rep)
Sep 12, 2024, 11:55 AM
Last activity: Sep 12, 2024, 12:43 PM