How to identify doc, ppt, xls files
- by Shelby. S
So I was wondering how would you differentiate ppt, xls and doc files from each other in linux regardless of extensions. I tried 'file' but from the looks of it, all of MSOffice files are categorized under the same file type. Similarly I'm having trouble with docx, xlsx and pptx files, since they're essentially all zip files containing a bunch of xml.
Thank you for your help!
P.S. I also tried a python script importing the magic module, but no go.