'Apache Tika Parsers error for old MS document formats
We use Apache Tika Parsers for document validation and after updating to version 2.3.0 our regression tests are failing for older MS document types (ppt, doc, pub, etc.) with the following error thrown by tika.detect(): java.lang.NoSuchFieldError: WORKBOOK_DIR_ENTRY_NAMES.
It is true that these document types are old, but they can still be opened with their corresponding apps, so I believe they should be properly validated by Tika Parsers.
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|
