Skip to content

Test and fix extractors for Opendocument and OpenXML Spreadsheets and Presentations

Both file formats lacked any testcases for spreadsheets (.ods, .xlsx), and OfficeXML presentations (.pptx).

Both OpenXML document types also included a significant amount of garbage in the extracted fulltext.

Merge request reports