22 Commits

Author SHA1 Message Date
Dustin Spicuzza
bc6ee96e1a Add Visio OOXML text extractor + tests
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1709361 13f79535-47bb-0310-9956-ffa450edef68
2015-10-19 06:26:57 +00:00
Dominik Stadler
2f0ceddc7f Apply patch from bug 57890 to add support for different datatypes in XSSFImportFromXML
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1703665 13f79535-47bb-0310-9956-ffa450edef68
2015-09-17 18:30:13 +00:00
Dominik Stadler
d390a9bf3f Enhance integration testing to apply OPCPackage tests to more file-types and also run XSSFReader in XSSFFileHandler
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1702801 13f79535-47bb-0310-9956-ffa450edef68
2015-09-13 19:05:40 +00:00
Dominik Stadler
e41a5e93d3 Adjust excludes after adjusting multiple HeaderFooterRecord instances
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1697602 13f79535-47bb-0310-9956-ffa450edef68
2015-08-25 08:32:41 +00:00
Andreas Beeker
e7c3db9ecc Common stress test for X/HSLF and a few bug fixes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1695628 13f79535-47bb-0310-9956-ffa450edef68
2015-08-13 00:10:11 +00:00
Dominik Stadler
e2244c83f8 Bug 56479: don't hardcode dcterms as namespace alias in the attribute, but expect the actual alias that is used in the corresponding element.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1695212 13f79535-47bb-0310-9956-ffa450edef68
2015-08-11 05:37:15 +00:00
Dominik Stadler
0387edd759 Bug 57484: Allow processing of non-OOXML core namespace packages
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1691821 13f79535-47bb-0310-9956-ffa450edef68
2015-07-19 12:11:19 +00:00
Nick Burch
2d71e80930 Add ignore - file has no content streams
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1681809 13f79535-47bb-0310-9956-ffa450edef68
2015-05-26 18:38:34 +00:00
Dominik Stadler
5a925d115d Adjust test to handle a file correctly which works for normal handling but fails expectedly in text-extraction testing.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1676902 13f79535-47bb-0310-9956-ffa450edef68
2015-04-30 05:50:49 +00:00
Nick Burch
6d5e376c88 Skip the new password protected file, and re-org a little bit some other failures
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1676874 13f79535-47bb-0310-9956-ffa450edef68
2015-04-29 22:07:43 +00:00
Nick Burch
ee6a6d6584 Add another file exclude, and hae the test print out which fails failed to make it quicker to spot in the jenkins failure email
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1675719 13f79535-47bb-0310-9956-ffa450edef68
2015-04-23 21:09:42 +00:00
Dominik Stadler
b39c87b884 Integration tests: Expect exception for old word documents and still run the text extraction for them. Also add executing HPSFPropertiesExtractor where possible
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1668483 13f79535-47bb-0310-9956-ffa450edef68
2015-03-22 21:47:44 +00:00
Nick Burch
1f3f88a299 Exclude the OOXML strict files for now, not yet supported, see bug #57699
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1666433 13f79535-47bb-0310-9956-ffa450edef68
2015-03-13 13:03:53 +00:00
Nick Burch
ac92165d20 No support as yet for the OOXML visio files, so have these ignored
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1665971 13f79535-47bb-0310-9956-ffa450edef68
2015-03-11 19:08:59 +00:00
Nick Burch
8f8886986a Fix inconsistent indents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1665970 13f79535-47bb-0310-9956-ffa450edef68
2015-03-11 19:07:13 +00:00
Dominik Stadler
76307fe94b * Add text-extraction verification to integration-tests via a new abstract base FileHandler
* Fix NullPointerException found in some documents when running against the test-data
* Add support for extracting text from Dir-Entries WORKBOOK and BOOK to support some old/strangely formatted XLS files.

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1662652 13f79535-47bb-0310-9956-ffa450edef68
2015-02-27 09:59:14 +00:00
Nick Burch
ffe2581d6d Another exclude
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1651512 13f79535-47bb-0310-9956-ffa450edef68
2015-01-13 21:48:43 +00:00
Dominik Stadler
3b4ab5e30b Extract ROOT_DIR into constant, ignore .csv files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1649124 13f79535-47bb-0310-9956-ffa450edef68
2015-01-02 21:07:14 +00:00
Andreas Beeker
b91e480006 Charset.forName() for known encodings makes catching UnknownEncodingException obsolete
Unify UTF-16LE conversion to StringUtil
BugFix for RecordInputStream.readFully in combination with continuing records
BugFix for integration tests - fix pathname for handler/exclude lookup on windows

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1648032 13f79535-47bb-0310-9956-ffa450edef68
2014-12-27 01:33:28 +00:00
Dominik Stadler
70d7ec0e69 One more change in file-pattern to actually exclude .svn files
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1647941 13f79535-47bb-0310-9956-ffa450edef68
2014-12-26 07:51:46 +00:00
Dominik Stadler
b33735cf08 Exclude .svn sub-dirs as it seems Apache Jenkins still runs with Subversion <=1.6
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1647939 13f79535-47bb-0310-9956-ffa450edef68
2014-12-26 07:04:34 +00:00
Dominik Stadler
8daac21f87 Add a test-suite which performs integration/stress tests which load and handle all stored test files in various ways.
It works by using handlers for each type of file which perform various operations on the files, e.g. loading,
iterating content, modify, ... This will trigger changes which break working with the available test-files and
thus provides another layer of regression testing which hopefully prevents some failures from making it into
releases.

It is runnable via a new ant-target 'test-integration' and also added to the jenkins-target.

git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1647885 13f79535-47bb-0310-9956-ffa450edef68
2014-12-25 09:16:44 +00:00