Bug 55142 - Newlines incorrectly added within SDT between runs
Alias: None
Product: POI
Classification: Unclassified
Component: XWPF
Version: 3.10-dev
Hardware: PC All
Importance: P2 trivial
Target Milestone: ---
Assignee: POI Developers List
Depends on:
Reported: 2013-06-25 11:34 UTC by Tim Allison
Modified: 2013-06-25 14:32 UTC
0 users

[PATCH] (18.97 KB, patch)
2013-06-25 12:40 UTC, Tim Allison

Description Tim Allison 2013-06-25 11:34:41 UTC
TIKA-1130 demonstrates that a newline character is incorrectly inserted between runs within an SDT. I will submit a patch and a test shortly.

This is a cleanup of: https://issues.apache.org/bugzilla/show_bug.cgi?id=54849
Comment 1 Tim Allison 2013-06-25 12:40:37 UTC
Created attachment 30482

This issue appears to be limited to contiguous runs within something that isn't a paragraph (in TIKA-1130, the runs are in a cell). I added test cases to guarantee newline/tab behavior in contiguous runs within cells and in paragraphs going forward.
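
To illustrate the behavior described above, here is a minimal toy sketch in plain Java. It is not the attached patch and does not use POI's classes: `SdtTextSketch`, `Block`, `extractBuggy`, and `extractFixed` are hypothetical names, and splitting "BigCompany" into the two runs "Big" and "Company" is an assumption made only for the example. It shows the difference between ending every run with a newline and ending only the enclosing block with one.

```java
import java.util.Arrays;
import java.util.List;

// Toy model only: these types are hypothetical and are NOT POI classes.
final class SdtTextSketch {

    /** A block-level container (for example a cell or a paragraph) holding contiguous runs. */
    static final class Block {
        final List<String> runs;

        Block(List<String> runs) {
            this.runs = runs;
        }
    }

    /** Assumed faulty behavior: a newline is appended after every run. */
    static String extractBuggy(List<Block> blocks) {
        StringBuilder sb = new StringBuilder();
        for (Block block : blocks) {
            for (String run : block.runs) {
                sb.append(run).append('\n');
            }
        }
        return sb.toString();
    }

    /** Intended behavior: contiguous runs are concatenated; a newline only ends the block. */
    static String extractFixed(List<Block> blocks) {
        StringBuilder sb = new StringBuilder();
        for (Block block : blocks) {
            for (String run : block.runs) {
                sb.append(run);
            }
            sb.append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Assumed input: one block whose text is split across two runs.
        List<Block> blocks = Arrays.asList(new Block(Arrays.asList("Big", "Company")));
        // Newlines are escaped so the difference is visible on one line each.
        System.out.println(extractBuggy(blocks).replace("\n", "\\n"));
        System.out.println(extractFixed(blocks).replace("\n", "\\n"));
    }
}
```

With the faulty variant, a search for "BigCompany" in the extracted text fails because the word is split across two lines; with the intended variant the runs stay joined.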
Comment 2 Nick Burch 2013-06-25 13:15:24 UTC
Thanks Tim, patch applied in r1496458, and changelog updated for it in r1496461.
Comment 3 Tim Allison 2013-06-25 14:32:27 UTC
Thank you! I'll update the patch to TIKA-1130 this evening to include Ray's original test for "BigCompany" instead of just "Company."