Bug 55142

Summary: Newlines incorrectly added within SDT between runs
Product: POI
Reporter: Tim Allison <tallison>
Component: XWPF
Assignee: POI Developers List <dev>
Severity: trivial    
Priority: P2    
Version: 3.10-dev   
Target Milestone: ---   
Hardware: PC   
OS: All   
Attachments: [PATCH]

Description Tim Allison 2013-06-25 11:34:41 UTC
TIKA-1130 demonstrates that a newline character is incorrectly inserted between runs within an SDT. I will submit a patch and a test shortly.

This is a cleanup of: https://issues.apache.org/bugzilla/show_bug.cgi?id=54849
Comment 1 Tim Allison 2013-06-25 12:40:37 UTC
Created attachment 30482

This issue appears to be limited to contiguous runs within a container that isn't a paragraph (in TIKA-1130, the runs are in a cell). I added test cases to guarantee newline/tab behavior for contiguous runs within cells and within paragraphs going forward.
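
For illustration only, here is a minimal sketch of the symptom. It does not use the POI API and is not the attached patch. The class and method names are hypothetical, and the split of the text into the runs "Big" and "Company" is an assumption made for the example.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical model of text extraction from contiguous runs.
// This is NOT POI code; it only mimics the behavior described above.
public class SdtRunJoinSketch {

    // Models the reported symptom: a newline is emitted after every run.
    static String joinRunsBuggy(List<String> runs) {
        StringBuilder sb = new StringBuilder();
        for (String run : runs) {
            sb.append(run).append('\n');
        }
        return sb.toString();
    }

    // Models the expected behavior: contiguous runs are concatenated as-is.
    static String joinRunsFixed(List<String> runs) {
        return String.join("", runs);
    }

    public static void main(String[] args) {
        // Assumed split of the text into two runs, for illustration only.
        List<String> runs = Arrays.asList("Big", "Company");

        // A check for "Company" alone passes even with the stray newline.
        System.out.println(joinRunsBuggy(runs).contains("Company"));
        // A check for the full string only passes once runs are joined directly.
        System.out.println(joinRunsBuggy(runs).contains("BigCompany"));
        System.out.println(joinRunsFixed(runs).contains("BigCompany"));
    }
}
```

Under these assumptions, an assertion on the full string spanning both runs catches the stray newline, while an assertion on a single run's text does not.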
Comment 2 Nick Burch 2013-06-25 13:15:24 UTC
Thanks Tim, patch applied in r1496458, and changelog updated for it in r1496461.
Comment 3 Tim Allison 2013-06-25 14:32:27 UTC
Thank you! I'll update the patch on TIKA-1130 this evening to include Ray's original test for "BigCompany" instead of just "Company".