Bug 61798

Summary: Corrupted Excel file produced due to wrong dimension calculation during XSSFWorkbook.write(str) call
Product: POI Reporter: Marek <marek.zaplata>
Component: XSSFAssignee: POI Developers List <dev>
Status: RESOLVED FIXED    
Severity: major CC: dolphin.in.the.sky.51, jpstotz, richard.evans
Priority: P2    
Version: 3.17-FINAL   
Target Milestone: ---   
Hardware: PC   
OS: All   
Bug Depends on: 53611    
Bug Blocks:    
Attachments: Corrupted file generated by JUnit test attached

Description Marek 2017-11-22 13:33:09 UTC
Created attachment 35546 [details]
Corrupted file generated by JUnit test attached

Fix introduced for bug 53611 is causing another bug which produce a corrupted Excel file.

POI will generate corrupted file if dimension end column will be the last possible column (number 16384 /column XFD/).

Dimension calculation algorithm introduced in revision:
https://svn.apache.org/viewvc?view=revision&revision=1767096
has a bug, which sets dimention end cell value too big (+1 column).

This bug will lead to corrupted file when last cell will be in use.

Following test:
https://svn.apache.org/viewvc/poi/trunk/src/ooxml/testcases/org/apache/poi/xssf/usermodel/TestXSSFBugs.java?r1=1767096&r2=1767095&pathrev=1767096
has wrong assertion. Cell number 7 is H (not I) column.

Test code to easy reproduce this bug (corrupted Excel file):

@Test
public void test53611_bug() throws IOException {
    Workbook wb = new XSSFWorkbook();
    Sheet sheet = wb.createSheet("test");
    Row row = sheet.createRow(1);
    Cell cell = row.createCell(1);
    cell.setCellValue("blabla");

    row = sheet.createRow(4);
	// Allowable column range for EXCEL2007 is (0..16383) or ('A'..'XDF')
    cell = row.createCell(16383);
    cell.setCellValue("blabla");

    // we currently only populate the dimension during writing out
    // to avoid having to iterate all rows/cells in each add/remove of a row or cell
    //OutputStream str = new FileOutputStream("/tmp/53611_bug.xlsx");
    OutputStream str = new ByteArrayOutputStream();
    try {
        wb.write(str);
    } finally {
        str.close();
    }

	// Expected :B2:XFD5
	// Actual   :B2:XFE5   <-- which is out of Excel range. This will produce corrupted Excel file
    assertEquals("B2:XFD5", ((XSSFSheet)sheet).getCTWorksheet().getDimension().getRef());

    wb.close();
}

Corrupted file generated by this test also attached.
Comment 1 dolphin.in.the.sky.51 2017-12-06 12:39:37 UTC
Excuse my poor English.

I am applying the patch of the following contents.


in org.apache.xssf.usermodel.XSSFSheet

protected write(OutputStream out) throws IOException {
   ...
  
   // then calculate min/max cell-numbers for the worksheet-dimension
    if(row.getFirstCellNum() != -1) {
//     maxCell = Math.min(maxCell, row.getLastCellNum());
       maxCell = Math.min(maxCell, row.getLastCellNum()-1);
    }

Row.getLastCellNum() is return the index of the last cell contained in this row PLUS ONE. 

I would be pleased if it will be helpful.
Comment 2 dolphin.in.the.sky.51 2017-12-06 12:50:44 UTC
I made a mistake in patch code.

not "row.getFirstCellNum()"
Correctly,"row.getLastCellNum()"

Sorry.
Comment 3 Dominik Stadler 2017-12-28 08:47:06 UTC
Fixed with r1819404, now the dimension should be calculated correctly.
Comment 4 Jan Peter Stotz 2018-06-27 08:07:58 UTC
*** Bug 62490 has been marked as a duplicate of this bug. ***
Comment 5 Dominik Stadler 2018-09-04 20:10:15 UTC
*** Bug 62673 has been marked as a duplicate of this bug. ***