The Ultimate Way to Parse Excel: WorkbookFactory

Posted by 奔跑的蜗牛


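Automated testing with Selenium inevitably involves working with Excel files.

Because of the different Excel versions, workbooks come with two file extensions: xls and xlsx.

In the past, the usual approach was to handle them separately with HSSFWorkbook and XSSFWorkbook: first determine the file type, then pick the reading method based on the file extension.

The handling looks roughly like this: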

String extension = getExtension(path);
if (!EMPTY.equals(extension)) {
    if (XLS.equals(extension)) {
        // Legacy binary format (.xls)
        return readXlsForAllSheets(path);
    } else if (XLSX.equals(extension)) {
        // OOXML format (.xlsx)
        return readXlsxForAllSheets(path);
    }
} else {
    System.out.println(path + " is not an Excel file.");
}
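The dispatch above depends on a helper that returns the file extension, which the post does not show. A minimal sketch of what such a helper might look like follows; the class name `ExtensionUtil` and its exact behavior (lower-casing the result, returning an empty string when there is no extension) are assumptions for illustration, not the author's code.

```java
import java.util.Locale;

final class ExtensionUtil {
    private ExtensionUtil() {
    }

    /**
     * Returns the lower-cased extension of the file name in path,
     * without the dot, or "" when the name has no extension.
     * Hypothetical helper, not part of the original post.
     */
    static String getExtension(String path) {
        if (path == null) {
            return "";
        }
        // Only the file name counts, so a dot in a directory name is ignored
        int lastSep = Math.max(path.lastIndexOf('/'), path.lastIndexOf('\\'));
        int lastDot = path.lastIndexOf('.');
        // No dot in the name, a leading dot (".hidden"), or a trailing dot: no extension
        if (lastDot <= lastSep + 1 || lastDot == path.length() - 1) {
            return "";
        }
        return path.substring(lastDot + 1).toLowerCase(Locale.ROOT);
    }
}
```

Under these assumptions, constants such as `XLS` and `XLSX` would hold the lower-case values `xls` and `xlsx`, so that a file named `report.XLSX` is still routed to the xlsx reader.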

Next, implement the two methods readXlsForAllSheets and readXlsxForAllSheets.

public Object[][] readXlsxForAllSheets(String path) throws IOException {
    System.out.println(path);
    List<Object[]> allRecords = new ArrayList<Object[]>();
    // try-with-resources closes the stream and the workbook even if reading fails
    try (FileInputStream is = new FileInputStream(path);
         XSSFWorkbook xssfWorkbook = new XSSFWorkbook(is)) {
        System.out.println("There are " + xssfWorkbook.getNumberOfSheets() + " sheets in the workbook.");
        // Read every sheet
        for (int numSheet = 0; numSheet < xssfWorkbook.getNumberOfSheets(); numSheet++) {
            XSSFSheet xssfSheet = xssfWorkbook.getSheetAt(numSheet);
            // Marker row that separates the records of one sheet from the next
            String[] separator = {"This is sheet " + xssfWorkbook.getSheetName(numSheet)};
            allRecords.add(separator);
            // Skip the first row (presumably a header) and read through the last row
            for (int rowNum = xssfSheet.getFirstRowNum() + 1; rowNum <= xssfSheet.getLastRowNum(); rowNum++) {
                XSSFRow xssfRow = xssfSheet.getRow(rowNum);
                if (xssfRow == null) {
                    continue; // rows that were never written come back as null
                }
                // getLastCellNum() returns -1 for a row without cells
                int cellCount = Math.max(xssfRow.getLastCellNum(), 0);
                String[] fields = new String[cellCount];
                for (int colNum = 0; colNum < cellCount; colNum++) {
                    fields[colNum] = this.getXssfCellValue(xssfRow.getCell(colNum));
                }
                allRecords.add(fields);
            }
        }
    }
    return allRecords.toArray(new Object[allRecords.size()][]);
}

 

public Object[][] readXlsForAllSheets(String path) throws IOException {
    System.out.println(PROCESSING + path);
    List<Object[]> allRecords = new ArrayList<Object[]>();
    // try-with-resources closes the stream and the workbook even if reading fails
    try (FileInputStream is = new FileInputStream(path);
         HSSFWorkbook hssfWorkbook = new HSSFWorkbook(is)) {
        System.out.println("There are " + hssfWorkbook.getNumberOfSheets() + " sheets in the workbook.");
        // Read every sheet
        for (int numSheet = 0; numSheet < hssfWorkbook.getNumberOfSheets(); numSheet++) {
            HSSFSheet hssfSheet = hssfWorkbook.getSheetAt(numSheet);
            // Marker row that separates the records of one sheet from the next
            String[] separator = {"This is sheet " + hssfWorkbook.getSheetName(numSheet)};
            allRecords.add(separator);
            // Skip the first row (presumably a header) and read through the last row
            for (int rowNum = hssfSheet.getFirstRowNum() + 1; rowNum <= hssfSheet.getLastRowNum(); rowNum++) {
                HSSFRow hssfRow = hssfSheet.getRow(rowNum);
                if (hssfRow == null) {
                    continue; // rows that were never written come back as null
                }
                // getLastCellNum() returns -1 for a row without cells
                int cellCount = Math.max(hssfRow.getLastCellNum(), 0);
                String[] fields = new String[cellCount];
                for (int colNum = 0; colNum < cellCount; colNum++) {
                    fields[colNum] = this.getHssfCellValue(hssfRow.getCell(colNum));
                }
                allRecords.add(fields);
            }
        }
    }
    return allRecords.toArray(new Object[allRecords.size()][]);
}

Then implement the getXssfCellValue and getHssfCellValue methods called by the two methods above.

 

// The int CELL_TYPE_* constants match POI 3.15; later releases use the CellType enum instead.
private String getXssfCellValue(XSSFCell xssfCell) {
    String cellvalue = "";
    DataFormatter formatter = new DataFormatter();
    if (null != xssfCell) {
        switch (xssfCell.getCellType()) {
        case XSSFCell.CELL_TYPE_NUMERIC: // numeric
            if (org.apache.poi.ss.usermodel.DateUtil.isCellDateFormatted(xssfCell)) {
                // Dates are rendered with the cell's own format
                cellvalue = formatter.formatCellValue(xssfCell);
            } else {
                double value = xssfCell.getNumericCellValue();
                // Whole numbers are printed without a trailing ".0"; long covers values beyond the int range
                long longValue = (long) value;
                cellvalue = value == longValue ? String.valueOf(longValue) : String.valueOf(value);
            }
            break;
        case XSSFCell.CELL_TYPE_STRING: // string
            cellvalue = xssfCell.getStringCellValue();
            break;
        case XSSFCell.CELL_TYPE_BOOLEAN: // boolean
            cellvalue = String.valueOf(xssfCell.getBooleanCellValue());
            break;
        case XSSFCell.CELL_TYPE_FORMULA: // formula: returns the formula text, not its result
            cellvalue = String.valueOf(xssfCell.getCellFormula());
            break;
        case XSSFCell.CELL_TYPE_BLANK: // blank
            cellvalue = "";
            break;
        case XSSFCell.CELL_TYPE_ERROR: // error
            cellvalue = "";
            break;
        default:
            cellvalue = "UNKNOWN TYPE";
            break;
        }
    } else {
        // Missing cell: print a placeholder to the console and return an empty string
        System.out.print("-");
    }
    return cellvalue.trim();
}

 

// The int CELL_TYPE_* constants match POI 3.15; later releases use the CellType enum instead.
private String getHssfCellValue(HSSFCell hssfCell) {
    String cellvalue = "";
    DataFormatter formatter = new DataFormatter();
    if (null != hssfCell) {
        switch (hssfCell.getCellType()) {
        case HSSFCell.CELL_TYPE_NUMERIC: // numeric
            if (org.apache.poi.ss.usermodel.DateUtil.isCellDateFormatted(hssfCell)) {
                // Dates are rendered with the cell's own format
                cellvalue = formatter.formatCellValue(hssfCell);
            } else {
                double value = hssfCell.getNumericCellValue();
                // Whole numbers are printed without a trailing ".0"; long covers values beyond the int range
                long longValue = (long) value;
                cellvalue = value == longValue ? String.valueOf(longValue) : String.valueOf(value);
            }
            break;
        case HSSFCell.CELL_TYPE_STRING: // string
            cellvalue = hssfCell.getStringCellValue();
            break;
        case HSSFCell.CELL_TYPE_BOOLEAN: // boolean
            cellvalue = String.valueOf(hssfCell.getBooleanCellValue());
            break;
        case HSSFCell.CELL_TYPE_FORMULA: // formula: returns the formula text, not its result
            cellvalue = String.valueOf(hssfCell.getCellFormula());
            break;
        case HSSFCell.CELL_TYPE_BLANK: // blank
            cellvalue = "";
            break;
        case HSSFCell.CELL_TYPE_ERROR: // error
            cellvalue = "";
            break;
        default:
            cellvalue = "UNKNOWN TYPE";
            break;
        }
    } else {
        // Missing cell: print a placeholder to the console and return an empty string
        System.out.print("-");
    }
    return cellvalue.trim();
}

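Only at this point is the Excel parsing feature complete, and it took four methods: readXlsForAllSheets, readXlsxForAllSheets, getXssfCellValue and getHssfCellValue. Is there a simpler, more practical way?

What follows is the WorkbookFactory class provided by the POI jars. You need to add poi-ooxml-3.15.jar to the build path.

With it, two lines are enough to instantiate a workbook, regardless of whether the file is xls or xlsx.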

            InputStream inStream = new FileInputStream(new File(filePath));
            Workbook workBook = WorkbookFactory.create(inStream);
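The file extension stops mattering because `WorkbookFactory.create(InputStream)` decides which implementation to use from the first bytes of the stream rather than from the file name. The sketch below illustrates that idea with the two well-known signatures (the OLE2 header used by xls, the ZIP header used by xlsx). It is a simplified stand-alone illustration with hypothetical names (`ExcelSniffer`, `Kind`), not POI's actual code.

```java
import java.io.BufferedInputStream;
import java.io.IOException;

final class ExcelSniffer {
    /** OLE2 suggests a candidate .xls file, ZIP a candidate .xlsx file. */
    enum Kind { OLE2, ZIP, UNKNOWN }

    // OLE2 compound document header, used by the binary .xls format
    private static final int[] OLE2_SIGNATURE = {0xD0, 0xCF, 0x11, 0xE0, 0xA1, 0xB1, 0x1A, 0xE1};
    // ZIP local file header ("PK" 03 04), used by OOXML containers such as .xlsx
    private static final int[] ZIP_SIGNATURE = {0x50, 0x4B, 0x03, 0x04};

    private ExcelSniffer() {
    }

    /** Peeks at the first 8 bytes, then resets the stream so it can still be parsed. */
    static Kind sniff(BufferedInputStream in) throws IOException {
        in.mark(8);
        byte[] header = new byte[8];
        int read = 0;
        while (read < header.length) {
            int n = in.read(header, read, header.length - read);
            if (n < 0) {
                break; // stream is shorter than 8 bytes
            }
            read += n;
        }
        in.reset();
        if (startsWith(header, read, OLE2_SIGNATURE)) {
            return Kind.OLE2;
        }
        if (startsWith(header, read, ZIP_SIGNATURE)) {
            return Kind.ZIP;
        }
        return Kind.UNKNOWN;
    }

    private static boolean startsWith(byte[] data, int length, int[] signature) {
        if (length < signature.length) {
            return false;
        }
        for (int i = 0; i < signature.length; i++) {
            if ((data[i] & 0xFF) != signature[i]) {
                return false;
            }
        }
        return true;
    }
}
```

A header match only narrows the candidates: other OLE2 or ZIP based documents share the same signatures, so a real implementation has to look further into the content.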

After that you can work with Sheet, Row and Cell directly, again without caring about the file type.
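As an illustration of working through the common Sheet, Row and Cell interfaces, the sketch below reads every sheet of a workbook into rows of strings with a single code path for both formats. It assumes the POI jars (poi and poi-ooxml) are on the classpath. The class and method names (`UnifiedExcelReader`, `readAllSheets`) are hypothetical, and `DataFormatter` is used here in place of a hand-written cell-type switch, which is a different choice from the helper methods shown earlier.

```java
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;

import org.apache.poi.ss.usermodel.DataFormatter;
import org.apache.poi.ss.usermodel.Row;
import org.apache.poi.ss.usermodel.Sheet;
import org.apache.poi.ss.usermodel.Workbook;
import org.apache.poi.ss.usermodel.WorkbookFactory;

final class UnifiedExcelReader {
    private UnifiedExcelReader() {
    }

    /**
     * Reads all rows of all sheets; works for both xls and xlsx content.
     * Declared with "throws Exception" because the checked exceptions of
     * WorkbookFactory.create differ between POI versions.
     */
    static List<String[]> readAllSheets(InputStream in) throws Exception {
        List<String[]> records = new ArrayList<String[]>();
        DataFormatter formatter = new DataFormatter();
        // WorkbookFactory returns an HSSFWorkbook or an XSSFWorkbook depending on the content
        try (Workbook workbook = WorkbookFactory.create(in)) {
            for (int s = 0; s < workbook.getNumberOfSheets(); s++) {
                Sheet sheet = workbook.getSheetAt(s);
                // Iterating a Sheet visits only the rows that physically exist
                for (Row row : sheet) {
                    // getLastCellNum() returns -1 for a row without cells
                    int cellCount = Math.max(row.getLastCellNum(), 0);
                    String[] fields = new String[cellCount];
                    for (int c = 0; c < cellCount; c++) {
                        // formatCellValue returns "" for a missing (null) cell
                        fields[c] = formatter.formatCellValue(row.getCell(c));
                    }
                    records.add(fields);
                }
            }
        }
        return records;
    }
}
```

Unlike the two readers shown earlier, this sketch keeps the first row of each sheet and does not insert a separator record per sheet; both are easy to add if the test data needs them.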

So far I have not found any drawbacks to this approach.

 
