org.apache.any23.plugin.officescraper
Class ExcelExtractor

java.lang.Object
  extended by org.apache.any23.plugin.officescraper.ExcelExtractor
All Implemented Interfaces:
Extractor<InputStream>, Extractor.ContentExtractor

public class ExcelExtractor
extends Object
implements Extractor.ContentExtractor

Implementation of ContentExtractor able to process a MS Excel 97-2007+ file format .xls/.xlsx and convert the detected content to triples. This extractor is based on Apache POI-HSSF and POI-XSSF Java API.

Author:
Michele Mostarda (mostarda@fbk.eu)

Nested Class Summary
 
Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor
Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor
 
Field Summary
static ExtractorFactory<ExcelExtractor> factory
           
 
Constructor Summary
ExcelExtractor()
           
 
Method Summary
 ExtractorDescription getDescription()
           
 boolean isStopAtFirstError()
           
 void run(ExtractionParameters extractionParameters, ExtractionContext context, InputStream in, ExtractionResult er)
           
 void setStopAtFirstError(boolean f)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

factory

public static final ExtractorFactory<ExcelExtractor> factory
Constructor Detail

ExcelExtractor

public ExcelExtractor()
Method Detail

isStopAtFirstError

public boolean isStopAtFirstError()

setStopAtFirstError

public void setStopAtFirstError(boolean f)
Specified by:
setStopAtFirstError in interface Extractor.ContentExtractor

getDescription

public ExtractorDescription getDescription()
Specified by:
getDescription in interface Extractor<InputStream>

run

public void run(ExtractionParameters extractionParameters,
                ExtractionContext context,
                InputStream in,
                ExtractionResult er)
         throws IOException,
                ExtractionException
Specified by:
run in interface Extractor<InputStream>
Throws:
IOException
ExtractionException


Copyright © 2010-2012 The Apache Software Foundation. All Rights Reserved.