Class CSVExtractor

  • All Implemented Interfaces:
    org.apache.any23.extractor.Extractor<InputStream>, org.apache.any23.extractor.Extractor.ContentExtractor

    public class CSVExtractor
    extends Object
    implements org.apache.any23.extractor.Extractor.ContentExtractor
    This extractor produces RDF from a CSV file. It automatically detects the field delimiter; if detection fails, it uses the delimiter provided in the Any23 configuration.
    Author:
    Davide Palmisano (dpalmisano@gmail.com)
    See Also:
    CSVReaderBuilder
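
    The detect-then-fall-back behaviour described above can be illustrated with a minimal sketch. This is not the Any23 implementation: the class DelimiterSketch, the method detect, the candidate list, and the "same field count on every line" heuristic are all assumptions made for illustration, and the sketch ignores quoted fields.

```java
/**
 * Hypothetical helper, not part of Any23: guesses a CSV field delimiter
 * and falls back to a caller-supplied default when no guess is reliable.
 */
final class DelimiterSketch {

    // Candidate delimiters, tried in this order (an assumption of this sketch).
    private static final char[] CANDIDATES = {',', ';', '\t', '|'};

    /**
     * Returns the first candidate that splits every non-empty line of the
     * sample into the same number of fields (more than one), or the
     * fallback when no candidate qualifies.
     */
    static char detect(String sample, char fallback) {
        String[] lines = sample.split("\\R"); // split on any line break
        for (char candidate : CANDIDATES) {
            int expected = -1;
            boolean consistent = true;
            for (String line : lines) {
                if (line.isEmpty()) {
                    continue; // blank lines carry no information
                }
                int fields = countFields(line, candidate);
                if (expected == -1) {
                    expected = fields;
                } else if (fields != expected) {
                    consistent = false;
                    break;
                }
            }
            if (consistent && expected > 1) {
                return candidate;
            }
        }
        // Detection failed: use the configured default instead.
        return fallback;
    }

    // Naive field count: occurrences of the delimiter plus one (quotes ignored).
    private static int countFields(String line, char delimiter) {
        int count = 1;
        for (int i = 0; i < line.length(); i++) {
            if (line.charAt(i) == delimiter) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        // Semicolon-separated sample, with a comma as the configured fallback.
        System.out.println(detect("a;b;c\n1;2;3\n", ','));
        // No usable delimiter in the sample, so the fallback is returned.
        System.out.println(detect("no delimiter here\n", ','));
    }
}
```

    In this sketch the fallback argument plays the role of the delimiter taken from the Any23 configuration; how the real extractor reads that setting is not shown here.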
    • Nested Class Summary

      • Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor

        org.apache.any23.extractor.Extractor.BlindExtractor, org.apache.any23.extractor.Extractor.ContentExtractor, org.apache.any23.extractor.Extractor.TagSoupDOMExtractor
    • Constructor Summary

      Constructors 
      Constructor Description
      CSVExtractor()  
    • Constructor Detail

      • CSVExtractor

        public CSVExtractor()
    • Method Detail

      • setStopAtFirstError

        public void setStopAtFirstError(boolean f)
        Specified by:
        setStopAtFirstError in interface org.apache.any23.extractor.Extractor.ContentExtractor
      • run

        public void run(org.apache.any23.extractor.ExtractionParameters extractionParameters,
                        org.apache.any23.extractor.ExtractionContext extractionContext,
                        InputStream in,
                        org.apache.any23.extractor.ExtractionResult out)
                 throws IOException,
                        org.apache.any23.extractor.ExtractionException
        Specified by:
        run in interface org.apache.any23.extractor.Extractor<InputStream>
        Throws:
        IOException
        org.apache.any23.extractor.ExtractionException
      • getDescription

        public org.apache.any23.extractor.ExtractorDescription getDescription()
        Specified by:
        getDescription in interface org.apache.any23.extractor.Extractor<InputStream>