Class XPathExtractor

  • All Implemented Interfaces:
    org.apache.any23.extractor.Extractor<Document>, org.apache.any23.extractor.Extractor.TagSoupDOMExtractor

    public class XPathExtractor
    extends Object
    implements org.apache.any23.extractor.Extractor.TagSoupDOMExtractor
    Implementation of an Extractor.TagSoupDOMExtractor able to apply XPathExtractionRules and generate quads.
    Author:
    Michele Mostarda (mostarda@fbk.eu)
    See Also:
    XPathExtractionRule
    • Constructor Detail

      • XPathExtractor

        public XPathExtractor()
    • Method Detail

      • run

        public void run​(org.apache.any23.extractor.ExtractionParameters extractionParameters,
                        org.apache.any23.extractor.ExtractionContext extractionContext,
                        Document in,
                        org.apache.any23.extractor.ExtractionResult out)
                 throws IOException,
                        org.apache.any23.extractor.ExtractionException
        Specified by:
        run in interface org.apache.any23.extractor.Extractor<Document>
        Throws:
        IOException
        org.apache.any23.extractor.ExtractionException
      • getDescription

        public org.apache.any23.extractor.ExtractorDescription getDescription()
        Specified by:
        getDescription in interface org.apache.any23.extractor.Extractor<Document>