Class ICBMExtractor

  • All Implemented Interfaces:
    org.apache.any23.extractor.Extractor<Document>, org.apache.any23.extractor.Extractor.TagSoupDOMExtractor

    public class ICBMExtractor
    extends Object
    implements org.apache.any23.extractor.Extractor.TagSoupDOMExtractor
    Extractor for "ICBM coordinates" provided as META headers in the head of an HTML page.
    Author:
    Gabriele Renzi, Richard Cyganiak (richard@cyganiak.de)
    • Nested Class Summary

      • Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor

        org.apache.any23.extractor.Extractor.BlindExtractor, org.apache.any23.extractor.Extractor.ContentExtractor, org.apache.any23.extractor.Extractor.TagSoupDOMExtractor
    • Constructor Summary

      Constructors 
      Constructor Description
      ICBMExtractor()  
    • Constructor Detail

      • ICBMExtractor

        public ICBMExtractor()
    • Method Detail

      • run

        public void run​(org.apache.any23.extractor.ExtractionParameters extractionParameters,
                        org.apache.any23.extractor.ExtractionContext extractionContext,
                        Document in,
                        org.apache.any23.extractor.ExtractionResult out)
                 throws IOException,
                        org.apache.any23.extractor.ExtractionException
        Specified by:
        run in interface org.apache.any23.extractor.Extractor<Document>
        Throws:
        IOException
        org.apache.any23.extractor.ExtractionException
      • getDescription

        public org.apache.any23.extractor.ExtractorDescription getDescription()
        Specified by:
        getDescription in interface org.apache.any23.extractor.Extractor<Document>