Class LicenseExtractor

  • All Implemented Interfaces:
    org.apache.any23.extractor.Extractor<Document>, org.apache.any23.extractor.Extractor.TagSoupDOMExtractor

    public class LicenseExtractor
    extends Object
    implements org.apache.any23.extractor.Extractor.TagSoupDOMExtractor
    Extractor for the rel-license microformat.
    Author:
    Gabriele Renzi, Richard Cyganiak
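    The rel-license microformat marks a hyperlink as pointing to the license of the current page by including the token "license" in its rel attribute. The following is a minimal sketch of that detection step using only the JDK's DOM API. It is not the code of LicenseExtractor: the class name RelLicenseSketch and its methods are hypothetical, the input is assumed to be well-formed XHTML rather than TagSoup-repaired HTML, and the sketch returns plain href strings instead of writing RDF statements to an ExtractionResult.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical illustration class; not part of Any23.
public class RelLicenseSketch {

    /** Collects href values of a and link elements whose rel tokens include "license". */
    public static List<String> findLicenseLinks(Document doc) {
        List<String> hrefs = new ArrayList<>();
        for (String tag : new String[] {"a", "link"}) {
            NodeList nodes = doc.getElementsByTagName(tag);
            for (int i = 0; i < nodes.getLength(); i++) {
                Element el = (Element) nodes.item(i);
                String href = el.getAttribute("href"); // empty string when absent
                if (!href.isEmpty() && hasRelToken(el.getAttribute("rel"), "license")) {
                    hrefs.add(href);
                }
            }
        }
        return hrefs;
    }

    /** rel holds a whitespace-separated token list, so compare token by token. */
    static boolean hasRelToken(String rel, String token) {
        for (String t : rel.trim().split("\\s+")) {
            if (t.equalsIgnoreCase(token)) {
                return true;
            }
        }
        return false;
    }

    /** Parses well-formed XHTML (assumption: no TagSoup repair is applied here). */
    public static Document parse(String xhtml) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xhtml.getBytes(StandardCharsets.UTF_8)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Input is not well-formed XML", e);
        }
    }

    public static void main(String[] args) {
        Document doc = parse(
                "<html><head><link rel=\"stylesheet\" href=\"s.css\"/></head><body>"
                + "<a rel=\"license nofollow\" href=\"http://creativecommons.org/licenses/by/4.0/\">CC BY</a>"
                + "<a href=\"/about\">About</a>"
                + "</body></html>");
        for (String href : findLicenseLinks(doc)) {
            System.out.println(href);
        }
    }
}
```

    Matching on individual rel tokens rather than on the whole attribute value means a link such as rel="license nofollow" is still recognised, while rel="stylesheet" is ignored.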
    • Nested Class Summary

      • Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor

        org.apache.any23.extractor.Extractor.BlindExtractor, org.apache.any23.extractor.Extractor.ContentExtractor, org.apache.any23.extractor.Extractor.TagSoupDOMExtractor
    • Constructor Detail

      • LicenseExtractor

        public LicenseExtractor()
    • Method Detail

      • run

        public void run(org.apache.any23.extractor.ExtractionParameters extractionParameters,
                        org.apache.any23.extractor.ExtractionContext extractionContext,
                        Document in,
                        org.apache.any23.extractor.ExtractionResult out)
                 throws IOException,
                        org.apache.any23.extractor.ExtractionException
        Specified by:
        run in interface org.apache.any23.extractor.Extractor<Document>
        Throws:
        IOException
        org.apache.any23.extractor.ExtractionException
      • getDescription

        public org.apache.any23.extractor.ExtractorDescription getDescription()
        Specified by:
        getDescription in interface org.apache.any23.extractor.Extractor<Document>