Class HCalendarExtractor

  • All Implemented Interfaces:
    org.apache.any23.extractor.Extractor<Document>, org.apache.any23.extractor.Extractor.TagSoupDOMExtractor

    public class HCalendarExtractor
    extends MicroformatExtractor
    Extractor for the hCalendar microformat.
    Author:
    Gabriele Renzi
    • Constructor Detail

      • HCalendarExtractor

        public HCalendarExtractor()
    • Method Detail

      • getDescription

        public org.apache.any23.extractor.ExtractorDescription getDescription()
        Description copied from class: MicroformatExtractor
        Returns the description of this extractor.
        Specified by:
        getDescription in interface org.apache.any23.extractor.Extractor<Document>
        Specified by:
        getDescription in class MicroformatExtractor
        Returns:
        a human readable description.
      • extract

        protected boolean extract()
                           throws org.apache.any23.extractor.ExtractionException
        Description copied from class: MicroformatExtractor
        Performs the extraction of the data and writes them to the model. The nodes generated in the model can have any name or implicit label but if possible they SHOULD have names (either URIs or AnonId) that are uniquely derivable from their position in the DOM tree, so that multiple extractors can merge information.
        Specified by:
        extract in class MicroformatExtractor
        Returns:
        true if extraction is successful
        Throws:
        org.apache.any23.extractor.ExtractionException - if there is an error during extraction