Class TriXExtractor

  • All Implemented Interfaces:
    org.apache.any23.extractor.Extractor<InputStream>, org.apache.any23.extractor.Extractor.ContentExtractor

    public class TriXExtractor
    extends BaseRDFExtractor
    Concrete implementation of Extractor.ContentExtractor to perform extraction on TriX documents.
    Author:
    Michele Mostarda (mostarda@fbk.eu)
    • Constructor Detail

      • TriXExtractor

        public TriXExtractor​(boolean verifyDataType,
                             boolean stopAtFirstError)
        Constructor, allows to specify the validation and error handling policies.
        Parameters:
        verifyDataType - if true the data types will be verified, if false will be ignored.
        stopAtFirstError - if true the parser will stop at first parsing error, if false will ignore non blocking errors.
      • TriXExtractor

        public TriXExtractor()
        Default constructor, with no verification of data types and not stop at first error.
    • Method Detail

      • getDescription

        public org.apache.any23.extractor.ExtractorDescription getDescription()
      • getParser

        protected org.eclipse.rdf4j.rio.RDFParser getParser​(org.apache.any23.extractor.ExtractionContext extractionContext,
                                                            org.apache.any23.extractor.ExtractionResult extractionResult)
        Specified by:
        getParser in class BaseRDFExtractor