A B C D E F G H I J L M N O P Q R S T U V W X Y
A
- ABOUT_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- AboutNotURIRule - Class in org.apache.any23.validator.rule
-
This rule detects whether an about value is a valid URL or, failing that, a valid relative URL.
- AboutNotURIRule() - Constructor for class org.apache.any23.validator.rule.AboutNotURIRule
- AcceptHeaderBuilder - Class in org.apache.any23.http
-
Concatenates a collection of MIME specs in "type/subtype;q=x.x" notation into an HTTP Accept header value, and removes duplicates and types covered by wildcards.
- AcceptHeaderBuilder(Collection<MIMEType>) - Constructor for class org.apache.any23.http.AcceptHeaderBuilder
- acceptIRI(IRI) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- acceptIRI(IRI) - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
-
Checks if the rule can be applied on the given document IRI.
- acquireProperty(ItemProp) - Method in class org.apache.any23.extractor.microdata.ItemScope
- add(QuadTemplate) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
-
Adds a QuadTemplate to the rule.
- add(QuadTemplate) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- add(Variable) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
-
Adds a variable to the template.
- add(Variable) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- add(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
- addAttribute(String, String, String) - Method in class org.apache.any23.validator.DefaultDOMDocument
- addAttribute(String, String, String) - Method in interface org.apache.any23.validator.DOMDocument
-
Adds an attribute to a node addressed by the given XPath.
- addBNodeProperty(Resource, IRI, BNode) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a BNode property to a node.
- addBNodeProperty(Node, Resource, IRI, BNode) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a BNode property to a node.
- addChild(TripleHandler) - Method in class org.apache.any23.writer.CompositeTripleHandler
-
Adds a decorated handler.
- addIRIProperty(Resource, IRI, IRI) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds an IRI property to a node.
- ADDITIONAL_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
- addPropertyPath(Class<? extends MicroformatExtractor>, Resource, Resource, BNode, String[]) - Method in class org.apache.any23.extractor.ExtractionResultImpl
- addPropertyPath(Class<? extends MicroformatExtractor>, Resource, Resource, BNode, String[]) - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
-
Adds a property path to the list of the extracted data.
- addResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Method in class org.apache.any23.extractor.ExtractionResultImpl
- addResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
-
Adds a root property to the extraction result, specifying also the path corresponding to the root of data which generated the property and the extractor responsible for such addition.
- addRule(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
- addRule(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
-
Registers a new rule with this validator.
- addRule(Class<? extends Rule>, Class<? extends Fix>) - Method in class org.apache.any23.validator.DefaultValidator
- addRule(Class<? extends Rule>, Class<? extends Fix>) - Method in interface org.apache.any23.validator.Validator
-
Registers a new rule with this validator and associates it with a fix.
- AdrExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the adr microformat.
- AdrExtractor() - Constructor for class org.apache.any23.extractor.html.AdrExtractor
- AdrExtractorFactory - Class in org.apache.any23.extractor.html
- AdrExtractorFactory() - Constructor for class org.apache.any23.extractor.html.AdrExtractorFactory
- Any23 - Class in org.apache.any23
-
A facade with convenience methods for typical Any23 extraction operations.
- Any23() - Constructor for class org.apache.any23.Any23
-
Constructor with default configuration.
- Any23(String...) - Constructor for class org.apache.any23.Any23
-
Constructor that allows the specification of a list of extractor names.
- Any23(Configuration) - Constructor for class org.apache.any23.Any23
-
Constructor accepting Configuration.
- Any23(Configuration, String...) - Constructor for class org.apache.any23.Any23
-
Constructor that allows the specification of a custom configuration and of a list of extractor names.
- Any23(Configuration, ExtractorGroup) - Constructor for class org.apache.any23.Any23
-
Constructor that allows the specification of a custom configuration and of a list of extractors.
- Any23(ExtractorGroup) - Constructor for class org.apache.any23.Any23
-
Constructor that allows the specification of a list of extractors.
- Any23ValueFactoryWrapper - Class in org.apache.any23.rdf
-
Any23 specialization of the ValueFactory.
- Any23ValueFactoryWrapper(ValueFactory) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
- Any23ValueFactoryWrapper(ValueFactory, IssueReport) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
- Any23ValueFactoryWrapper(ValueFactory, IssueReport, String) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
Constructor with error reporter.
- applyOn(DOMDocument, RuleContext<?>, ValidationReportBuilder) - Method in interface org.apache.any23.validator.Rule
-
Applies this rule to the given document.
- applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.AboutNotURIRule
- applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MetaNameMisuseRule
- applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MissingItemscopeAttributeValueRule
- applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
- applyTo(Document, Writer) - Method in class org.apache.any23.extractor.rdfa.XSLTStylesheet
-
Applies the XSLT transformation
- applyTo(Document, Writer, Map<String, String>) - Method in class org.apache.any23.extractor.rdfa.XSLTStylesheet
-
Applies the XSLT transformation
- asLines(InputStream) - Static method in class org.apache.any23.util.StreamUtils
-
Returns all the lines read from an input stream.
- asModel(IRI, Object, Value) - Method in class org.apache.any23.extractor.yaml.ElementsProcessor
-
Converts a data structure to ElementsProcessor.ModelHolder.
- asString(InputStream) - Static method in class org.apache.any23.util.StreamUtils
-
Returns the string content of a stream; newline characters are removed.
- asString(InputStream, boolean) - Static method in class org.apache.any23.util.StreamUtils
-
Returns the string content of a stream.
- ATTRIBUTE_CSS - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- available() - Method in class org.apache.any23.util.ReaderInputStream
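The StreamUtils entries above (asLines and the two asString variants) describe reading a stream as lines or as one string, with or without newline characters. The following is a minimal sketch of that behavior using only the JDK. The class name StreamTextSketch, the UTF-8 charset, and the meaning given to the boolean flag (keep newlines when true) are assumptions for illustration, not the Any23 implementation.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper; it only mirrors the behavior described in the index.
final class StreamTextSketch {

    // Reads every line from the stream (UTF-8 assumed), dropping line terminators.
    static List<String> asLines(InputStream in) {
        List<String> lines = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(in, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return lines;
    }

    // Joins the lines; newlines are kept between lines only when preserveNL is true (assumed flag meaning).
    static String asString(InputStream in, boolean preserveNL) {
        return String.join(preserveNL ? "\n" : "", asLines(in));
    }

    // One-argument variant: newline characters are removed.
    static String asString(InputStream in) {
        return asString(in, false);
    }

    public static void main(String[] args) {
        byte[] data = "a\nb\nc".getBytes(StandardCharsets.UTF_8);
        System.out.println(asLines(new ByteArrayInputStream(data)));
        System.out.println(asString(new ByteArrayInputStream(data)));
    }
}
```

In this sketch a trailing line terminator in the input is not reproduced in the output, because the text is rebuilt by joining lines.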
B
- BaseRDFExtractor - Class in org.apache.any23.extractor.rdf
-
Base class for a generic RDF Extractor.ContentExtractor.
- BaseRDFExtractor() - Constructor for class org.apache.any23.extractor.rdf.BaseRDFExtractor
- BaseRDFExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.BaseRDFExtractor
-
Constructor that allows specifying the validation and error handling policies.
- BEGIN_SCRIPT - Static variable in class org.apache.any23.extractor.html.MicroformatExtractor
- BenchmarkTripleHandler - Class in org.apache.any23.writer
-
TripleHandler decorator useful to perform benchmarking.
- BenchmarkTripleHandler(TripleHandler) - Constructor for class org.apache.any23.writer.BenchmarkTripleHandler
-
Constructor.
- blockContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
- bnode() - Static method in class org.apache.any23.rdf.RDFUtils
- bnode(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a BNode.
- BNODE - org.apache.any23.extractor.xpath.TemplateObject.Type
- BNODE - org.apache.any23.extractor.xpath.TemplateSubject.Type
- BODY_TAG - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- ByteArrayDocumentSource - Class in org.apache.any23.source
-
ByteArray implementation of DocumentSource.
- ByteArrayDocumentSource(byte[], String, String) - Constructor for class org.apache.any23.source.ByteArrayDocumentSource
- ByteArrayDocumentSource(InputStream, String, String) - Constructor for class org.apache.any23.source.ByteArrayDocumentSource
C
- CLASS_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
- close() - Method in class org.apache.any23.extractor.ExtractionResultImpl
- close() - Method in class org.apache.any23.filter.ExtractionContextBlocker
- close() - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
- close() - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
- close() - Method in class org.apache.any23.http.DefaultHTTPClient
-
Shuts down the connection manager.
- close() - Method in interface org.apache.any23.http.HTTPClient
-
Releases all static resources held by the instance.
- close() - Method in class org.apache.any23.util.ReaderInputStream
-
Closes the StringReader.
- close() - Method in class org.apache.any23.writer.BenchmarkTripleHandler
- close() - Method in class org.apache.any23.writer.CompositeTripleHandler
- close() - Method in class org.apache.any23.writer.CountingTripleHandler
- close() - Method in class org.apache.any23.writer.JSONWriter
-
Deprecated.
- close() - Method in class org.apache.any23.writer.LoggingTripleHandler
- close() - Method in class org.apache.any23.writer.RDFWriterTripleHandler
- close() - Method in class org.apache.any23.writer.ReportingTripleHandler
- close() - Method in class org.apache.any23.writer.RepositoryWriter
- close() - Method in class org.apache.any23.writer.URIListWriter
- closeContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
- closeContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
- closeContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
- closeContext(ExtractionContext) - Method in class org.apache.any23.writer.TripleWriterHandler
-
Writers may override this method to handle a "closeContext" extraction event.
- closeGracefully(Closeable) - Static method in class org.apache.any23.util.StreamUtils
-
Closes the given Closeable and reports an error, if any.
- CompositeTripleHandler - Class in org.apache.any23.writer
-
A TripleHandler multi-decorator that wraps zero or more other triple handlers and dispatches all events to each of them.
- CompositeTripleHandler() - Constructor for class org.apache.any23.writer.CompositeTripleHandler
-
Constructor with empty decorated list.
- CompositeTripleHandler(Collection<TripleHandler>) - Constructor for class org.apache.any23.writer.CompositeTripleHandler
-
Constructor with initial list of decorated handlers.
- conditionallyAddLiteralProperty(Node, Resource, IRI, Literal) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a literal property to a node.
- conditionallyAddResourceProperty(Resource, IRI, IRI) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds an IRI property to a node.
- conditionallyAddStringProperty(Node, Resource, IRI, String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Helper method that adds a literal property to a subject only if the value of the property is a valid string.
- contains(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
- containsField(String) - Method in class org.apache.any23.extractor.html.HCardName
- CONTENT_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- CountingTripleHandler - Class in org.apache.any23.writer
-
A simple TripleHandler that merely counts the number of triples it has received.
- CountingTripleHandler() - Constructor for class org.apache.any23.writer.CountingTripleHandler
- CountingTripleHandler(boolean) - Constructor for class org.apache.any23.writer.CountingTripleHandler
- countNL(String) - Static method in class org.apache.any23.util.StringUtils
-
Counts the number of NL in the given in string.
- countOccurrences(String, String) - Static method in class org.apache.any23.util.StringUtils
-
Counts how many times content appears within container without string overlapping.
- cp(File, File) - Static method in class org.apache.any23.util.FileUtils
-
Copies a file src to the dest.
- cp(InputStream, File) - Static method in class org.apache.any23.util.FileUtils
-
Copies the content of the input stream into the given dest file.
- createBNode() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createBNode(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createDocumentSource(String) - Method in class org.apache.any23.Any23
-
Returns the most appropriate DocumentSource for the given documentIRI.
- createExtractor() - Method in class org.apache.any23.extractor.calendar.ICalExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.calendar.JCalExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.calendar.XCalExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.csv.CSVExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.AdrExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.GeoExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.HCalendarExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.HCardExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.HeadLinkExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.HListingExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.HRecipeExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.HResumeExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.HReviewExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.ICBMExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.LicenseExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.SpeciesExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.TitleExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.html.XFNExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdf.TriXExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdf.TurtleExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.xpath.XPathExtractorFactory
- createExtractor() - Method in class org.apache.any23.extractor.yaml.YAMLExtractorFactory
- createInitializedHTTPClient() - Static method in class org.apache.any23.http.DefaultHTTPClient
-
Creates a DefaultHTTPClient instance already initialized.
- createIRI(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createIRI(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(boolean) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(byte) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(double) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(float) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(int) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(long) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(short) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(String, IRI) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(BigDecimal) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(BigInteger) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(Date) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLiteral(XMLGregorianCalendar) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createLocalCopy(DocumentSource) - Method in interface org.apache.any23.source.LocalCopyFactory
- createLocalCopy(DocumentSource) - Method in class org.apache.any23.source.MemCopyFactory
- createStatement(Resource, IRI, Value) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createStatement(Resource, IRI, Value, Resource) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- createSubset(String...) - Static method in class org.apache.any23.rdf.PopularPrefixes
-
This method performs a prefix lookup.
- CSVExtractor - Class in org.apache.any23.extractor.csv
-
This extractor produces RDF from a CSV file.
- CSVExtractor() - Constructor for class org.apache.any23.extractor.csv.CSVExtractor
- CSVExtractorFactory - Class in org.apache.any23.extractor.csv
- CSVExtractorFactory() - Constructor for class org.apache.any23.extractor.csv.CSVExtractorFactory
- CURIE_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
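The countOccurrences entry above describes counting how often one string appears in another without overlapping matches. A minimal sketch of that counting rule follows. The class name OccurrenceCountSketch, the parameter order (container first, content second), and the rejection of an empty content string are assumptions for illustration, not the Any23 source.

```java
// Hypothetical helper illustrating non-overlapping occurrence counting.
final class OccurrenceCountSketch {

    // Counts non-overlapping occurrences of content inside container.
    static int countOccurrences(String container, String content) {
        if (content.isEmpty()) {
            // Assumption: an empty search string is rejected rather than counted.
            throw new IllegalArgumentException("content must not be empty");
        }
        int count = 0;
        int from = 0;
        int index;
        while ((index = container.indexOf(content, from)) != -1) {
            count++;
            // Skip past the whole match so the next search cannot overlap it.
            from = index + content.length();
        }
        return count;
    }

    public static void main(String[] args) {
        System.out.println(countOccurrences("aaaa", "aa"));     // prints 2
        System.out.println(countOccurrences("abcabc", "bc"));   // prints 2
    }
}
```

Advancing the search position by the full match length is what rules out overlaps: an overlapping count of "aa" in "aaaa" would give 3, while this sketch gives 2.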
D
- DATATYPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- Date - org.apache.any23.extractor.microdata.ItemPropValue.Type
- DEFAULT_HTTP_CLIENT_USER_AGENT - Static variable in class org.apache.any23.Any23
-
Default HTTP User Agent defined in default configuration.
- DefaultDOMDocument - Class in org.apache.any23.validator
-
This class wraps the DOM document.
- DefaultDOMDocument(URI, Document) - Constructor for class org.apache.any23.validator.DefaultDOMDocument
- DefaultHTTPClient - Class in org.apache.any23.http
-
Opens an InputStream on an HTTP IRI.
- DefaultHTTPClient() - Constructor for class org.apache.any23.http.DefaultHTTPClient
- DefaultHTTPClientConfiguration - Class in org.apache.any23.http
-
Default implementation of HTTPClientConfiguration.
- DefaultHTTPClientConfiguration() - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
-
Constructor.
- DefaultHTTPClientConfiguration(String) - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
-
Constructor.
- DefaultHTTPClientConfiguration(String, int, int, String) - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
-
Constructor.
- DefaultRuleContext - Class in org.apache.any23.validator
-
Default implementation of RuleContext.
- DefaultRuleContext() - Constructor for class org.apache.any23.validator.DefaultRuleContext
- DefaultValidationReport - Class in org.apache.any23.validator
-
Default implementation of ValidationReport.
- DefaultValidationReport(List<ValidationReport.Issue>, List<ValidationReport.RuleActivation>, List<ValidationReport.Error>) - Constructor for class org.apache.any23.validator.DefaultValidationReport
- DefaultValidationReportBuilder - Class in org.apache.any23.validator
-
Default implementation of ValidationReportBuilder.
- DefaultValidationReportBuilder() - Constructor for class org.apache.any23.validator.DefaultValidationReportBuilder
- DefaultValidator - Class in org.apache.any23.validator
-
Default implementation of Validator.
- DefaultValidator() - Constructor for class org.apache.any23.validator.DefaultValidator
- deferProperties(String...) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Given a document and a list of itemprop names this method will return such itemprops.
- DiscoveryUtils - Class in org.apache.any23.util
-
This class provides utility methods for discovering classes in packages.
- disownProperty(ItemProp) - Method in class org.apache.any23.extractor.microdata.ItemScope
- DocumentReport - Class in org.apache.any23.extractor.html
-
Represents the validationReportBuilder generated by the TagSoupParser when a document is retrieved and validated.
- DocumentReport(ValidationReport, Document) - Constructor for class org.apache.any23.extractor.html.DocumentReport
- documentToInputStream(Document) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Given a Document this method will return an input stream representing that document.
- documentToInputStream(Document) - Static method in class org.apache.any23.util.StreamUtils
-
Converts a Document to an InputStream.
- DOMDocument - Interface in org.apache.any23.validator
-
This interface models a document to be processed by the Validator.
- DomUtils - Class in org.apache.any23.extractor.html
-
This class provides utility methods for DOM manipulation.
- dumpContent(File, String) - Static method in class org.apache.any23.util.FileUtils
-
Dumps the given string into a file.
- dumpContent(File, Throwable) - Static method in class org.apache.any23.util.FileUtils
-
Dumps the stack trace of the given exception into the specified file.
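The two dumpContent entries above describe writing a string, or the stack trace of an exception, into a file. One way this can be done with only the JDK is sketched below. The class name DumpSketch, the use of java.nio.file.Path instead of File, the UTF-8 encoding, and the read helper are assumptions for illustration and do not reflect the FileUtils source.

```java
import java.io.IOException;
import java.io.PrintWriter;
import java.io.StringWriter;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper mirroring the two dumpContent overloads described above.
final class DumpSketch {

    // Writes the given string into the file, replacing any previous content.
    static void dumpContent(Path file, String content) {
        try {
            Files.write(file, content.getBytes(StandardCharsets.UTF_8));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Renders the stack trace of the throwable to text, then writes it into the file.
    static void dumpContent(Path file, Throwable t) {
        StringWriter buffer = new StringWriter();
        try (PrintWriter writer = new PrintWriter(buffer)) {
            t.printStackTrace(writer);
        }
        dumpContent(file, buffer.toString());
    }

    // Reads the file back as a string (helper used only to inspect the result).
    static String read(Path file) {
        try {
            return new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Path target = Paths.get(System.getProperty("java.io.tmpdir"), "dump-sketch.txt");
        dumpContent(target, new IllegalStateException("example failure"));
        System.out.println(read(target));
    }
}
```

Rendering the trace into a StringWriter first keeps the file-writing logic in a single method shared by both overloads.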
E
- ELEMENT_LOCATION - Static variable in class org.apache.any23.extractor.html.TagSoupParser
- ElementsProcessor - Class in org.apache.any23.extractor.yaml
-
Converts an Object into an RDF graph encoded as an ElementsProcessor.ModelHolder.
- ElementsProcessor.ModelHolder - Class in org.apache.any23.extractor.yaml
-
A model holder describes the two required parameters that make a model useful in further processing: a root node and the model itself.
- EMBEDDED_PROPERTY_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
- EmbeddedJSONLDExtractor - Class in org.apache.any23.extractor.html
-
This extractor represents the HTML script tags used to embed blocks of data in documents.
- EmbeddedJSONLDExtractor() - Constructor for class org.apache.any23.extractor.html.EmbeddedJSONLDExtractor
- EmbeddedJSONLDExtractorFactory - Class in org.apache.any23.extractor.html
- EmbeddedJSONLDExtractorFactory() - Constructor for class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
- EmptyValidationReport - Class in org.apache.any23.validator
-
An implementation of ValidationReportBuilder with no data.
- END_SCRIPT - Static variable in class org.apache.any23.extractor.html.MicroformatExtractor
- endDocument(IRI) - Method in class org.apache.any23.filter.ExtractionContextBlocker
- endDocument(IRI) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
- endDocument(IRI) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
- endDocument(IRI) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
- endDocument(IRI) - Method in class org.apache.any23.writer.CompositeTripleHandler
- endDocument(IRI) - Method in class org.apache.any23.writer.CountingTripleHandler
- endDocument(IRI) - Method in class org.apache.any23.writer.JSONWriter
-
Deprecated.
- endDocument(IRI) - Method in class org.apache.any23.writer.LoggingTripleHandler
- endDocument(IRI) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
- endDocument(IRI) - Method in class org.apache.any23.writer.ReportingTripleHandler
- endDocument(IRI) - Method in class org.apache.any23.writer.RepositoryWriter
- endDocument(IRI) - Method in class org.apache.any23.writer.TripleWriterHandler
-
Writers may override this method to handle an "endDocument" extraction event.
- endDocument(IRI) - Method in class org.apache.any23.writer.URIListWriter
- endRDF() - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
- EntityBasedMicroformatExtractor - Class in org.apache.any23.extractor.html
-
Base class for microformat extractors based on entities.
- EntityBasedMicroformatExtractor() - Constructor for class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
- equals(Object) - Method in class org.apache.any23.extractor.microdata.Item
- equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemProp
- equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemScope
- Error(Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.Error
- ERROR - org.apache.any23.validator.ValidationReport.IssueLevel
- ERRORED_META_NODES - Static variable in class org.apache.any23.validator.rule.MetaNameMisuseRule
- escapeAsJSONString(String) - Static method in class org.apache.any23.util.StringUtils
-
Escapes the in string as a JSON string so that it can be embedded within a string field.
- escapeDoubleQuotes(String) - Static method in class org.apache.any23.util.StringUtils
-
Escapes all the unescaped double quotes when needed.
- ExampleInputOutput - Class in org.apache.any23.extractor
-
A reporter for example input and output of an extractor.
- ExampleInputOutput(String) - Constructor for class org.apache.any23.extractor.ExampleInputOutput
- ExampleInputOutput(ExtractorFactory<?>) - Constructor for class org.apache.any23.extractor.ExampleInputOutput
- execute(Rule, RuleContext, DOMDocument) - Method in interface org.apache.any23.validator.Fix
-
Executes this fix over a document.
- execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.MetaNameMisuseFix
- execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.MissingItemscopeAttributeValueFix
- execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
- extract() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
- extract() - Method in class org.apache.any23.extractor.html.HCalendarExtractor
- extract() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Performs the extraction of the data and writes them to the model.
- extract(File, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given file sending the generated events to the specified outputHandler.
- extract(String, String, String, String, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction on the in string associated to the documentIRI IRI, declaring contentType and encoding.
- extract(String, String, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction on the in string associated to the documentIRI IRI, sending the generated events to the specified outputHandler.
- extract(String, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given documentIRI sending the generated events to the specified outputHandler.
- extract(ExtractionParameters, String, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given documentIRI sending the generated events to the specified outputHandler.
- extract(ExtractionParameters, DocumentSource, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
- extract(ExtractionParameters, DocumentSource, TripleHandler, String) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
- extract(DocumentSource, TripleHandler) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
- extract(DocumentSource, TripleHandler, String) - Method in class org.apache.any23.Any23
-
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.AdrExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
-
Extracts an entity from a DOM node.
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.GeoExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HCardExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HListingExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HRecipeExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HResumeExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HReviewExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractor
- extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.SpeciesExtractor
-
Extracts an entity from a DOM node.
- extractEntityAsEmbeddedProperty(HTMLDocument, BNode, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
- extractEntityAsEmbeddedProperty(HTMLDocument, BNode, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
- ExtractionContextBlocker - Class in org.apache.any23.filter
-
A wrapper around a TripleHandler that can block and unblock calls to the handler, either for the entire document, or for individual ExtractionContexts.
- ExtractionContextBlocker(TripleHandler) - Constructor for class org.apache.any23.filter.ExtractionContextBlocker
- ExtractionReport - Class in org.apache.any23
-
This class contains some statistics and general information about an extraction.
- ExtractionReport(List<Extractor>, String, String, ValidationReport, Map<String, Collection<IssueReport.Issue>>) - Constructor for class org.apache.any23.ExtractionReport
- ExtractionResultImpl - Class in org.apache.any23.extractor
-
A default implementation of ExtractionResult; it receives extraction output from one Extractor working on one document, and passes the output on to a TripleHandler.
- ExtractionResultImpl(ExtractionContext, Extractor<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.ExtractionResultImpl
- ExtractorRegistryImpl - Class in org.apache.any23.extractor
-
Singleton class acting as a register for all the various Extractor implementations.
- ExtractorRegistryImpl() - Constructor for class org.apache.any23.extractor.ExtractorRegistryImpl
-
Public constructor for ExtractorRegistryImpl.
- extractors() - Method in annotation type org.apache.any23.extractor.html.annotations.Includes
- extractors() - Method in annotation type org.apache.any23.extractor.html.microformats2.annotations.Includes
- extractPrefixSections(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated. Given a prefix declaration, returns a list of prefixID:prefixURL strings, normalizing blanks where present.
- extractRelTag(String) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Extracts the href specific rel-tag string.
- extractRelTag(NamedNodeMap) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Extracts the href specific rel-tag string.
- extractRelTagNodes() - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Extracts all the rel tag nodes.
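The extractRelTag entries above only say that the rel-tag string is derived from the href. As a rough illustration, the sketch below follows the rel-tag microformat convention that the tag is the last path segment of the link target. It is not Any23's implementation: the class RelTagSketch and the method relTagOf are hypothetical names, and the code assumes the href is a syntactically valid URI.

```java
import java.io.UnsupportedEncodingException;
import java.net.URI;
import java.net.URLDecoder;

// Hypothetical helper, not part of Any23.
public class RelTagSketch {

    /**
     * Returns the last non-empty path segment of href, URL-decoded,
     * or null when the href has no usable path segment.
     */
    static String relTagOf(String href) {
        if (href == null) {
            return null;
        }
        // getRawPath() is null for opaque URIs such as "mailto:a@b".
        String path = URI.create(href.trim()).getRawPath();
        if (path == null) {
            return null;
        }
        String[] segments = path.split("/");
        // Walk backwards so a trailing slash does not hide the tag.
        for (int i = segments.length - 1; i >= 0; i--) {
            if (!segments[i].isEmpty()) {
                try {
                    // Decodes both "+" and "%20" to a space.
                    return URLDecoder.decode(segments[i], "UTF-8");
                } catch (UnsupportedEncodingException e) {
                    throw new IllegalStateException(e);
                }
            }
        }
        return null;
    }

    public static void main(String[] args) {
        System.out.println(relTagOf("http://example.org/tags/semantic+web/"));
    }
}
```

Parsing with java.net.URI first keeps the host name and any query string or fragment out of the segment search, so a bare "http://example.org/" yields null rather than the host.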
F
- FAMILY_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
- FIELDS - Static variable in class org.apache.any23.extractor.html.HCardName
- FileDocumentSource - Class in org.apache.any23.source
-
File implementation of DocumentSource.
- FileDocumentSource(File) - Constructor for class org.apache.any23.source.FileDocumentSource
- FileDocumentSource(File, String) - Constructor for class org.apache.any23.source.FileDocumentSource
- FileUtils - Class in org.apache.any23.util
-
Utility class for handling files.
- find(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
- find(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Gets the string value of an XPath expression.
- findAll(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
- findAll(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Returns a NodeList composed of all the nodes that match an XPath expression, which must be valid.
- findAllByAttributeContains(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
- findAllByAttributeName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Finds all nodes that have a declared attribute.
- findAllByClassName(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Finds all the nodes by class name.
- findAllByClassName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Finds all nodes that have a declared class.
- findAllByTag(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
- findAllByTagAndClassName(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
- findMicroformattedObjectNode(String, String) - Method in class org.apache.any23.extractor.html.HTMLDocument
- findMicroformattedValue(String, String, String, String, String) - Method in class org.apache.any23.extractor.html.HTMLDocument
- findNodeById(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
- findNodeById(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Mimics the JS DOM API, or prototype's $()
- Fix - Interface in org.apache.any23.validator
-
This interface models a fix for an issue.
- fixAbsoluteIRI(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Fixes typical errors in an absolute org.eclipse.rdf4j.model.IRI, such as unescaped spaces.
- FixError(Fix, Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.FixError
- fixIRI(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- fixIRIWithException(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Tries to fix a potentially broken relative or absolute URI.
- fixLink(String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
- fixLink(String, String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
- fixLink(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
Helper method that adds a scheme to a URI if it does not already have one; returns null if the link is empty.
- fixWhiteSpace(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- formatDateTime(Date) - Static method in class org.apache.any23.extractor.microdata.ItemPropValue
- fromStrings(Collection<String>) - Static method in class org.apache.any23.http.AcceptHeaderBuilder
- FunctionalSyntaxExtractor - Class in org.apache.any23.extractor.rdf
-
Concrete implementation of Extractor.ContentExtractor handling OWL2 Functional-Style Syntax format.
- FunctionalSyntaxExtractor() - Constructor for class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractor
- FunctionalSyntaxExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractor
- FunctionalSyntaxExtractorFactory - Class in org.apache.any23.extractor.rdf
- FunctionalSyntaxExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
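The DomUtils find(Node, String) and findAll(Node, String) entries above describe evaluating an XPath expression against a DOM node, returning either its string value or the matching nodes. The sketch below shows the same idea using only the JDK's javax.xml.xpath API. It is an illustration rather than the DomUtils source: the class XPathFindSketch and its parse helper are hypothetical, and reporting an invalid expression as an IllegalArgumentException is an assumption, not documented Any23 behavior.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper, not part of Any23.
public class XPathFindSketch {

    /** Returns every node under the given node that matches the XPath expression. */
    static NodeList findAll(Node node, String xpath) {
        try {
            return (NodeList) XPathFactory.newInstance().newXPath()
                    .evaluate(xpath, node, XPathConstants.NODESET);
        } catch (XPathExpressionException e) {
            throw new IllegalArgumentException("Invalid XPath: " + xpath, e);
        }
    }

    /** Returns the string value of the XPath expression ("" when nothing matches). */
    static String find(Node node, String xpath) {
        try {
            return XPathFactory.newInstance().newXPath().evaluate(xpath, node);
        } catch (XPathExpressionException e) {
            throw new IllegalArgumentException("Invalid XPath: " + xpath, e);
        }
    }

    /** Parses a well-formed XML string into a DOM document. */
    static Document parse(String xml) {
        try {
            return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Cannot parse XML", e);
        }
    }

    public static void main(String[] args) {
        Document doc = parse("<div><p class='a'>one</p><p>two</p></div>");
        System.out.println(findAll(doc, "//p").getLength());
        System.out.println(find(doc, "//p[@class='a']"));
    }
}
```

Creating a new XPath object per call keeps the sketch thread-safe at the cost of some overhead; code that runs many queries would typically reuse a compiled XPathExpression instead.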
G
- GeoExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the Geo microformat.
- GeoExtractor() - Constructor for class org.apache.any23.extractor.html.GeoExtractor
- GeoExtractorFactory - Class in org.apache.any23.extractor.html
- GeoExtractorFactory() - Constructor for class org.apache.any23.extractor.html.GeoExtractorFactory
- get() - Static method in class org.apache.any23.rdf.PopularPrefixes
- getAcceptHeader() - Method in class org.apache.any23.http.AcceptHeaderBuilder
-
Builds and returns an accept header.
- getAcceptHeader() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
- getAcceptHeader() - Method in interface org.apache.any23.http.HTTPClientConfiguration
- getActualDocumentIRI() - Method in class org.apache.any23.http.DefaultHTTPClient
- getActualDocumentIRI() - Method in interface org.apache.any23.http.HTTPClient
-
Returns the actual IRI from which the document was fetched.
- getAllNames() - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
Returns the names of all registered extractors, sorted alphabetically.
- getAllRules() - Method in class org.apache.any23.validator.DefaultValidator
- getAllRules() - Method in interface org.apache.any23.validator.Validator
-
Returns all the registered rules.
- getAsDate() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- getAsFloat() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- getAsInteger() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- getAsLink() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- getAsNested() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- getAsPlainLiteral(Node, String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- getAsXMLLiteral(Node) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- getBaseClassName() - Method in class org.apache.any23.extractor.html.AdrExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
-
Returns the base class name for the extractor.
- getBaseClassName() - Method in class org.apache.any23.extractor.html.GeoExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HCardExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HListingExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HResumeExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.HReviewExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractor
- getBaseClassName() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
-
Returns the base class name for the extractor.
- getBeginColumnNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
- getBeginLineNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
- getBlankNodeFor(Node) - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
- getBNode(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a BNode with an MD5 digest as part of the ID.
- getCause() - Method in class org.apache.any23.validator.ValidationReport.Error
- getChilds() - Method in class org.apache.any23.writer.CompositeTripleHandler
- getClassesInPackage(String) - Static method in class org.apache.any23.util.DiscoveryUtils
-
Scans all classes accessible from the context class loader which belong to the given package and sub-packages.
- getClassesInPackage(String, Class<?>) - Static method in class org.apache.any23.util.DiscoveryUtils
-
Scans all classes accessible from the context class loader which belong to the given package and sub-packages, and filters them down to those implementing the specified interface iface.
- getConnectionTimeout() - Method in class org.apache.any23.http.DefaultHTTPClient
- getContent() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- getContentLength() - Method in class org.apache.any23.http.DefaultHTTPClient
- getContentLength() - Method in interface org.apache.any23.http.HTTPClient
- getContentLength() - Method in class org.apache.any23.source.ByteArrayDocumentSource
- getContentLength() - Method in class org.apache.any23.source.FileDocumentSource
- getContentLength() - Method in class org.apache.any23.source.HTTPDocumentSource
- getContentLength() - Method in class org.apache.any23.source.StringDocumentSource
- getContentType() - Method in class org.apache.any23.http.DefaultHTTPClient
- getContentType() - Method in interface org.apache.any23.http.HTTPClient
-
The value of the Content-Type header reported by the server.
- getContentType() - Method in class org.apache.any23.source.ByteArrayDocumentSource
- getContentType() - Method in class org.apache.any23.source.FileDocumentSource
- getContentType() - Method in class org.apache.any23.source.HTTPDocumentSource
- getContentType() - Method in class org.apache.any23.source.StringDocumentSource
- getCount() - Method in class org.apache.any23.writer.CountingTripleHandler
- getCurrentExtractionResult() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Returns the ExtractionResult associated to the extraction session.
- getData(String) - Method in class org.apache.any23.validator.DefaultRuleContext
- getData(String) - Method in interface org.apache.any23.validator.RuleContext
-
Retrieves a registered object.
- getDefaultLanguage() - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the document default language.
- getDefaultLiteralLanguage() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- getDefaultTimeout() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
- getDefaultTimeout() - Method in interface org.apache.any23.http.HTTPClientConfiguration
- getDescription() - Method in class org.apache.any23.extractor.calendar.ICalExtractor
- getDescription() - Method in class org.apache.any23.extractor.calendar.JCalExtractor
- getDescription() - Method in class org.apache.any23.extractor.calendar.XCalExtractor
- getDescription() - Method in class org.apache.any23.extractor.csv.CSVExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.AdrExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.GeoExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.HCalendarExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.HCardExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.HeadLinkExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.HListingExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.HResumeExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.HReviewExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.HTMLMetaExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.ICBMExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.LicenseExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
-
Returns the description of this extractor.
- getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
-
Returns the description of this extractor.
- getDescription() - Method in class org.apache.any23.extractor.html.TitleExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractor
- getDescription() - Method in class org.apache.any23.extractor.html.XFNExtractor
- getDescription() - Method in class org.apache.any23.extractor.microdata.MicrodataExtractor
- getDescription() - Method in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractor
- getDescription() - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
- getDescription() - Method in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractor
- getDescription() - Method in class org.apache.any23.extractor.rdf.NQuadsExtractor
- getDescription() - Method in class org.apache.any23.extractor.rdf.NTriplesExtractor
- getDescription() - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractor
- getDescription() - Method in class org.apache.any23.extractor.rdf.TriXExtractor
- getDescription() - Method in class org.apache.any23.extractor.rdf.TurtleExtractor
- getDescription() - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
- getDescription() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
- getDescription() - Method in class org.apache.any23.extractor.xpath.XPathExtractor
- getDescription() - Method in class org.apache.any23.extractor.yaml.YAMLExtractor
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.calendar.ICalExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.calendar.JCalExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.calendar.XCalExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.csv.CSVExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.AdrExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.GeoExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HCalendarExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HCardExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HeadLinkExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HListingExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HRecipeExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HResumeExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HReviewExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.ICBMExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.LicenseExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.SpeciesExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.TitleExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.XFNExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.TriXExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.TurtleExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.xpath.XPathExtractorFactory
- getDescriptionInstance() - Static method in class org.apache.any23.extractor.yaml.YAMLExtractorFactory
- getDetectedItemScopes() - Method in class org.apache.any23.extractor.microdata.MicrodataParserReport
- getDetectedMimeType() - Method in class org.apache.any23.ExtractionReport
- getDetectedMIMEType() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Returns the detected mimetype for the given DocumentSource.
- getDocument() - Method in class org.apache.any23.extractor.html.DocumentReport
- getDocument() - Method in class org.apache.any23.extractor.html.HTMLDocument
- getDocumentBase(URL, Document) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- getDocumentIRI() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
- getDocumentIRI() - Method in class org.apache.any23.source.ByteArrayDocumentSource
- getDocumentIRI() - Method in class org.apache.any23.source.FileDocumentSource
- getDocumentIRI() - Method in class org.apache.any23.source.HTTPDocumentSource
- getDocumentIRI() - Method in class org.apache.any23.source.StringDocumentSource
- getDocumentIRI() - Method in class org.apache.any23.validator.DefaultDOMDocument
- getDocumentIRI() - Method in interface org.apache.any23.validator.DOMDocument
- getDOM() - Method in class org.apache.any23.extractor.html.TagSoupParser
-
Returns the DOM of the given document IRI.
- getEncoding() - Method in class org.apache.any23.ExtractionReport
- getEndColumnNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
- getEndLineNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
- getErrorLocationBeginCol() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
- getErrorLocationBeginRow() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
- getErrorLocationEndCol() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
- getErrorLocationEndRow() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
- getErrorMode() - Method in class org.apache.any23.extractor.microdata.MicrodataParser
- getErrorPath() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
- getErrors() - Method in class org.apache.any23.extractor.microdata.MicrodataParser
- getErrors() - Method in class org.apache.any23.extractor.microdata.MicrodataParserReport
- getErrors() - Method in class org.apache.any23.validator.DefaultValidationReport
- getErrors() - Method in class org.apache.any23.validator.EmptyValidationReport
- getErrors() - Method in interface org.apache.any23.validator.ValidationReport
-
Returns the list of detected errors.
- getExampleInput() - Method in class org.apache.any23.extractor.ExampleInputOutput
- getExampleInput() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
- getExampleIRI() - Method in class org.apache.any23.extractor.ExampleInputOutput
- getExampleOutput() - Method in class org.apache.any23.extractor.ExampleInputOutput
- getExtractionContext() - Method in class org.apache.any23.extractor.ExtractionResultImpl
- getExtractionContext() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
- getExtractor() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
- getExtractor() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
- getExtractorGroup() - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
- getExtractorGroup(List<String>) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
Returns an ExtractorGroup containing the ExtractorFactory instances matching the names provided as input.
- getExtractorIssues(String) - Method in class org.apache.any23.ExtractionReport
- getExtractorLabel() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
- getExtractorName() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
- getExtractorNames() - Method in class org.apache.any23.writer.ReportingTripleHandler
- getExtractorToIssues() - Method in class org.apache.any23.extractor.SingleDocumentExtractionReport
- getFactory(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
Retrieves an ExtractorFactory given its name.
- getField(String) - Method in class org.apache.any23.extractor.html.HCardName
- getFields(String) - Method in class org.apache.any23.extractor.html.HCardName
- getFixes(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
- getFixes(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
-
Returns all fixes registered for the given rule.
- getFormatByExtension(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Returns a parser type from the given extension.
- getFormats() - Static method in class org.apache.any23.rdf.RDFUtils
-
Returns all the available RDFFormats.
- getFullName() - Method in class org.apache.any23.extractor.html.HCardName
- getFunctionalSyntaxParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured FunctionalSyntaxParser.
- getGraph() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
- getHRName() - Method in interface org.apache.any23.validator.Fix
- getHRName() - Method in class org.apache.any23.validator.rule.AboutNotURIRule
- getHRName() - Method in interface org.apache.any23.validator.Rule
- getHRName() - Method in class org.apache.any23.validator.rule.MetaNameMisuseFix
- getHRName() - Method in class org.apache.any23.validator.rule.MetaNameMisuseRule
- getHRName() - Method in class org.apache.any23.validator.rule.MissingItemscopeAttributeValueFix
- getHRName() - Method in class org.apache.any23.validator.rule.MissingItemscopeAttributeValueRule
- getHRName() - Method in class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
- getHRName() - Method in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
- getHTMLDocument() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
- getHTTPClient() - Method in class org.apache.any23.Any23
-
Returns the current HTTPClient implementation.
- getHTTPUserAgent() - Method in class org.apache.any23.Any23
-
Returns the HTTP Header User Agent, see RFC 2616-14.43.
- getId() - Method in class org.apache.any23.extractor.microdata.ItemScope
- getIdentifier() - Method in class org.apache.any23.writer.JSONLDWriterFactory
- getIdentifier() - Method in class org.apache.any23.writer.JSONWriterFactory
-
Deprecated.
- getIdentifier() - Method in class org.apache.any23.writer.NQuadsWriterFactory
- getIdentifier() - Method in class org.apache.any23.writer.NTriplesWriterFactory
- getIdentifier() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
- getIdentifier() - Method in class org.apache.any23.writer.TriXWriterFactory
- getIdentifier() - Method in class org.apache.any23.writer.TurtleWriterFactory
- getIdentifier() - Method in class org.apache.any23.writer.URIListWriterFactory
- getIndexInParent(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Given a node this method returns the index corresponding to such node within the list of the children of its parent node.
- getInstance() - Static method in class org.apache.any23.extractor.ExtractorRegistryImpl
- getInstance() - Static method in class org.apache.any23.extractor.rdf.RDFParserFactory
- getInstance() - Static method in class org.apache.any23.extractor.yaml.ElementsProcessor
- getInstance() - Static method in class org.apache.any23.validator.EmptyValidationReport
- getInternalValue() - Method in class org.apache.any23.extractor.xpath.Term
- getIssueReport() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- getIssues() - Method in class org.apache.any23.extractor.ExtractionResultImpl
- getIssues() - Method in class org.apache.any23.validator.DefaultValidationReport
- getIssues() - Method in class org.apache.any23.validator.EmptyValidationReport
- getIssues() - Method in interface org.apache.any23.validator.ValidationReport
-
Returns the list of detected issues.
- getIssuesCount() - Method in class org.apache.any23.extractor.ExtractionResultImpl
- getItemId() - Method in class org.apache.any23.extractor.microdata.ItemScope
- getItemPropNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the itemProps detected within the given root node.
- getItemProps(Node, boolean) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the itemprops for the given itemscope node.
- getItemScope(Node) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns the ItemScope instance described within the specified node.
- getItemScopeNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the itemScopes detected within the given root node.
- getJSONLDParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured SesameJSONLDParser.
- getKey(ExtractorFactory) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
- getLevel() - Method in class org.apache.any23.validator.ValidationReport.Issue
- getManchesterSyntaxParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured ManchesterSyntaxParser.
- getMapping(String) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated. Returns an IRI mapping for a given prefix.
- getMatchingExtractors() - Method in class org.apache.any23.ExtractionReport
- getMatchingExtractors() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
- getMaxConnections() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
- getMaxConnections() - Method in interface org.apache.any23.http.HTTPClientConfiguration
- getMessage() - Method in class org.apache.any23.validator.ValidationReport.Error
- getMessage() - Method in class org.apache.any23.validator.ValidationReport.Issue
- getMicrodata(Document) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the Microdata items detected within the given document. Works in full report mode.
- getMicrodata(Document, MicrodataParser.ErrorMode) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns all the Microdata items detected within the given document.
- getMicrodataAsJSON(Document, PrintStream) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns a JSON containing the list of all extracted Microdata, as described at Microdata JSON Specification.
- getModel() - Method in class org.apache.any23.extractor.yaml.ElementsProcessor.ModelHolder
- getName() - Method in class org.apache.any23.extractor.microdata.ItemProp
- getName() - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
- getName() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- getName() - Method in class org.apache.any23.extractor.xpath.Variable
- getName() - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
- getNode(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
- getNode(String) - Method in interface org.apache.any23.validator.DOMDocument
-
Returns the node addressed by the given XPath; if more than one node matches, an exception is raised.
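A single-node XPath lookup of this kind can be sketched with the JDK's javax.xml.xpath package. The sketch below is illustrative only and is not the DefaultDOMDocument implementation: the class name, the parse helper, the exception types, and the decision to return null when nothing matches are assumptions.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

final class SingleNodeLookupSketch {

    /** Returns the only node matching the XPath, null if none matches; throws if several match. */
    static Node singleNode(Document doc, String xpath) {
        NodeList matches;
        try {
            matches = (NodeList) XPathFactory.newInstance().newXPath()
                    .evaluate(xpath, doc, XPathConstants.NODESET);
        } catch (XPathExpressionException e) {
            throw new IllegalArgumentException("Invalid XPath: " + xpath, e);
        }
        if (matches.getLength() > 1) {
            // Exception type is an assumption; the index only says "an exception".
            throw new IllegalStateException(
                    "Expected at most one node for " + xpath + ", found " + matches.getLength());
        }
        return matches.getLength() == 0 ? null : matches.item(0);
    }

    /** Hypothetical helper: parses an XML string, wrapping checked exceptions. */
    static Document parse(String xml) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Unparseable XML", e);
        }
    }

    public static void main(String[] args) {
        Document doc = parse("<r><a/><b/><b/></r>");
        System.out.println(singleNode(doc, "/r/a").getNodeName()); // prints a
    }
}
```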
- getNodeLocation(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Returns the row/col location of the given node.
- getNodes(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
- getNodes(String) - Method in interface org.apache.any23.validator.DOMDocument
-
Returns the list of nodes addressed by the given XPath.
- getNodesWithAttribute(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
- getNodesWithAttribute(String) - Method in interface org.apache.any23.validator.DOMDocument
-
Returns all the nodes declaring an attribute with the specified name.
- getNQuadsParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured NQuadsParser.
- getNTriplesParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured NTriplesParser.
- getObject() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
- getObject() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
- getOrganization() - Method in class org.apache.any23.extractor.html.HCardName
- getOrganizationUnit() - Method in class org.apache.any23.extractor.html.HCardName
- getOrigin() - Method in class org.apache.any23.validator.ValidationReport.FixError
- getOrigin() - Method in class org.apache.any23.validator.ValidationReport.Issue
- getOrigin() - Method in class org.apache.any23.validator.ValidationReport.RuleError
- getOriginalDocument() - Method in class org.apache.any23.validator.DefaultDOMDocument
- getOriginalDocument() - Method in interface org.apache.any23.validator.DOMDocument
-
Returns the original document.
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractor
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
-
Deprecated. Since 2.4. This extractor no longer wraps an RDF4J RDFParser. Do not use this method.
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractor
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.NQuadsExtractor
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.NTriplesExtractor
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractor
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.TriXExtractor
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.TurtleExtractor
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
Deprecated. Since 2.4. This extractor no longer wraps an RDF4J RDFParser. Do not use this method.
- getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
Deprecated. Since 2.4. This extractor no longer wraps an RDF4J RDFParser. Do not use this method.
- getParser(RDFFormat) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a new RDFParser instance.
- getParserEncoding() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
- getPath() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
- getPath() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
- getPathToLocalRoot() - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the sequence of ancestors from the document root to the local root (document).
- getPluralTextField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns a plural text field.
- getPluralUrlField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the list of URLs associated to the fields marked with class className.
- getPredicate() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
- getPrefixes() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
- getProperties() - Method in class org.apache.any23.extractor.microdata.ItemScope
- getProperty() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
- getPropertyPaths() - Method in class org.apache.any23.extractor.ExtractionResultImpl
- getPropertyPaths() - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
-
Returns all the collected property paths.
- getPropertyValue(Node) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Reads the value of an itemprop node.
- getRDFa10Parser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured RDFaParser, set to RDFa-1.0 compatibility mode.
- getRDFa11Parser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured RDFaParser, set to RDFa-1.1 compatibility mode.
- getRDFXMLParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured RDFXMLParser.
- getReason() - Method in exception org.apache.any23.extractor.yaml.YAMLProcessorException
- getRefs() - Method in class org.apache.any23.extractor.microdata.ItemScope
- getReport() - Method in class org.apache.any23.extractor.html.DocumentReport
- getReport() - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
- getReport() - Method in interface org.apache.any23.validator.ValidationReportBuilder
- getResourceRoots() - Method in class org.apache.any23.extractor.ExtractionResultImpl
- getResourceRoots() - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
-
Returns all the collected property roots.
- getRoot() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
- getRoot() - Method in class org.apache.any23.extractor.yaml.ElementsProcessor.ModelHolder
- getRuleActivations() - Method in class org.apache.any23.validator.DefaultValidationReport
- getRuleActivations() - Method in class org.apache.any23.validator.EmptyValidationReport
- getRuleActivations() - Method in interface org.apache.any23.validator.ValidationReport
-
Returns the list of activated rules.
- getRuleStr() - Method in class org.apache.any23.validator.ValidationReport.RuleActivation
- getSingularTextField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns a singular text field.
- getSingularUrlField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the URL associated to the field marked with class className.
- getSoTimeout() - Method in class org.apache.any23.http.DefaultHTTPClient
- getSubject() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
- getSubject() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
- getSupportedMIMETypes() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
- getSupportedSettings() - Method in class org.apache.any23.writer.JSONLDWriterFactory
- getSupportedSettings() - Method in class org.apache.any23.writer.JSONWriterFactory
-
Deprecated.
- getSupportedSettings() - Method in class org.apache.any23.writer.NQuadsWriterFactory
- getSupportedSettings() - Method in class org.apache.any23.writer.NTriplesWriterFactory
- getSupportedSettings() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
- getSupportedSettings() - Method in class org.apache.any23.writer.TriXWriterFactory
- getSupportedSettings() - Method in class org.apache.any23.writer.TurtleWriterFactory
- getSupportedSettings() - Method in class org.apache.any23.writer.URIListWriterFactory
- getText() - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Returns the text contained inside a node if leaf, null otherwise.
- getTopLevelItemScopeNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Returns only the itemScopes that are top level items.
- getTotalDocuments() - Method in class org.apache.any23.writer.ReportingTripleHandler
- getTotalTriples() - Method in class org.apache.any23.writer.ReportingTripleHandler
- getTripleFormat() - Method in class org.apache.any23.writer.JSONLDWriterFactory
- getTripleFormat() - Method in class org.apache.any23.writer.JSONWriterFactory
-
Deprecated.
- getTripleFormat() - Method in class org.apache.any23.writer.NQuadsWriterFactory
- getTripleFormat() - Method in class org.apache.any23.writer.NTriplesWriterFactory
- getTripleFormat() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
- getTripleFormat() - Method in class org.apache.any23.writer.TriXWriterFactory
- getTripleFormat() - Method in class org.apache.any23.writer.TurtleWriterFactory
- getTripleFormat() - Method in class org.apache.any23.writer.URIListWriterFactory
- getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.JSONLDWriterFactory
- getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.JSONWriterFactory
-
Deprecated.
- getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.NQuadsWriterFactory
- getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.NTriplesWriterFactory
- getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.RDFXMLWriterFactory
- getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.TriXWriterFactory
- getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.TurtleWriterFactory
- getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.URIListWriterFactory
- getTriXParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured TriXParser.
- getTurtleParserInstance(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
-
Returns a new instance of a configured TurtleParser.
- getType() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- getType() - Method in class org.apache.any23.extractor.microdata.ItemScope
- getUriRegex() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- getUserAgent() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
- getUserAgent() - Method in interface org.apache.any23.http.HTTPClientConfiguration
- getValidatedDOM(boolean) - Method in class org.apache.any23.extractor.html.TagSoupParser
-
Returns the validated DOM and applies fixes on it if applyFix is set to true.
- getValidationReport() - Method in class org.apache.any23.ExtractionReport
- getValidationReport() - Method in class org.apache.any23.extractor.SingleDocumentExtractionReport
- getValue() - Method in class org.apache.any23.extractor.microdata.ItemProp
- getValue(Map<String, String>) - Method in class org.apache.any23.extractor.xpath.Term
-
Returns the value represented by this Term given the varMapping, the #isVar and #internalValue parameters.
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateGraph
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateObject
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplatePredicate
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateSubject
- getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.Term
- getWriter(RDFFormat, OutputStream) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a new RDFWriter instance.
- getWriter(RDFFormat, Writer) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a new RDFWriter instance.
- getxPath() - Method in class org.apache.any23.extractor.xpath.Variable
- getXpath() - Method in class org.apache.any23.extractor.microdata.Item
- getXPathForNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Does a reverse walking of the DOM tree to generate a unique XPath expression leading to this node.
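The reverse walk can be sketched as follows, using only org.w3c.dom. The exact string format produced by the real method is not stated in this index, so the /name[position] form, the class name XPathForNodeSketch, and the parse helper are assumptions of the example.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

final class XPathForNodeSketch {

    /** Builds a path such as /html[1]/body[1]/p[2] by walking from the element up to the root. */
    static String xpathFor(Node node) {
        StringBuilder path = new StringBuilder();
        for (Node cur = node;
                cur != null && cur.getNodeType() == Node.ELEMENT_NODE;
                cur = cur.getParentNode()) {
            // 1-based position among preceding sibling elements with the same name.
            int position = 1;
            for (Node sib = cur.getPreviousSibling(); sib != null; sib = sib.getPreviousSibling()) {
                if (sib.getNodeType() == Node.ELEMENT_NODE
                        && sib.getNodeName().equals(cur.getNodeName())) {
                    position++;
                }
            }
            path.insert(0, "/" + cur.getNodeName() + "[" + position + "]");
        }
        return path.toString(); // empty string for non-element input in this sketch
    }

    /** Hypothetical helper: parses an XML string, wrapping checked exceptions. */
    static Document parse(String xml) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Unparseable XML", e);
        }
    }

    public static void main(String[] args) {
        Document doc = parse("<html><body><p>x</p><p>y</p></body></html>");
        Node secondP = doc.getDocumentElement().getFirstChild().getLastChild();
        System.out.println(xpathFor(secondP)); // prints /html[1]/body[1]/p[2]
    }
}
```

Indexing each step by position is what makes the expression unique: without it, /html/body/p would match both paragraphs.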
- getXPathListForNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Returns a list of tag names representing the path from the document root to the given node n.
- getXSDDate(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Obtains an XML Schema compliant date from a textual representation of a date and a textual pattern for parsing it.
- GIVEN_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
H
- HAdrExtractor - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the h-adr microformat.
- HAdrExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HAdrExtractor
- HAdrExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
- HAdrExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
- handleComment(String) - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
- handleNamespace(String, String) - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
- handleStatement(Statement) - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
- hasAnyField() - Method in class org.apache.any23.extractor.html.HCardName
- hasAttribute(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Checks the presence of an attribute in the given node.
- hasAttribute(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Checks the presence of an attribute value in attributes that contain whitespace-separated lists of values.
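Matching a value inside a whitespace-separated attribute (as with class or rel) means comparing whole tokens rather than substrings. A minimal sketch of that check, assuming a hypothetical class AttributeTokenSketch and parse helper, and not taken from the DomUtils source:

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NamedNodeMap;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

final class AttributeTokenSketch {

    /** True if the attribute exists on the node and one of its whitespace-separated tokens equals token. */
    static boolean hasAttributeToken(Node node, String attribute, String token) {
        NamedNodeMap attrs = node.getAttributes(); // null for non-element nodes
        if (attrs == null) {
            return false;
        }
        Node attr = attrs.getNamedItem(attribute);
        if (attr == null) {
            return false;
        }
        for (String candidate : attr.getNodeValue().trim().split("\\s+")) {
            if (candidate.equals(token)) {
                return true;
            }
        }
        return false;
    }

    /** Hypothetical helper: parses an XML string, wrapping checked exceptions. */
    static Document parse(String xml) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Unparseable XML", e);
        }
    }

    public static void main(String[] args) {
        Node link = parse("<a rel=\"nofollow  me\"/>").getDocumentElement();
        System.out.println(hasAttributeToken(link, "rel", "me")); // prints true
        System.out.println(hasAttributeToken(link, "rel", "no")); // prints false
    }
}
```

The second call shows why a plain substring test would be wrong: "no" occurs inside "nofollow" but is not one of the listed values.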
- hasClassName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Tells whether an element has a class name, without checking its parents in the hierarchy, mimicking the CSS .foo match.
- hasField(String) - Method in class org.apache.any23.extractor.html.HCardName
- hashCode() - Method in class org.apache.any23.extractor.microdata.Item
- hashCode() - Method in class org.apache.any23.extractor.microdata.ItemProp
- hashCode() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- hashCode() - Method in class org.apache.any23.extractor.microdata.ItemScope
- hasIssues() - Method in class org.apache.any23.extractor.ExtractionResultImpl
- hasMatchingExtractors() - Method in class org.apache.any23.ExtractionReport
- hasMatchingExtractors() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Checks whether the given DocumentSource content activates at least one extractor.
- HCalendarExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the hCalendar microformat.
- HCalendarExtractor() - Constructor for class org.apache.any23.extractor.html.HCalendarExtractor
- HCalendarExtractorFactory - Class in org.apache.any23.extractor.html
- HCalendarExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HCalendarExtractorFactory
- HCardExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the hCard microformat.
- HCardExtractor - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the h-Card microformat.
- HCardExtractor() - Constructor for class org.apache.any23.extractor.html.HCardExtractor
- HCardExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HCardExtractor
- HCardExtractorFactory - Class in org.apache.any23.extractor.html
- HCardExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
- HCardExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HCardExtractorFactory
- HCardExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
- HCardName - Class in org.apache.any23.extractor.html
-
An HCard name, consisting of various parts.
- HCardName() - Constructor for class org.apache.any23.extractor.html.HCardName
- HEAD_TAG - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- HeadLinkExtractor - Class in org.apache.any23.extractor.html
-
This Extractor.TagSoupDOMExtractor implementation retrieves the LINKs declared within the HTML/HEAD page header.
- HeadLinkExtractor() - Constructor for class org.apache.any23.extractor.html.HeadLinkExtractor
- HeadLinkExtractorFactory - Class in org.apache.any23.extractor.html
- HeadLinkExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HeadLinkExtractorFactory
- HEntryExtractor - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the h-entry microformat.
- HEntryExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HEntryExtractor
- HEntryExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the h-entry microformat.
- HEntryExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
- HEventExtractor - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the h-event microformat.
- HEventExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HEventExtractor
- HEventExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
- HEventExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
- HGeoExtractor - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the h-geo microformat.
- HGeoExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HGeoExtractor
- HGeoExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
- HGeoExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
- HItemExtractor - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the h-item microformat.
- HItemExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HItemExtractor
- HItemExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
- HItemExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
- HListingExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the hListing microformat.
- HListingExtractor() - Constructor for class org.apache.any23.extractor.html.HListingExtractor
- HListingExtractorFactory - Class in org.apache.any23.extractor.html
- HListingExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HListingExtractorFactory
- HONORIFIC_PREFIX - Static variable in class org.apache.any23.extractor.html.HCardName
- HONORIFIC_SUFFIX - Static variable in class org.apache.any23.extractor.html.HCardName
- HProductExtractor - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the h-product microformat.
- HProductExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HProductExtractor
- HProductExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
- HProductExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
- HRecipeExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the hRecipe microformat.
- HRecipeExtractor - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the hRecipe microformat.
- HRecipeExtractor() - Constructor for class org.apache.any23.extractor.html.HRecipeExtractor
- HRecipeExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
- HRecipeExtractorFactory - Class in org.apache.any23.extractor.html
- HRecipeExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
- HRecipeExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HRecipeExtractorFactory
- HRecipeExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
- HREF_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- HREF_TAGS - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
-
List of tags providing the href property.
- HResumeExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the hResume microformat.
- HResumeExtractor - Class in org.apache.any23.extractor.html.microformats2
-
Extractor for the hResume microformat.
- HResumeExtractor() - Constructor for class org.apache.any23.extractor.html.HResumeExtractor
- HResumeExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HResumeExtractor
- HResumeExtractorFactory - Class in org.apache.any23.extractor.html
- HResumeExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
- HResumeExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HResumeExtractorFactory
- HResumeExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
- HReviewAggregateExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the hReview-aggregate microformat.
- HReviewAggregateExtractor() - Constructor for class org.apache.any23.extractor.html.HReviewAggregateExtractor
- HReviewAggregateExtractorFactory - Class in org.apache.any23.extractor.html
- HReviewAggregateExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
- HReviewExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the hReview microformat.
- HReviewExtractor() - Constructor for class org.apache.any23.extractor.html.HReviewExtractor
- HReviewExtractorFactory - Class in org.apache.any23.extractor.html
- HReviewExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HReviewExtractorFactory
- HTMLDocument - Class in org.apache.any23.extractor.html
-
A wrapper around the DOM representation of an HTML document.
- HTMLDocument(Node) - Constructor for class org.apache.any23.extractor.html.HTMLDocument
-
Constructor accepting the root node.
- HTMLDocument.TextField - Class in org.apache.any23.extractor.html
-
This class represents text extracted from the HTML DOM, related to the node from which that text was retrieved.
- HTMLMetaExtractor - Class in org.apache.any23.extractor.html
-
This extractor represents the HTML META tag values according to the HTML4 specification.
- HTMLMetaExtractor() - Constructor for class org.apache.any23.extractor.html.HTMLMetaExtractor
- HTMLMetaExtractorFactory - Class in org.apache.any23.extractor.html
- HTMLMetaExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
- HTTPClient - Interface in org.apache.any23.http
-
Abstraction for opening an InputStream on an HTTP IRI.
- HTTPClientConfiguration - Interface in org.apache.any23.http
-
Defines the HTTPClient configuration.
- HTTPDocumentSource - Class in org.apache.any23.source
-
HTTP implementation of DocumentSource.
- HTTPDocumentSource(HTTPClient, String) - Constructor for class org.apache.any23.source.HTTPDocumentSource
I
- ICalExtractor - Class in org.apache.any23.extractor.calendar
- ICalExtractor() - Constructor for class org.apache.any23.extractor.calendar.ICalExtractor
- ICalExtractorFactory - Class in org.apache.any23.extractor.calendar
- ICalExtractorFactory() - Constructor for class org.apache.any23.extractor.calendar.ICalExtractorFactory
- ICBMExtractor - Class in org.apache.any23.extractor.html
-
Extractor for "ICBM coordinates" provided as META headers in the head of an HTML page.
- ICBMExtractor() - Constructor for class org.apache.any23.extractor.html.ICBMExtractor
- ICBMExtractorFactory - Class in org.apache.any23.extractor.html
- ICBMExtractorFactory() - Constructor for class org.apache.any23.extractor.html.ICBMExtractorFactory
- IDENTIFIER - Static variable in class org.apache.any23.writer.JSONLDWriterFactory
- IDENTIFIER - Static variable in class org.apache.any23.writer.JSONWriterFactory
-
Deprecated.
- IDENTIFIER - Static variable in class org.apache.any23.writer.NQuadsWriterFactory
- IDENTIFIER - Static variable in class org.apache.any23.writer.NTriplesWriterFactory
- IDENTIFIER - Static variable in class org.apache.any23.writer.RDFXMLWriterFactory
- IDENTIFIER - Static variable in class org.apache.any23.writer.TriXWriterFactory
- IDENTIFIER - Static variable in class org.apache.any23.writer.TurtleWriterFactory
- IDENTIFIER - Static variable in class org.apache.any23.writer.URIListWriterFactory
- IgnoreAccidentalRDFa - Class in org.apache.any23.filter
-
A TripleHandler that suppresses output of the RDFa parser if the document only contains "accidental" RDFa, like stylesheet links and other non-RDFa uses of HTML's
- IgnoreAccidentalRDFa(TripleHandler) - Constructor for class org.apache.any23.filter.IgnoreAccidentalRDFa
- IgnoreAccidentalRDFa(TripleHandler, boolean) - Constructor for class org.apache.any23.filter.IgnoreAccidentalRDFa
-
Constructor.
- IgnoreTitlesOfEmptyDocuments - Class in org.apache.any23.filter
-
A TripleHandler that suppresses output of the TitleExtractor unless some other triples could be parsed from the document.
- IgnoreTitlesOfEmptyDocuments(TripleHandler) - Constructor for class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
- implementJavaNaming(String) - Static method in class org.apache.any23.util.StringUtils
-
Converts a string using the following convention: replaces '-' with '_', removes whitespace characters, and makes the first letter of each following word uppercase, e.g. 'some string' -> 'someString'. If the input string does not contain whitespace, it is returned unchanged.
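One possible reading of this convention is sketched below. It is not the StringUtils source; the class and method names are hypothetical, and the handling of edge cases (leading or trailing whitespace, hyphens in strings without whitespace) is an assumption based only on the description above.

```java
final class JavaNamingSketch {

    /** Applies the naming convention as read from the description; see assumptions in comments. */
    static String toJavaName(String in) {
        // Per the description, input without whitespace is returned unchanged,
        // which in this reading also leaves any '-' in place.
        if (in.chars().noneMatch(Character::isWhitespace)) {
            return in;
        }
        // Assumption: surrounding whitespace is dropped before splitting.
        String[] words = in.replace('-', '_').trim().split("\\s+");
        StringBuilder out = new StringBuilder(words[0]);
        for (int i = 1; i < words.length; i++) {
            // Capitalize the first letter of every word after the first.
            out.append(Character.toUpperCase(words[i].charAt(0)))
               .append(words[i].substring(1));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(toJavaName("some string"));          // prints someString
        System.out.println(toJavaName("multi-part name here")); // prints multi_partNameHere
        System.out.println(toJavaName("no-space"));             // prints no-space
    }
}
```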
- includes(Class<? extends MicroformatExtractor>, Class<? extends MicroformatExtractor>) - Static method in class org.apache.any23.extractor.html.MicroformatExtractor
-
This method checks if there is a native nesting relationship between two MicroformatExtractor classes.
- Includes - Annotation Type in org.apache.any23.extractor.html.annotations
-
This annotation can be used to decorate a MicroformatExtractor to declare which other Microformats it can nest.
- Includes - Annotation Type in org.apache.any23.extractor.html.microformats2.annotations
-
This annotation can be used to decorate a MicroformatExtractor to declare which other Microformats it can nest.
- INFO - org.apache.any23.validator.ValidationReport.IssueLevel
- init(HTTPClientConfiguration) - Method in class org.apache.any23.http.DefaultHTTPClient
- init(HTTPClientConfiguration) - Method in interface org.apache.any23.http.HTTPClient
-
Initializes the HTTP client.
- inputStreamToDocument(InputStream) - Static method in class org.apache.any23.util.StreamUtils
- iri(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates an IRI.
- iri(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates an IRI.
- IRI_PATH_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- IRI_PREFIX_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- IRI_SCHEMA_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- isAbsoluteIRI(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- isAbsoluteIRI(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Checks if
href
is absolute or not. - isAncestorOf(Node, Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Checks whether a node is an ancestor of, or the same as, another node.
- isAncestorOf(Node, Node, boolean) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Checks whether a node is an ancestor of, or the same as, another node.
- isAnnotated() - Method in class org.apache.any23.writer.JSONWriter
-
Deprecated.
- isAnnotated() - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
If
true
then the produced RDF is annotated with the extractors used to generate the specific statements. - isAnnotated() - Method in class org.apache.any23.writer.URIListWriter
- isCURIE(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- isCURIEBNode(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- isDate() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- isDocBlocked() - Method in class org.apache.any23.filter.ExtractionContextBlocker
- isElementNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Verifies if the given target node is an element.
- isFloat() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- isInteger() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- isItemProp(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Check whether a node is an itemProp.
- isItemScope(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
-
Check whether a node is an itemScope.
- isLink() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- isLocal() - Method in class org.apache.any23.source.ByteArrayDocumentSource
- isLocal() - Method in class org.apache.any23.source.FileDocumentSource
- isLocal() - Method in class org.apache.any23.source.HTTPDocumentSource
- isLocal() - Method in class org.apache.any23.source.StringDocumentSource
- isMultiField(String) - Method in class org.apache.any23.extractor.html.HCardName
- isNested() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- isNumber() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- isOnline(String) - Static method in class org.apache.any23.util.URLUtils
-
Verifies if the specified URL is reachable online.
- isPlain() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- isPrefix(String, String) - Static method in class org.apache.any23.util.StringUtils
-
Check whether string
candidatePrefix
is a prefix of string container
. - isRegisteredName(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
- isRelativeNode(Node) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- isStopAtFirstError() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
- Issue(ValidationReport.IssueLevel, String, Node) - Constructor for class org.apache.any23.validator.ValidationReport.Issue
- isSuffix(String, String) - Static method in class org.apache.any23.util.StringUtils
-
Check whether string
candidateSuffix
is a suffix of string container
. - isVar() - Method in class org.apache.any23.extractor.xpath.Term
- isVerifyDataType() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
- isXMLNSDeclared(Document) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- Item - Class in org.apache.any23.extractor.microdata
-
Base class for Microdata items.
- Item(String) - Constructor for class org.apache.any23.extractor.microdata.Item
-
Constructor.
- ItemProp - Class in org.apache.any23.extractor.microdata
-
Describes a Microdata item property.
- ItemProp(String, String, ItemPropValue) - Constructor for class org.apache.any23.extractor.microdata.ItemProp
-
Constructor.
- ITEMPROP_ATTRIBUTE - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
- ItemPropValue - Class in org.apache.any23.extractor.microdata
-
Describes a possible value for a Microdata item property.
- ItemPropValue(Object, ItemPropValue.Type) - Constructor for class org.apache.any23.extractor.microdata.ItemPropValue
-
Constructor.
- ItemPropValue.Type - Enum in org.apache.any23.extractor.microdata
-
Supported types.
- ItemScope - Class in org.apache.any23.extractor.microdata
-
This class describes a Microdata itemscope.
- ItemScope(String, ItemProp[], String, String[], String, String) - Constructor for class org.apache.any23.extractor.microdata.ItemScope
-
Constructor.
- ITEMSCOPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
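The isPrefix(String, String) and isSuffix(String, String) entries above describe simple containment checks. The following is a minimal sketch of that behavior, not the Any23 source: the class name AffixSketch is hypothetical, and the argument order (candidate first, container second) is an assumption taken from the order of the names in the descriptions.

```java
// Hypothetical stand-in for the checks described in the index; not the Any23 implementation.
final class AffixSketch {

    private AffixSketch() {
    }

    // Assumed argument order: the candidate first, then the string that may contain it.
    static boolean isPrefix(String candidatePrefix, String container) {
        return container.startsWith(candidatePrefix);
    }

    static boolean isSuffix(String candidateSuffix, String container) {
        return container.endsWith(candidateSuffix);
    }

    public static void main(String[] args) {
        System.out.println(isPrefix("item", "itemscope"));
        System.out.println(isSuffix("scope", "itemscope"));
    }
}
```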
J
- JCalExtractor - Class in org.apache.any23.extractor.calendar
- JCalExtractor() - Constructor for class org.apache.any23.extractor.calendar.JCalExtractor
- JCalExtractorFactory - Class in org.apache.any23.extractor.calendar
- JCalExtractorFactory() - Constructor for class org.apache.any23.extractor.calendar.JCalExtractorFactory
- join(String, String...) - Static method in class org.apache.any23.util.StringUtils
-
Joins the given input string
data
list using the specified delimiter
. - JSONLDExtractor - Class in org.apache.any23.extractor.rdf
-
Concrete implementation of
Extractor.ContentExtractor
handling JSON-LD format. - JSONLDExtractor() - Constructor for class org.apache.any23.extractor.rdf.JSONLDExtractor
- JSONLDExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.JSONLDExtractor
-
Deprecated since 2.4. This extractor has never supported these settings. Use
JSONLDExtractor()
instead. - JSONLDExtractorFactory - Class in org.apache.any23.extractor.rdf
- JSONLDExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
- JSONLDWriter - Class in org.apache.any23.writer
-
Implementation of JSON-LD
TripleWriter
. - JSONLDWriter(OutputStream) - Constructor for class org.apache.any23.writer.JSONLDWriter
- JSONLDWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.JSONLDWriter
- JSONLDWriterFactory - Class in org.apache.any23.writer
- JSONLDWriterFactory() - Constructor for class org.apache.any23.writer.JSONLDWriterFactory
- JSONWriter - Class in org.apache.any23.writer
-
Deprecated since 2.3. Use
JSONLDWriter
instead. - JSONWriter(OutputStream) - Constructor for class org.apache.any23.writer.JSONWriter
-
Deprecated.
- JSONWriterFactory - Class in org.apache.any23.writer
-
Deprecated since 2.3. Use
JSONLDWriterFactory
instead. - JSONWriterFactory() - Constructor for class org.apache.any23.writer.JSONWriterFactory
-
Deprecated.
- JsoupUtils - Class in org.apache.any23.extractor.html
- JsoupUtils() - Constructor for class org.apache.any23.extractor.html.JsoupUtils
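The join(String, String...) entry above describes joining a list of strings with a delimiter. A minimal sketch of that behavior using only the JDK follows; it is not the Any23 implementation. The class name JoinSketch is hypothetical, and the parameter order (delimiter first, then the data) is an assumption based on the signature shown in the index.

```java
// Hypothetical stand-in for the join behavior described in the index; not the Any23 source.
final class JoinSketch {

    private JoinSketch() {
    }

    // Assumed parameter order: delimiter first, then the strings to join.
    static String join(String delimiter, String... data) {
        // String.join places the delimiter between consecutive elements.
        return String.join(delimiter, data);
    }

    public static void main(String[] args) {
        System.out.println(join(", ", "a", "b", "c"));
    }
}
```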
L
- LicenseExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the rel-license microformat.
- LicenseExtractor() - Constructor for class org.apache.any23.extractor.html.LicenseExtractor
- LicenseExtractorFactory - Class in org.apache.any23.extractor.html
- LicenseExtractorFactory() - Constructor for class org.apache.any23.extractor.html.LicenseExtractorFactory
- Link - org.apache.any23.extractor.microdata.ItemPropValue.Type
- listFilesRecursively(File, FilenameFilter) - Static method in class org.apache.any23.util.FileUtils
-
Lists the content of a dir applying the specified filter.
- literal(boolean) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - literal(byte) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - literal(double) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - literal(float) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - literal(int) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - literal(long) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - literal(short) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - literal(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - literal(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - literal(String, IRI) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a
Literal
. - LITERAL - org.apache.any23.extractor.xpath.TemplateObject.Type
- LocalCopyFactory - Interface in org.apache.any23.source
-
A factory that creates local copies of
DocumentSource
s. - logger - Static variable in class org.apache.any23.Any23
- LoggingTripleHandler - Class in org.apache.any23.writer
-
Triple handler decorator useful for logging purposes.
- LoggingTripleHandler(TripleHandler, PrintWriter) - Constructor for class org.apache.any23.writer.LoggingTripleHandler
- LogUtils - Class in org.apache.any23.util
-
Deprecated since 2.3. JUL is no longer used in this project.
- LogUtils() - Constructor for class org.apache.any23.util.LogUtils
-
Deprecated.
M
- makeIRI() - Static method in class org.apache.any23.rdf.RDFUtils
- makeIRI(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Convert string to either IRI or Literal.
- makeIRI(String, IRI) - Static method in class org.apache.any23.rdf.RDFUtils
- makeIRI(String, IRI, boolean) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates implementation of
Resource
from given arguments: type and docIRI. - makeIRI(IRI) - Static method in class org.apache.any23.rdf.RDFUtils
- ManchesterSyntaxExtractor - Class in org.apache.any23.extractor.rdf
-
Concrete implementation of
Extractor.ContentExtractor
handling Manchester Syntax format. - ManchesterSyntaxExtractor() - Constructor for class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractor
- ManchesterSyntaxExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractor
- ManchesterSyntaxExtractorFactory - Class in org.apache.any23.extractor.rdf
- ManchesterSyntaxExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
- mark(int) - Method in class org.apache.any23.util.ReaderInputStream
-
Marks the read limit of the StringReader.
- markSupported() - Method in class org.apache.any23.util.ReaderInputStream
- MathUtils - Class in org.apache.any23.util
-
Mathematical utility functions.
- md5(String) - Static method in class org.apache.any23.util.MathUtils
-
Creates a weak MD5 hash for a given string.
- MemCopyFactory - Class in org.apache.any23.source
-
Creates local copies of
DocumentSource
by reading them into an in-memory buffer. - MemCopyFactory() - Constructor for class org.apache.any23.source.MemCopyFactory
- MetaNameMisuseFix - Class in org.apache.any23.validator.rule
-
Fixes the misuse of the meta name attribute.
- MetaNameMisuseFix() - Constructor for class org.apache.any23.validator.rule.MetaNameMisuseFix
- MetaNameMisuseRule - Class in org.apache.any23.validator.rule
-
Checks whether the meta attribute name is used to contain a property.
- MetaNameMisuseRule() - Constructor for class org.apache.any23.validator.rule.MetaNameMisuseRule
- MicrodataExtractor - Class in org.apache.any23.extractor.microdata
-
Default implementation of Microdata extractor, based on
Extractor.TagSoupDOMExtractor
. - MicrodataExtractor() - Constructor for class org.apache.any23.extractor.microdata.MicrodataExtractor
- MicrodataExtractorFactory - Class in org.apache.any23.extractor.microdata
- MicrodataExtractorFactory() - Constructor for class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
- MicrodataParser - Class in org.apache.any23.extractor.microdata
-
This class provides utility methods for handling Microdata nodes contained within a DOM document.
- MicrodataParser(Document) - Constructor for class org.apache.any23.extractor.microdata.MicrodataParser
- MicrodataParserException - Exception in org.apache.any23.extractor.microdata
-
Defines an exception occurring while parsing Microdata.
- MicrodataParserException(String, Throwable, Node) - Constructor for exception org.apache.any23.extractor.microdata.MicrodataParserException
- MicrodataParserException(String, Node) - Constructor for exception org.apache.any23.extractor.microdata.MicrodataParserException
- MicrodataParserReport - Class in org.apache.any23.extractor.microdata
-
This class describes the report of the
MicrodataParser
. - MicrodataParserReport(ItemScope[]) - Constructor for class org.apache.any23.extractor.microdata.MicrodataParserReport
- MicrodataParserReport(ItemScope[], MicrodataParserException[]) - Constructor for class org.apache.any23.extractor.microdata.MicrodataParserReport
- MicroformatExtractor - Class in org.apache.any23.extractor.html
-
The abstract base class for any Microformat specification extractor.
- MicroformatExtractor() - Constructor for class org.apache.any23.extractor.html.MicroformatExtractor
- Microformats2Prefixes - Class in org.apache.any23.extractor.html.microformats2
- Microformats2Prefixes() - Constructor for class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
- MIME_TYPE - Static variable in class org.apache.any23.writer.JSONLDWriterFactory
- MIME_TYPE - Static variable in class org.apache.any23.writer.JSONWriterFactory
-
Deprecated.
- MIME_TYPE - Static variable in class org.apache.any23.writer.NQuadsWriterFactory
- MIME_TYPE - Static variable in class org.apache.any23.writer.NTriplesWriterFactory
- MIME_TYPE - Static variable in class org.apache.any23.writer.RDFXMLWriterFactory
- MIME_TYPE - Static variable in class org.apache.any23.writer.TriXWriterFactory
- MIME_TYPE - Static variable in class org.apache.any23.writer.TurtleWriterFactory
- MIME_TYPE - Static variable in class org.apache.any23.writer.URIListWriterFactory
- MissingItemscopeAttributeValueFix - Class in org.apache.any23.validator.rule
-
Fix for the issue described within
MissingItemscopeAttributeValueRule
- MissingItemscopeAttributeValueFix() - Constructor for class org.apache.any23.validator.rule.MissingItemscopeAttributeValueFix
-
Default constructor
- MissingItemscopeAttributeValueRule - Class in org.apache.any23.validator.rule
-
This fixes missing attribute values for the 'itemscope' attribute. Typically, when such a snippet of XHTML is fed through the
RDFa11Extractor
, and subsequently to Sesame's SesameRDFaParser, it will result in the following behavior. - MissingItemscopeAttributeValueRule() - Constructor for class org.apache.any23.validator.rule.MissingItemscopeAttributeValueRule
-
Default constructor
- MissingOpenGraphNamespaceRule - Class in org.apache.any23.validator.rule
-
This rule detects the issue of missing Open Graph namespace.
- MissingOpenGraphNamespaceRule() - Constructor for class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
- ModelHolder(Value, Model) - Constructor for class org.apache.any23.extractor.yaml.ElementsProcessor.ModelHolder
- multiply(char, int) - Static method in class org.apache.any23.util.StringUtils
-
Builds a string composed of the given char
c
n
times. - mv(File, File) - Static method in class org.apache.any23.util.FileUtils
-
Moves a
target
file to a new dest
location.
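The multiply(char, int) and md5(String) entries above describe two small utilities. The sketch below shows one way such helpers could be written with the JDK alone; it is not the Any23 source. The class name UtilSketch is hypothetical, and the UTF-8 input encoding and lowercase hexadecimal output of md5 are assumptions, since the index does not state the return format.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical stand-ins for the utilities described in the index; not the Any23 source.
final class UtilSketch {

    private UtilSketch() {
    }

    // Builds a string made of the char c repeated n times.
    static String multiply(char c, int n) {
        if (n < 0) {
            throw new IllegalArgumentException("n must be non-negative");
        }
        StringBuilder sb = new StringBuilder(n);
        for (int i = 0; i < n; i++) {
            sb.append(c);
        }
        return sb.toString();
    }

    // Assumption: UTF-8 input bytes and lowercase hex output.
    // MD5 is a weak hash and should not be used for security purposes.
    static String md5(String s) {
        try {
            MessageDigest md = MessageDigest.getInstance("MD5");
            byte[] digest = md.digest(s.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder(digest.length * 2);
            for (byte b : digest) {
                // Mask to an unsigned value so each byte yields exactly two hex digits.
                hex.append(String.format("%02x", b & 0xff));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // Every Java platform is required to provide MD5.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(multiply('-', 5));
        System.out.println(md5("abc"));
    }
}
```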
N
- NAME - Static variable in class org.apache.any23.extractor.csv.CSVExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.AdrExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.GeoExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.HCalendarExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.HCardExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.HeadLinkExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.HListingExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.HRecipeExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.HResumeExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.HReviewExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.ICBMExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.LicenseExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.SpeciesExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.TitleExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.html.XFNExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdf.TriXExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdf.TurtleExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.xpath.XPathExtractorFactory
- NAME - Static variable in class org.apache.any23.extractor.yaml.YAMLExtractorFactory
- Nested - org.apache.any23.extractor.microdata.ItemPropValue.Type
- NICKNAME - Static variable in class org.apache.any23.extractor.html.HCardName
- NODES_WITH_INVALID_ABOUT - Static variable in class org.apache.any23.validator.rule.AboutNotURIRule
- nodeToInputStream(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Converts a W3C DOM node to an InputStream.
- notifyIssue(IssueReport.IssueLevel, String, long, long) - Method in class org.apache.any23.extractor.ExtractionResultImpl
- NQuadsExtractor - Class in org.apache.any23.extractor.rdf
-
Concrete implementation of
Extractor.ContentExtractor
handling N-Quads format. - NQuadsExtractor() - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractor
- NQuadsExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractor
- NQuadsExtractorFactory - Class in org.apache.any23.extractor.rdf
- NQuadsExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
- NQuadsWriter - Class in org.apache.any23.writer
-
Implementation of an N-Quads
TripleWriter
. - NQuadsWriter(OutputStream) - Constructor for class org.apache.any23.writer.NQuadsWriter
- NQuadsWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.NQuadsWriter
- NQuadsWriterFactory - Class in org.apache.any23.writer
- NQuadsWriterFactory() - Constructor for class org.apache.any23.writer.NQuadsWriterFactory
- NTriplesExtractor - Class in org.apache.any23.extractor.rdf
-
Concrete implementation of
Extractor.ContentExtractor
handling NTriples format. - NTriplesExtractor() - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractor
-
Default constructor, with no verification of data types and no stop at first error.
- NTriplesExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractor
- NTriplesExtractorFactory - Class in org.apache.any23.extractor.rdf
- NTriplesExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
- NTriplesWriter - Class in org.apache.any23.writer
-
Implementation of an N-Triples
TripleWriter
. - NTriplesWriter(OutputStream) - Constructor for class org.apache.any23.writer.NTriplesWriter
- NTriplesWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.NTriplesWriter
- NTriplesWriterFactory - Class in org.apache.any23.writer
- NTriplesWriterFactory() - Constructor for class org.apache.any23.writer.NTriplesWriterFactory
O
- openContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
- openContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
- openContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
- openContext(ExtractionContext) - Method in class org.apache.any23.writer.TripleWriterHandler
-
Writers may override this method to handle an "openContext" extraction event.
- OPENGRAPH_PROTOCOL_NS - Static variable in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
- OpenGraphNamespaceFix - Class in org.apache.any23.validator.rule
-
This fixes the missing Open Graph protocol.
- OpenGraphNamespaceFix() - Constructor for class org.apache.any23.validator.rule.OpenGraphNamespaceFix
- openInputStream() - Method in class org.apache.any23.source.ByteArrayDocumentSource
- openInputStream() - Method in class org.apache.any23.source.FileDocumentSource
- openInputStream() - Method in class org.apache.any23.source.HTTPDocumentSource
- openInputStream() - Method in class org.apache.any23.source.StringDocumentSource
- openInputStream(String) - Method in class org.apache.any23.http.DefaultHTTPClient
-
Opens an
InputStream
from a given IRI. - openInputStream(String) - Method in interface org.apache.any23.http.HTTPClient
-
Opens the input stream for the given target IRI.
- openSubResult(ExtractionContext) - Method in class org.apache.any23.extractor.ExtractionResultImpl
- openSubResult(ExtractionContext) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
- org.apache.any23 - package org.apache.any23
-
The Any23 root package defines the
Any23
library facade and auxiliary classes. - org.apache.any23.extractor - package org.apache.any23.extractor
-
This package contains classes and interfaces modeling the
Extractor
API. - org.apache.any23.extractor.calendar - package org.apache.any23.extractor.calendar
-
This package contains extractors for various calendar serialization formats.
- org.apache.any23.extractor.csv - package org.apache.any23.extractor.csv
-
This package contains the definition of a specific
Extractor
for CSV files as specified in RFC 4180. - org.apache.any23.extractor.html - package org.apache.any23.extractor.html
-
All the various
Extractor
needed to distill RDF from Microformats in HTML pages are contained in this package. - org.apache.any23.extractor.html.annotations - package org.apache.any23.extractor.html.annotations
-
This package contains the annotations needed to describe the single nesting relations among different Microformats.
- org.apache.any23.extractor.html.microformats2 - package org.apache.any23.extractor.html.microformats2
-
All the various
Extractor
needed to distill RDF from Microformats in HTML pages are contained in this package. - org.apache.any23.extractor.html.microformats2.annotations - package org.apache.any23.extractor.html.microformats2.annotations
-
This package contains the annotations needed to describe the single nesting relations among different Microformats.
- org.apache.any23.extractor.microdata - package org.apache.any23.extractor.microdata
-
This package provides classes for dealing with Microdata extraction.
- org.apache.any23.extractor.rdf - package org.apache.any23.extractor.rdf
-
This package contains the concrete implementations (and utility classes) to perform the extraction from various RDF serialization formats.
- org.apache.any23.extractor.rdfa - package org.apache.any23.extractor.rdfa
-
This package contains the definition of a specific
Extractor
for RDFa. - org.apache.any23.extractor.xpath - package org.apache.any23.extractor.xpath
-
This package provides an XPath extractor able to generate RDF statements from any HTML page on the basis of a set of specified rules and a list of statement templates.
- org.apache.any23.extractor.yaml - package org.apache.any23.extractor.yaml
-
This package provides a YAML extractor able to generate RDF statements from any YAML file.
- org.apache.any23.filter - package org.apache.any23.filter
-
This package defines specific
TripleHandler
s able to filter out documents and triples that don't meet particular conditions. - org.apache.any23.http - package org.apache.any23.http
-
This package contains classes and interfaces to implement a small multi-threaded HTTP Client used to fetch data sources.
- org.apache.any23.rdf - package org.apache.any23.rdf
-
This package provides some utility classes to manage RDF models and prefixes.
- org.apache.any23.source - package org.apache.any23.source
-
Package containing some auxiliary classes to handle different types of
InputStream
sources. - org.apache.any23.util - package org.apache.any23.util
-
This package collects a set of internal library utility classes.
- org.apache.any23.validator - package org.apache.any23.validator
- org.apache.any23.validator.rule - package org.apache.any23.validator.rule
- org.apache.any23.vocab - package org.apache.any23.vocab
- org.apache.any23.writer - package org.apache.any23.writer
-
This package collects a set of
TripleHandler
decorators and specific RDF format TripleWriter
implementations.
P
- parse(InputStream, String, String) - Static method in class org.apache.any23.extractor.html.JsoupUtils
- parseDateTime(String) - Static method in class org.apache.any23.extractor.microdata.ItemPropValue
- parseRDF(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Parses the content of the
resource
file guessing the content format from the extension. - parseRDF(RDFFormat, InputStream) - Static method in class org.apache.any23.rdf.RDFUtils
-
Parses the content of
is
input stream with the specified parser p
using ''
as base org.eclipse.rdf4j.model.IRI. - parseRDF(RDFFormat, InputStream, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Parses the content of
is
input stream with the specified parser p
using baseIRI
. - parseRDF(RDFFormat, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Parses the content of
in
string with the specified parser p
using ''
as base org.eclipse.rdf4j.model.IRI. - Plain - org.apache.any23.extractor.microdata.ItemPropValue.Type
- PopularPrefixes - Class in org.apache.any23.rdf
-
This class acts as a container for various well-known and adopted RDF Vocabulary prefixes.
- PopularPrefixes() - Constructor for class org.apache.any23.rdf.PopularPrefixes
- PREFIX_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- prefixes - Static variable in class org.apache.any23.extractor.yaml.YAMLExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.csv.CSVExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.AdrExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.GeoExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.HCalendarExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.HCardExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.HeadLinkExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.HListingExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.HRecipeExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.HResumeExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.HReviewExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.ICBMExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.LicenseExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.SpeciesExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.TitleExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.html.XFNExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdf.TriXExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdf.TurtleExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
- PREFIXES - Static variable in class org.apache.any23.extractor.xpath.XPathExtractorFactory
- PRETTY_PRINT - Static variable in class org.apache.any23.writer.WriterSettings
-
Directive to writer that output should be printed in a way to maximize human readability.
- PRINT_ASCII - Static variable in class org.apache.any23.writer.WriterSettings
-
Directive to writer that at least the non-ASCII characters should be escaped.
- printOut(ExtractionResult, Map<String, String>) - Method in class org.apache.any23.extractor.xpath.QuadTemplate
-
Prints out this quad template in the given ExtractionResult, using the passed variableAssignment to expand variables.
- printReport() - Method in class org.apache.any23.writer.ReportingTripleHandler
- printReport(PrintStream) - Method in class org.apache.any23.extractor.ExtractionResultImpl
- process(Document, ExtractionResult) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- process(Document, ExtractionResult) - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
-
Processes this extraction rule on the given document.
- processDocument(URL, Document, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- processList(IRI, List<Object>) - Method in class org.apache.any23.extractor.yaml.ElementsProcessor
- processMap(IRI, Map<String, Object>, Value) - Method in class org.apache.any23.extractor.yaml.ElementsProcessor
-
This method processes a map with a non-bnode root.
- PROFILE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- PROPERTY_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- PROPERTY_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
- PropertyPath(String[], Resource, Resource, BNode, Class<? extends MicroformatExtractor>) - Constructor for class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
- putData(String, Object) - Method in class org.apache.any23.validator.DefaultRuleContext
- putData(String, T) - Method in interface org.apache.any23.validator.RuleContext
-
Puts a data item into the context.
Q
- quad(String, String, String, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a statement of type:
toValue(s), toValue(p), toValue(o), toValue(g)
- quad(Resource, IRI, Value, Resource) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Statement.
- QuadTemplate - Class in org.apache.any23.extractor.xpath
-
This class models an NQuads template, that is, a quadruple in which any component can be a variable.
- QuadTemplate(TemplateSubject, TemplatePredicate, TemplateObject) - Constructor for class org.apache.any23.extractor.xpath.QuadTemplate
-
Constructor for template with no graph.
- QuadTemplate(TemplateSubject, TemplatePredicate, TemplateObject, TemplateGraph) - Constructor for class org.apache.any23.extractor.xpath.QuadTemplate
-
Constructor.
R
- RDFa11Extractor - Class in org.apache.any23.extractor.rdfa
-
Extractor implementation for RDFa 1.1 specification.
- RDFa11Extractor() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Extractor
- RDFa11Extractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
Deprecated. Since 2.4. This extractor has never supported these settings. Use RDFa11Extractor() instead.
- RDFa11ExtractorFactory - Class in org.apache.any23.extractor.rdfa
- RDFa11ExtractorFactory() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
- RDFa11Parser - Class in org.apache.any23.extractor.rdfa
-
Deprecated. Since 2.3 the Rio implementations are used to parse RDFa. Look at RDFParserFactory.getRDFa10Parser(boolean, boolean, org.apache.any23.extractor.ExtractionContext, org.apache.any23.extractor.ExtractionResult) and RDFParserFactory.getRDFa11Parser(boolean, boolean, org.apache.any23.extractor.ExtractionContext, org.apache.any23.extractor.ExtractionResult).
- RDFa11Parser() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- RDFa11ParserException - Exception in org.apache.any23.extractor.rdfa
-
Deprecated. Since 2.3 the Rio implementations are used to parse RDFa.
- RDFa11ParserException(String) - Constructor for exception org.apache.any23.extractor.rdfa.RDFa11ParserException
-
Deprecated.
- RDFa11ParserException(String, Throwable) - Constructor for exception org.apache.any23.extractor.rdfa.RDFa11ParserException
-
Deprecated.
- RDFaExtractor - Class in org.apache.any23.extractor.rdfa
-
Extractor implementation for RDFa 1.0 specification.
- RDFaExtractor() - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractor
- RDFaExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractor
-
Deprecated. Since 2.4. This extractor has never supported these settings. Use RDFaExtractor() instead.
- RDFaExtractorFactory - Class in org.apache.any23.extractor.rdfa
- RDFaExtractorFactory() - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
- RDFHandlerAdapter - Class in org.apache.any23.extractor.rdf
-
An RDFHandler that relays statements and prefix definitions to an ExtractionResult.
- RDFHandlerAdapter(ExtractionResult) - Constructor for class org.apache.any23.extractor.rdf.RDFHandlerAdapter
- RDFParserFactory - Class in org.apache.any23.extractor.rdf
-
This factory provides a common logic for creating and configuring correctly any RDF parser used within the library.
- RDFParserFactory() - Constructor for class org.apache.any23.extractor.rdf.RDFParserFactory
- RDFSchemaUtils - Class in org.apache.any23.vocab
-
This class provides a set of methods for generating RDF Schema.
- RDFUtils - Class in org.apache.any23.rdf
-
Basic class providing a set of utility methods when dealing with RDF.
- RDFWriterTripleHandler - Class in org.apache.any23.writer
-
A TripleHandler that writes triples to a Sesame RDFWriter, e.g. for serialization using one of Sesame's writers.
- RDFXMLExtractor - Class in org.apache.any23.extractor.rdf
-
Concrete implementation of Extractor.ContentExtractor able to perform the extraction on RDF/XML documents.
- RDFXMLExtractor() - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractor
-
Default constructor, with no verification of data types and no stop at first error.
- RDFXMLExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractor
-
Constructor that allows specifying the validation and error handling policies.
- RDFXMLExtractorFactory - Class in org.apache.any23.extractor.rdf
- RDFXMLExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
- RDFXMLWriter - Class in org.apache.any23.writer
-
RDF/XML TripleWriter implementation.
- RDFXMLWriter(OutputStream) - Constructor for class org.apache.any23.writer.RDFXMLWriter
- RDFXMLWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.RDFXMLWriter
- RDFXMLWriterFactory - Class in org.apache.any23.writer
- RDFXMLWriterFactory() - Constructor for class org.apache.any23.writer.RDFXMLWriterFactory
- read() - Method in class org.apache.any23.extractor.html.SpanCloserInputStream
- read() - Method in class org.apache.any23.util.ReaderInputStream
-
Reads from the Reader, returning the same value.
- read(byte[], int, int) - Method in class org.apache.any23.util.ReaderInputStream
-
Reads from the Reader into a byte array.
- readAttribute(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
-
Reads an attribute while avoiding NullPointerExceptions; if the attribute is missing, it returns an empty string.
- readAttribute(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Reads the value of an attribute, returning the empty string if not present.
- readAttribute(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Reads the value of the specified attribute, returning the defaultValue string if not present.
- readAttributeWithPrefix(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Reads the value of the first attribute whose name matches the specified attributePrefix.
- ReaderInputStream - Class in org.apache.any23.util
-
Adapts a Reader as an InputStream.
- ReaderInputStream(Reader) - Constructor for class org.apache.any23.util.ReaderInputStream
-
Construct a ReaderInputStream for the specified Reader.
- ReaderInputStream(Reader, String) - Constructor for class org.apache.any23.util.ReaderInputStream
-
Construct a ReaderInputStream for the specified Reader, with the specified encoding.
- readFileContent(File) - Static method in class org.apache.any23.util.FileUtils
-
Returns the content of a file as a single string.
- readFileLines(File) - Static method in class org.apache.any23.util.FileUtils
-
Returns all the lines of a file.
- readNodeContent(Node, boolean) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Reads the text content of the given node and returns it.
- readResourceContent(Class, String) - Static method in class org.apache.any23.util.FileUtils
-
Reads a resource file and returns the content as a string.
- readResourceContent(String) - Static method in class org.apache.any23.util.FileUtils
-
Reads a resource file and returns the content as a string.
- readStream() - Method in class org.apache.any23.source.FileDocumentSource
- readTextField(Node) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Reads a text field from the given node adding the content to the given res list.
- readUrlField(List<HTMLDocument.TextField>, Node) - Static method in class org.apache.any23.extractor.html.HTMLDocument
-
Reads a URL field from the given node, adding the content to the given res list.
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
- receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.TripleWriterHandler
-
Writers may override this method to handle a "receiveNamespace" extraction event.
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
- receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.TripleWriterHandler
-
Writers may override this method to handle a "receiveTriple" extraction event.
- register(ExtractorFactory<?>) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
Registers an ExtractorFactory.
- REL_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- remove(QuadTemplate) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
-
Removes a quad template from the rule.
- remove(QuadTemplate) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- remove(Variable) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
-
Removes a variable from the template.
- remove(Variable) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- remove(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
- removeData(String) - Method in class org.apache.any23.validator.DefaultRuleContext
- removeData(String) - Method in interface org.apache.any23.validator.RuleContext
-
Removes a data item from the context.
- removeRule(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
- removeRule(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
-
Removes a rule from the validator, along with all the related Fixes.
- report() - Method in class org.apache.any23.writer.BenchmarkTripleHandler
-
Returns the report as a human readable string.
- reportFixError(Fix, Exception, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
- reportFixError(Fix, Exception, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Reports an error that occurred while executing a Fix.
- ReportingTripleHandler - Class in org.apache.any23.writer
-
A TripleHandler that collects various information about the extraction process, such as the extractors used and the total number of triples.
- ReportingTripleHandler(TripleHandler) - Constructor for class org.apache.any23.writer.ReportingTripleHandler
- reportIssue(ValidationReport.IssueLevel, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
- reportIssue(ValidationReport.IssueLevel, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Reports a detected issue.
- reportIssue(ValidationReport.IssueLevel, String, Node) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
- reportIssue(ValidationReport.IssueLevel, String, Node) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Reports an issue detected on a specified node.
- reportRuleError(Rule, Exception, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
- reportRuleError(Rule, Exception, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Reports an error that occurred while executing a Rule.
- RepositoryWriter - Class in org.apache.any23.writer
-
An RDF4J repository triple writer.
- RepositoryWriter(RepositoryConnection) - Constructor for class org.apache.any23.writer.RepositoryWriter
- RepositoryWriter(RepositoryConnection, Resource) - Constructor for class org.apache.any23.writer.RepositoryWriter
- reset() - Method in class org.apache.any23.extractor.html.HCardName
-
Resets the content of the HName fields.
- reset() - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated. Resets the parser to the original state.
- reset() - Method in class org.apache.any23.util.ReaderInputStream
-
Resets the StringReader.
- reset() - Method in class org.apache.any23.writer.CountingTripleHandler
- resetExtractor() - Method in class org.apache.any23.extractor.html.AdrExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
-
Resets the internal status of the extractor to prepare it for a new extraction section.
- resetExtractor() - Method in class org.apache.any23.extractor.html.GeoExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.HCardExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.HListingExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.HResumeExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.HReviewExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractor
- resetExtractor() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
-
Resets the internal status of the extractor to prepare it for a new extraction section.
- resolveCIRIeOrIRIList(Node, String, boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated. Resolves a whitespace-separated list of CURIEs or URIs.
- resolveCURIEOrIRI(String, boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated. Resolves a CURIE or IRI string.
- resolveIRI(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
- resolveIRI(String) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated. Resolves an IRI string as an IRI.
- resolveIRI(String, URI) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
-
Fixes typical errors in IRIs, and resolves relative IRIs against a base IRI.
- RESOURCE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- ResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Constructor for class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
- REV_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- Rule - Interface in org.apache.any23.validator
-
Models a rule for an issue detection.
- RuleActivation(Rule) - Constructor for class org.apache.any23.validator.ValidationReport.RuleActivation
- RuleContext<T> - Interface in org.apache.any23.validator
- RuleError(Rule, Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.RuleError
- run() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Triggers the execution of all the Extractor instances registered to this class using the default extraction parameters.
- run(ExtractionParameters) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Triggers the execution of all the Extractor instances registered to this class using the specified extraction parameters.
- run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.csv.CSVExtractor
- run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
- run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
- run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.yaml.YAMLExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.HeadLinkExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.HTMLMetaExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.ICBMExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.LicenseExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.TitleExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.XFNExtractor
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.microdata.MicrodataExtractor
-
This extraction performs the Microdata to RDF conversion algorithm.
- run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
S
- SerializationException - Exception in org.apache.any23.validator
-
Defines an error raised from the ValidationReportSerializer.
- SerializationException(String) - Constructor for exception org.apache.any23.validator.SerializationException
- SerializationException(String, Throwable) - Constructor for exception org.apache.any23.validator.SerializationException
- serialize(ValidationReport, OutputStream) - Method in interface org.apache.any23.validator.ValidationReportSerializer
-
Serializes the validation report on the given output stream.
- serialize(ValidationReport, OutputStream) - Method in class org.apache.any23.validator.XMLValidationReportSerializer
- serializeGetterValue(Object, Method, PrintStream) - Method in class org.apache.any23.validator.XMLValidationReportSerializer
- serializeToXML(Node, boolean) - Static method in class org.apache.any23.extractor.html.DomUtils
-
Given a DOM Node, produces the XML serialization omitting the XML declaration.
- serializeVocabularies(RDFFormat, PrintStream) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serializes all the vocabularies to NQuads over the given output stream.
- serializeVocabulary(Vocabulary, RDFFormat) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serializes the given vocabulary to NQuads and returns them as a string.
- serializeVocabulary(Vocabulary, RDFFormat, boolean, PrintStream) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serializes the given vocabulary to NQuads over the given output stream.
- serializeVocabulary(Vocabulary, RDFWriter) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serializes the given vocabulary to triples over the given writer.
- serializeVocabulary(IRI, IRI[], IRI[], Map<IRI, String>, RDFWriter) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
-
Serializes a vocabulary composed of the given namespace, resources and properties.
- setAnnotated(boolean) - Method in class org.apache.any23.writer.JSONWriter
-
Deprecated.
- setAnnotated(boolean) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
-
Sets the annotation flag.
- setAnnotated(boolean) - Method in class org.apache.any23.writer.URIListWriter
- setCacheFactory(LocalCopyFactory) - Method in class org.apache.any23.Any23
-
Sets a LocalCopyFactory instance.
- setContentLength(long) - Method in class org.apache.any23.filter.ExtractionContextBlocker
- setContentLength(long) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
- setContentLength(long) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
- setContentLength(long) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
- setContentLength(long) - Method in class org.apache.any23.writer.CompositeTripleHandler
- setContentLength(long) - Method in class org.apache.any23.writer.CountingTripleHandler
- setContentLength(long) - Method in class org.apache.any23.writer.LoggingTripleHandler
- setContentLength(long) - Method in class org.apache.any23.writer.ReportingTripleHandler
- setContentLength(long) - Method in class org.apache.any23.writer.RepositoryWriter
- setContentLength(long) - Method in class org.apache.any23.writer.TripleWriterHandler
-
Writers may override this method to handle a "setContentLength" extraction event.
- setCurrentExtractionResult(ExtractionResult) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
- setDefaultLogging() - Static method in class org.apache.any23.util.LogUtils
-
Deprecated.
- setErrorMode(MicrodataParser.ErrorMode) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
- setErrorNode(Node) - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
- setField(String, HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
- setFullName(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
- setHTTPClient(HTTPClient) - Method in class org.apache.any23.Any23
-
Sets the HTTPClient implementation used to retrieve contents.
- setHTTPUserAgent(String) - Method in class org.apache.any23.Any23
-
Sets the HTTP Header User Agent, see RFC 2616-14.43.
- setIssueReport(IssueReport) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
- setLocalCopyFactory(LocalCopyFactory) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Sets the internal factory for generating the document local copy; if null, the MemCopyFactory will be used.
- setMIMETypeDetector(MIMETypeDetector) - Method in class org.apache.any23.Any23
-
Sets an instance of MIMETypeDetector.
- setMIMETypeDetector(MIMETypeDetector) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Sets the internal MIME type detector; if null, MIME type detection will be skipped and all extractors will be activated.
- setOrganization(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
- setOrganizationUnit(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
- setParserEncoding(String) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
-
Sets the document parser encoding.
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.csv.CSVExtractor
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
-
Deprecated. Since 2.4. This extractor has never supported this setting. Do not use.
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
Deprecated. Since 2.4. This extractor has never supported this setting. Do not use.
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
Deprecated. Since 2.4. This extractor has never supported this setting. Do not use.
- setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.yaml.YAMLExtractor
- setVerboseLogging() - Static method in class org.apache.any23.util.LogUtils
-
Deprecated.
- setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
- setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
-
Deprecated. Since 2.4. This extractor has never supported this setting. Do not use.
- setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
-
Deprecated. Since 2.4. This extractor has never supported this setting. Do not use.
- setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
-
Deprecated. Since 2.4. This extractor has never supported this setting. Do not use.
- SimpleExtractorFactory<T extends org.apache.any23.extractor.Extractor<?>> - Class in org.apache.any23.extractor
-
This class is a simple and default-like implementation of ExtractorFactory.
- SimpleExtractorFactory(String, Prefixes) - Constructor for class org.apache.any23.extractor.SimpleExtractorFactory
- SimpleExtractorFactory(String, Prefixes, Collection<String>, String) - Constructor for class org.apache.any23.extractor.SimpleExtractorFactory
- SingleDocumentExtraction - Class in org.apache.any23.extractor
-
This class acts as a facade where all extractors (for a given MIMEType) can be called on a single document.
- SingleDocumentExtraction(Configuration, DocumentSource, ExtractorFactory<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
-
Builds an extractor by the specification of document source, extractors factory and output triple handler.
- SingleDocumentExtraction(Configuration, DocumentSource, ExtractorGroup, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
-
Builds an extractor by the specification of document source, list of extractors and output triple handler.
- SingleDocumentExtraction(DocumentSource, ExtractorFactory<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
-
Builds an extractor by the specification of document source, extractors factory and output triple handler, using the DefaultConfiguration.
- SingleDocumentExtractionReport - Class in org.apache.any23.extractor
-
This class provides the report for a SingleDocumentExtraction run.
- SingleDocumentExtractionReport(ValidationReport, Map<String, Collection<IssueReport.Issue>>) - Constructor for class org.apache.any23.extractor.SingleDocumentExtractionReport
- singleton() - Static method in class org.apache.any23.http.DefaultHTTPClientConfiguration
- source() - Method in class org.apache.any23.extractor.html.HTMLDocument.TextField
- SPACE_SEPARATOR - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
- SpanCloserInputStream - Class in org.apache.any23.extractor.html
-
Extension of InputStream meant to detect and replace any occurrence of inline span:
- SpanCloserInputStream(InputStream) - Constructor for class org.apache.any23.extractor.html.SpanCloserInputStream
- SpeciesExtractor - Class in org.apache.any23.extractor.html
-
Extractor able to extract the Species Microformat.
- SpeciesExtractor() - Constructor for class org.apache.any23.extractor.html.SpeciesExtractor
- SpeciesExtractorFactory - Class in org.apache.any23.extractor.html
- SpeciesExtractorFactory() - Constructor for class org.apache.any23.extractor.html.SpeciesExtractorFactory
- SRC_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- SRC_TAGS - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
-
List of tags providing the src property.
- startDocument(IRI) - Method in class org.apache.any23.filter.ExtractionContextBlocker
- startDocument(IRI) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
- startDocument(IRI) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
- startDocument(IRI) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
- startDocument(IRI) - Method in class org.apache.any23.writer.CompositeTripleHandler
- startDocument(IRI) - Method in class org.apache.any23.writer.CountingTripleHandler
- startDocument(IRI) - Method in class org.apache.any23.writer.JSONWriter
-
Deprecated.
- startDocument(IRI) - Method in class org.apache.any23.writer.LoggingTripleHandler
- startDocument(IRI) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
- startDocument(IRI) - Method in class org.apache.any23.writer.ReportingTripleHandler
- startDocument(IRI) - Method in class org.apache.any23.writer.RepositoryWriter
- startDocument(IRI) - Method in class org.apache.any23.writer.TripleWriterHandler
-
Writers may override this method to handle a "startDocument" extraction event.
- startRDF() - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
- StreamUtils - Class in org.apache.any23.util
-
Contains general utility functions for handling streams.
- StringDocumentSource - Class in org.apache.any23.source
-
String implementation of DocumentSource.
- StringDocumentSource(String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
- StringDocumentSource(String, String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
- StringDocumentSource(String, String, String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
- StringUtils - Class in org.apache.any23.util
-
This class provides a set of string utility methods.
- SUBJECT_ATTRIBUTES - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
T
- TagSoupExtractionResult - Interface in org.apache.any23.extractor
-
This interface models a specific ExtractionResult able to collect property roots generated by HTML Microformat extractions.
- TagSoupExtractionResult.PropertyPath - Class in org.apache.any23.extractor
-
Defines a property path object.
- TagSoupExtractionResult.ResourceRoot - Class in org.apache.any23.extractor
-
Defines a property root object.
- TagSoupParser - Class in org.apache.any23.extractor.html
-
Parses an InputStream into an HTML DOM tree.
- TagSoupParser(InputStream, String) - Constructor for class org.apache.any23.extractor.html.TagSoupParser
- TagSoupParser(InputStream, String, String) - Constructor for class org.apache.any23.extractor.html.TagSoupParser
- TagSoupParser.ElementLocation - Class in org.apache.any23.extractor.html
-
Describes a DOM Element location.
- TemplateGraph - Class in org.apache.any23.extractor.xpath
-
Represents a Quad graph IRI template.
- TemplateGraph(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateGraph
-
Constructor.
- TemplateObject - Class in org.apache.any23.extractor.xpath
-
Represents a Quad object template.
- TemplateObject(TemplateObject.Type, String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateObject
-
Constructor.
- TemplateObject.Type - Enum in org.apache.any23.extractor.xpath
-
Supported object types.
- TemplatePredicate - Class in org.apache.any23.extractor.xpath
-
Represents a Quad predicate template.
- TemplatePredicate(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplatePredicate
-
Constructor.
- TemplateSubject - Class in org.apache.any23.extractor.xpath
-
Represents a Quad subject template.
- TemplateSubject(TemplateSubject.Type, String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateSubject
-
Constructor.
- TemplateSubject.Type - Enum in org.apache.any23.extractor.xpath
-
Supported subject types.
- TemplateXPathExtractionRule - Interface in org.apache.any23.extractor.xpath
-
Defines an XPathExtractionRule able to expand a QuadTemplate set based on a given template.
- TemplateXPathExtractionRuleImpl - Class in org.apache.any23.extractor.xpath
-
Default implementation of XPathExtractionRule.
- TemplateXPathExtractionRuleImpl(String, String) - Constructor for class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- Term<T extends org.eclipse.rdf4j.model.Value> - Class in org.apache.any23.extractor.xpath
-
Represents a generic template term.
- Term(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.Term
-
Constructor.
- TextField(String, Node) - Constructor for class org.apache.any23.extractor.html.HTMLDocument.TextField
- TIME_PROPERTY_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
- TitleExtractor - Class in org.apache.any23.extractor.html
-
Extracts the value of the <title> element of an HTML or XHTML page.
- TitleExtractor() - Constructor for class org.apache.any23.extractor.html.TitleExtractor
- TitleExtractorFactory - Class in org.apache.any23.extractor.html
- TitleExtractorFactory() - Constructor for class org.apache.any23.extractor.html.TitleExtractorFactory
- toByteArray(InputStream) - Static method in class org.apache.any23.source.MemCopyFactory
- toJSON() - Method in class org.apache.any23.extractor.microdata.Item
- toJSON() - Method in class org.apache.any23.extractor.microdata.ItemProp
- toJSON() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- toJSON() - Method in class org.apache.any23.extractor.microdata.ItemScope
- toJSON() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
- toString() - Method in class org.apache.any23.extractor.ExtractionResultImpl
- toString() - Method in class org.apache.any23.extractor.microdata.ItemProp
- toString() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
- toString() - Method in class org.apache.any23.extractor.microdata.ItemScope
- toString() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
- toString() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
- toString() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
- toString() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
- toString() - Method in class org.apache.any23.extractor.xpath.TemplateGraph
- toString() - Method in class org.apache.any23.extractor.xpath.TemplateObject
- toString() - Method in class org.apache.any23.extractor.xpath.TemplatePredicate
- toString() - Method in class org.apache.any23.extractor.xpath.TemplateSubject
- toString() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
- toString() - Method in class org.apache.any23.extractor.xpath.Term
- toString() - Method in class org.apache.any23.validator.DefaultValidationReport
- toString() - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
- toString() - Method in class org.apache.any23.validator.EmptyValidationReport
- toString() - Method in class org.apache.any23.validator.ValidationReport.Error
- toString() - Method in class org.apache.any23.validator.ValidationReport.FixError
- toString() - Method in class org.apache.any23.validator.ValidationReport.Issue
- toString() - Method in class org.apache.any23.validator.ValidationReport.RuleActivation
- toString() - Method in class org.apache.any23.validator.ValidationReport.RuleError
- toValue(String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Value.
- toXSDDateTime(Date) - Static method in class org.apache.any23.rdf.RDFUtils
-
Prints a date to the XSD datetime format.
- traceRuleActivation(Rule) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
- traceRuleActivation(Rule) - Method in interface org.apache.any23.validator.ValidationReportBuilder
-
Traces that a rule has been applied.
- triple(String, String, String) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a statement of type: toValue(s), toValue(p), toValue(o)
- triple(Resource, IRI, Value) - Static method in class org.apache.any23.rdf.RDFUtils
-
Creates a Statement.
- TripleWriterHandler - Class in org.apache.any23.writer
-
This class connects a TripleHandler to a TripleWriter by writing received data.
- TripleWriterHandler() - Constructor for class org.apache.any23.writer.TripleWriterHandler
- TriXExtractor - Class in org.apache.any23.extractor.rdf
-
Concrete implementation of Extractor.ContentExtractor to perform extraction on TriX documents.
- TriXExtractor() - Constructor for class org.apache.any23.extractor.rdf.TriXExtractor
-
Default constructor, with no verification of data types and no stop at first error.
- TriXExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.TriXExtractor
-
Constructor that allows specifying the validation and error handling policies.
- TriXExtractorFactory - Class in org.apache.any23.extractor.rdf
- TriXExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.TriXExtractorFactory
- TriXWriter - Class in org.apache.any23.writer
-
TriX TripleWriter implementation.
- TriXWriter(OutputStream) - Constructor for class org.apache.any23.writer.TriXWriter
- TriXWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.TriXWriter
- TriXWriterFactory - Class in org.apache.any23.writer
- TriXWriterFactory() - Constructor for class org.apache.any23.writer.TriXWriterFactory
- TurtleExtractor - Class in org.apache.any23.extractor.rdf
-
Concrete implementation of Extractor.ContentExtractor able to perform the extraction on Turtle documents.
- TurtleExtractor() - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractor
-
Default constructor, with no verification of data types and no stop at first error.
- TurtleExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractor
-
Constructor that allows specifying the validation and error handling policies.
- TurtleExtractorFactory - Class in org.apache.any23.extractor.rdf
- TurtleExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractorFactory
- TurtleHTMLExtractor - Class in org.apache.any23.extractor.html
-
Extractor for Turtle/N3 format embedded within HTML script tags.
- TurtleHTMLExtractor() - Constructor for class org.apache.any23.extractor.html.TurtleHTMLExtractor
- TurtleHTMLExtractorFactory - Class in org.apache.any23.extractor.html
- TurtleHTMLExtractorFactory() - Constructor for class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
- TurtleWriter - Class in org.apache.any23.writer
-
N3 notation TripleWriter implementation.
- TurtleWriter(OutputStream) - Constructor for class org.apache.any23.writer.TurtleWriter
-
Constructor.
- TurtleWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.TurtleWriter
- TurtleWriterFactory - Class in org.apache.any23.writer
- TurtleWriterFactory() - Constructor for class org.apache.any23.writer.TurtleWriterFactory
- TYPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- TYPEOF_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
U
- unblockContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
- unblockDocument() - Method in class org.apache.any23.filter.ExtractionContextBlocker
- unregister(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
-
Unregisters the ExtractorFactory with the given name.
- updateIRIMapping(Node) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated. Updates the IRI mapping with the XMLNS attributes declared in the current node.
- updateVocabulary(Node) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated. Updates the vocabulary context with possible @vocab declarations.
- URI - org.apache.any23.extractor.xpath.TemplateObject.Type
- URI - org.apache.any23.extractor.xpath.TemplateSubject.Type
- URIListWriter - Class in org.apache.any23.writer
-
This writer simply produces a list of the unique IRIs present in the subject or in the object of every extracted RDF Statement.
- URIListWriter(OutputStream) - Constructor for class org.apache.any23.writer.URIListWriter
- URIListWriterFactory - Class in org.apache.any23.writer
- URIListWriterFactory() - Constructor for class org.apache.any23.writer.URIListWriterFactory
- URL_PROPERTY_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
- URLUtils - Class in org.apache.any23.util
-
Contains general utility functions for handling URLs.
V
- validate(URI, Document, boolean) - Method in class org.apache.any23.validator.DefaultValidator
- validate(URI, Document, boolean) - Method in interface org.apache.any23.validator.Validator
-
Performs validation and fixing of the provided document.
- validate(DOMDocument, boolean) - Method in class org.apache.any23.validator.DefaultValidator
- validate(DOMDocument, boolean) - Method in interface org.apache.any23.validator.Validator
-
Performs validation and fixing of the provided document.
- ValidationReport - Interface in org.apache.any23.validator
-
This class contains the report of a validation performed by the Validator class.
- ValidationReport.Error - Class in org.apache.any23.validator
-
An error occurred while performing the validation process.
- ValidationReport.FixError - Class in org.apache.any23.validator
-
An error occurred while executing a fix.
- ValidationReport.Issue - Class in org.apache.any23.validator
-
An issue found during the validation process.
- ValidationReport.IssueLevel - Enum in org.apache.any23.validator
-
Defines the different issue levels.
- ValidationReport.RuleActivation - Class in org.apache.any23.validator
-
This class describes the activation of a rule.
- ValidationReport.RuleError - Class in org.apache.any23.validator
-
An error occurred while executing a rule.
- ValidationReportBuilder - Interface in org.apache.any23.validator
-
The report interface is used to generate diagnostics about validation.
- ValidationReportSerializer - Interface in org.apache.any23.validator
-
Defines a serializer for validation reports.
- Validator - Interface in org.apache.any23.validator
-
The validator class performs validation and correction of HTML Document instances.
- ValidatorException - Exception in org.apache.any23.validator
-
Models any exception raised by the Validator.
- ValidatorException(String) - Constructor for exception org.apache.any23.validator.ValidatorException
- ValidatorException(String, Throwable) - Constructor for exception org.apache.any23.validator.ValidatorException
- value() - Method in class org.apache.any23.extractor.html.HTMLDocument.TextField
- value() - Method in annotation type org.apache.any23.validator.XMLValidationReportSerializer.NodeName
- valueFactory - Variable in class org.apache.any23.extractor.html.MicroformatExtractor
- valueOf(String) - Static method in enum org.apache.any23.extractor.microdata.ItemPropValue.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.any23.extractor.xpath.TemplateObject.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.any23.extractor.xpath.TemplateSubject.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.any23.validator.ValidationReport.IssueLevel
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum org.apache.any23.extractor.microdata.ItemPropValue.Type
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.apache.any23.extractor.xpath.TemplateObject.Type
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.apache.any23.extractor.xpath.TemplateSubject.Type
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.apache.any23.validator.ValidationReport.IssueLevel
-
Returns an array containing the constants of this enum type, in the order they are declared.
- Variable - Class in org.apache.any23.extractor.xpath
-
Represents a variable in TemplateXPathExtractionRule.
- Variable(String, String) - Constructor for class org.apache.any23.extractor.xpath.Variable
- VERSION - Static variable in class org.apache.any23.Any23
-
Any23 core library version.
- vf - Variable in class org.apache.any23.extractor.yaml.ElementsProcessor
- VOCAB_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
W
- WARNING - org.apache.any23.validator.ValidationReport.IssueLevel
- writeNamespace(String, String) - Method in class org.apache.any23.extractor.ExtractionResultImpl
- writeNamespace(String, String) - Method in class org.apache.any23.writer.JSONWriter
-
Deprecated.
- writeNamespace(String, String) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
- writeNamespace(String, String) - Method in class org.apache.any23.writer.URIListWriter
- WriterSettings - Class in org.apache.any23.writer
-
This class encapsulates commonly supported settings for TripleWriter implementations.
- writeTriple(Resource, IRI, Value) - Method in class org.apache.any23.extractor.ExtractionResultImpl
- writeTriple(Resource, IRI, Value, IRI) - Method in class org.apache.any23.extractor.ExtractionResultImpl
- writeTriple(Resource, IRI, Value, Resource) - Method in class org.apache.any23.writer.JSONWriter
-
Deprecated.
- writeTriple(Resource, IRI, Value, Resource) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
- writeTriple(Resource, IRI, Value, Resource) - Method in class org.apache.any23.writer.URIListWriter
X
- XCalExtractor - Class in org.apache.any23.extractor.calendar
- XCalExtractor() - Constructor for class org.apache.any23.extractor.calendar.XCalExtractor
- XCalExtractorFactory - Class in org.apache.any23.extractor.calendar
- XCalExtractorFactory() - Constructor for class org.apache.any23.extractor.calendar.XCalExtractorFactory
- XFNExtractor - Class in org.apache.any23.extractor.html
-
Extractor for the XFN microformat.
- XFNExtractor() - Constructor for class org.apache.any23.extractor.html.XFNExtractor
- XFNExtractorFactory - Class in org.apache.any23.extractor.html
- XFNExtractorFactory() - Constructor for class org.apache.any23.extractor.html.XFNExtractorFactory
- XML_LANG_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- XML_LITERAL_DATATYPE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- XMLNS_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- XMLNS_DEFAULT - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
-
Deprecated.
- XMLValidationReportSerializer - Class in org.apache.any23.validator
-
Default implementation of ValidationReportSerializer for XML.
- XMLValidationReportSerializer() - Constructor for class org.apache.any23.validator.XMLValidationReportSerializer
- XMLValidationReportSerializer.NodeName - Annotation Type in org.apache.any23.validator
-
Allows specifying a custom node name.
- XPathExtractionRule - Interface in org.apache.any23.extractor.xpath
-
Defines an extraction rule for the XPathExtractor.
- XPathExtractor - Class in org.apache.any23.extractor.xpath
-
Implementation of an Extractor.TagSoupDOMExtractor able to apply XPathExtractionRules and generate quads.
- XPathExtractor() - Constructor for class org.apache.any23.extractor.xpath.XPathExtractor
- XPathExtractor(List<XPathExtractionRule>) - Constructor for class org.apache.any23.extractor.xpath.XPathExtractor
- XPathExtractorFactory - Class in org.apache.any23.extractor.xpath
- XPathExtractorFactory() - Constructor for class org.apache.any23.extractor.xpath.XPathExtractorFactory
- XSLTStylesheet - Class in org.apache.any23.extractor.rdfa
-
An XSLT stylesheet loaded from an InputStream, can be applied to DOM trees and writes the result to a Writer.
- XSLTStylesheet(InputStream) - Constructor for class org.apache.any23.extractor.rdfa.XSLTStylesheet
- XSLTStylesheetException - Exception in org.apache.any23.extractor.rdfa
-
Defines an exception raised by the XSLTStylesheet class.
- XSLTStylesheetException(String) - Constructor for exception org.apache.any23.extractor.rdfa.XSLTStylesheetException
- XSLTStylesheetException(String, Throwable) - Constructor for exception org.apache.any23.extractor.rdfa.XSLTStylesheetException
Y
- YAMLExtractor - Class in org.apache.any23.extractor.yaml
- YAMLExtractor() - Constructor for class org.apache.any23.extractor.yaml.YAMLExtractor
- YAMLExtractorFactory - Class in org.apache.any23.extractor.yaml
- YAMLExtractorFactory() - Constructor for class org.apache.any23.extractor.yaml.YAMLExtractorFactory
- YAMLProcessorException - Exception in org.apache.any23.extractor.yaml
- YAMLProcessorException() - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
- YAMLProcessorException(Object) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
- YAMLProcessorException(Object, String) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
- YAMLProcessorException(Object, String, Throwable) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
- YAMLProcessorException(Object, String, Throwable, boolean, boolean) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
- YAMLProcessorException(Object, Throwable) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException