A B C D E F G H I J L M N O P Q R S T U V W X Y 
All Classes All Packages

A

ABOUT_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
AboutNotURIRule - Class in org.apache.any23.validator.rule
This rule is able to detect whether an about value is a valid URL or otherwise is a valid relative URL.
AboutNotURIRule() - Constructor for class org.apache.any23.validator.rule.AboutNotURIRule
 
AcceptHeaderBuilder - Class in org.apache.any23.http
Concatenates a collection of MIME specs in "type/subtype;q=x.x" notation into an HTTP Accept header value, and removes duplicates and types covered by wildcards.
AcceptHeaderBuilder(Collection<MIMEType>) - Constructor for class org.apache.any23.http.AcceptHeaderBuilder
 
acceptIRI(IRI) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
acceptIRI(IRI) - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
Checks if the rule can be applied on the given document IRI.
acquireProperty(ItemProp) - Method in class org.apache.any23.extractor.microdata.ItemScope
 
add(QuadTemplate) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
Adds a QuadTemplate to the rule.
add(QuadTemplate) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
add(Variable) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
Adds a variable to the template.
add(Variable) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
add(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 
addAttribute(String, String, String) - Method in class org.apache.any23.validator.DefaultDOMDocument
 
addAttribute(String, String, String) - Method in interface org.apache.any23.validator.DOMDocument
Adds an attribute to a node addressed by the given XPath.
addBNodeProperty(Resource, IRI, BNode) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a BNode property to a node.
addBNodeProperty(Node, Resource, IRI, BNode) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a BNode property to a node.
addChild(TripleHandler) - Method in class org.apache.any23.writer.CompositeTripleHandler
Adds a decorated handler.
addIRIProperty(Resource, IRI, IRI) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a IRI property to a node.
ADDITIONAL_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
 
addPropertyPath(Class<? extends MicroformatExtractor>, Resource, Resource, BNode, String[]) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
addPropertyPath(Class<? extends MicroformatExtractor>, Resource, Resource, BNode, String[]) - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
Adds a property path to the list of the extracted data.
addResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
addResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
Adds a root property to the extraction result, specifying also the path corresponding to the root of data which generated the property and the extractor responsible for such addition.
addRule(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
 
addRule(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
Allows to register a new rule to this validator
addRule(Class<? extends Rule>, Class<? extends Fix>) - Method in class org.apache.any23.validator.DefaultValidator
 
addRule(Class<? extends Rule>, Class<? extends Fix>) - Method in interface org.apache.any23.validator.Validator
Allows to register a new rule to this validator and associating it to a fix.
AdrExtractor - Class in org.apache.any23.extractor.html
Extractor for the adr microformat.
AdrExtractor() - Constructor for class org.apache.any23.extractor.html.AdrExtractor
 
AdrExtractorFactory - Class in org.apache.any23.extractor.html
 
AdrExtractorFactory() - Constructor for class org.apache.any23.extractor.html.AdrExtractorFactory
 
Any23 - Class in org.apache.any23
A facade with convenience methods for typical Any23 extraction operations.
Any23() - Constructor for class org.apache.any23.Any23
Constructor with default configuration.
Any23(String...) - Constructor for class org.apache.any23.Any23
Constructor that allows the specification of a list of extractor names.
Any23(Configuration) - Constructor for class org.apache.any23.Any23
Constructor accepting Configuration.
Any23(Configuration, String...) - Constructor for class org.apache.any23.Any23
Constructor that allows the specification of a custom configuration and of list of extractor names.
Any23(Configuration, ExtractorGroup) - Constructor for class org.apache.any23.Any23
Constructor that allows the specification of a custom configuration and of a list of extractors.
Any23(ExtractorGroup) - Constructor for class org.apache.any23.Any23
Constructor that allows the specification of a list of extractors.
Any23ValueFactoryWrapper - Class in org.apache.any23.rdf
Any23 specialization of the ValueFactory.
Any23ValueFactoryWrapper(ValueFactory) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
Any23ValueFactoryWrapper(ValueFactory, IssueReport) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
Any23ValueFactoryWrapper(ValueFactory, IssueReport, String) - Constructor for class org.apache.any23.rdf.Any23ValueFactoryWrapper
Constructor with error reporter.
applyOn(DOMDocument, RuleContext<?>, ValidationReportBuilder) - Method in interface org.apache.any23.validator.Rule
Applies this rule to the given document.
applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.AboutNotURIRule
 
applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MetaNameMisuseRule
 
applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MissingItemscopeAttributeValueRule
 
applyOn(DOMDocument, RuleContext, ValidationReportBuilder) - Method in class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
 
applyTo(Document, Writer) - Method in class org.apache.any23.extractor.rdfa.XSLTStylesheet
Applies the XSLT transformation
applyTo(Document, Writer, Map<String, String>) - Method in class org.apache.any23.extractor.rdfa.XSLTStylesheet
Applies the XSLT transformation
asLines(InputStream) - Static method in class org.apache.any23.util.StreamUtils
Returns all the lines read from an input stream.
asModel(IRI, Object, Value) - Method in class org.apache.any23.extractor.yaml.ElementsProcessor
Converts a data structure to ElementsProcessor.ModelHolder.
asString(InputStream) - Static method in class org.apache.any23.util.StreamUtils
Returns the string content of a stream, new line chars will be removed.
asString(InputStream, boolean) - Static method in class org.apache.any23.util.StreamUtils
Returns the string content of a stream.
ATTRIBUTE_CSS - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
available() - Method in class org.apache.any23.util.ReaderInputStream
 

B

BaseRDFExtractor - Class in org.apache.any23.extractor.rdf
Base class for a generic RDF Extractor.ContentExtractor.
BaseRDFExtractor() - Constructor for class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
BaseRDFExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.BaseRDFExtractor
Constructor, allows to specify the validation and error handling policies.
BEGIN_SCRIPT - Static variable in class org.apache.any23.extractor.html.MicroformatExtractor
 
BenchmarkTripleHandler - Class in org.apache.any23.writer
TripleHandler decorator useful to perform benchmarking.
BenchmarkTripleHandler(TripleHandler) - Constructor for class org.apache.any23.writer.BenchmarkTripleHandler
Constructor.
blockContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
bnode() - Static method in class org.apache.any23.rdf.RDFUtils
 
bnode(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a BNode.
BNODE - org.apache.any23.extractor.xpath.TemplateObject.Type
 
BNODE - org.apache.any23.extractor.xpath.TemplateSubject.Type
 
BODY_TAG - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
ByteArrayDocumentSource - Class in org.apache.any23.source
ByteArray implementation of DocumentSource.
ByteArrayDocumentSource(byte[], String, String) - Constructor for class org.apache.any23.source.ByteArrayDocumentSource
 
ByteArrayDocumentSource(InputStream, String, String) - Constructor for class org.apache.any23.source.ByteArrayDocumentSource
 

C

CLASS_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
 
close() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
close() - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
close() - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
close() - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
close() - Method in class org.apache.any23.http.DefaultHTTPClient
Shuts down the connection manager.
close() - Method in interface org.apache.any23.http.HTTPClient
Release all static resources help by the instance.
close() - Method in class org.apache.any23.util.ReaderInputStream
Closes the Stringreader.
close() - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
close() - Method in class org.apache.any23.writer.CompositeTripleHandler
 
close() - Method in class org.apache.any23.writer.CountingTripleHandler
 
close() - Method in class org.apache.any23.writer.JSONWriter
Deprecated.
 
close() - Method in class org.apache.any23.writer.LoggingTripleHandler
 
close() - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
close() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
close() - Method in class org.apache.any23.writer.RepositoryWriter
 
close() - Method in class org.apache.any23.writer.URIListWriter
 
closeContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
closeContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
closeContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
 
closeContext(ExtractionContext) - Method in class org.apache.any23.writer.TripleWriterHandler
Writers may override this method to handle a "closeContext" extraction event.
closeGracefully(Closeable) - Static method in class org.apache.any23.util.StreamUtils
Closes the closable interface and reports error if any.
CompositeTripleHandler - Class in org.apache.any23.writer
A TripleHandler multi decorator, that wraps zero or more other triple handlers and dispatches all events to each of them.
CompositeTripleHandler() - Constructor for class org.apache.any23.writer.CompositeTripleHandler
Constructor with empty decorated list.
CompositeTripleHandler(Collection<TripleHandler>) - Constructor for class org.apache.any23.writer.CompositeTripleHandler
Constructor with initial list of decorated handlers.
conditionallyAddLiteralProperty(Node, Resource, IRI, Literal) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a literal property to a node.
conditionallyAddResourceProperty(Resource, IRI, IRI) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a IRI property to a node.
conditionallyAddStringProperty(Node, Resource, IRI, String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Helper method that adds a literal property to a subject only if the value of the property is a valid string.
contains(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 
containsField(String) - Method in class org.apache.any23.extractor.html.HCardName
 
CONTENT_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
CountingTripleHandler - Class in org.apache.any23.writer
A simple TripleHandler that merely counts the number of triples it has received.
CountingTripleHandler() - Constructor for class org.apache.any23.writer.CountingTripleHandler
 
CountingTripleHandler(boolean) - Constructor for class org.apache.any23.writer.CountingTripleHandler
 
countNL(String) - Static method in class org.apache.any23.util.StringUtils
Counts the number of NL in the given in string.
countOccurrences(String, String) - Static method in class org.apache.any23.util.StringUtils
Counts how many times content appears within container without string overlapping.
cp(File, File) - Static method in class org.apache.any23.util.FileUtils
Copies a file src to the dest.
cp(InputStream, File) - Static method in class org.apache.any23.util.FileUtils
Copies the content of the input stream within the given dest file.
createBNode() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createBNode(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createDocumentSource(String) - Method in class org.apache.any23.Any23
Returns the most appropriate DocumentSource for the givendocumentIRI.
createExtractor() - Method in class org.apache.any23.extractor.calendar.ICalExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.calendar.JCalExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.calendar.XCalExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.csv.CSVExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.AdrExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.GeoExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.HCalendarExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.HCardExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.HeadLinkExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.HListingExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.HRecipeExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.HResumeExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.HReviewExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.ICBMExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.LicenseExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.SpeciesExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.TitleExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.html.XFNExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdf.TriXExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdf.TurtleExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.xpath.XPathExtractorFactory
 
createExtractor() - Method in class org.apache.any23.extractor.yaml.YAMLExtractorFactory
 
createInitializedHTTPClient() - Static method in class org.apache.any23.http.DefaultHTTPClient
Creates a DefaultHTTPClient instance already initialized
createIRI(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createIRI(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(boolean) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(byte) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(double) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(float) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(int) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(long) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(short) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(String, IRI) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(BigDecimal) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(BigInteger) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(Date) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLiteral(XMLGregorianCalendar) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createLocalCopy(DocumentSource) - Method in interface org.apache.any23.source.LocalCopyFactory
 
createLocalCopy(DocumentSource) - Method in class org.apache.any23.source.MemCopyFactory
 
createStatement(Resource, IRI, Value) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createStatement(Resource, IRI, Value, Resource) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
createSubset(String...) - Static method in class org.apache.any23.rdf.PopularPrefixes
This method perform a prefix lookup.
CSVExtractor - Class in org.apache.any23.extractor.csv
This extractor produces RDF from a CSV file .
CSVExtractor() - Constructor for class org.apache.any23.extractor.csv.CSVExtractor
 
CSVExtractorFactory - Class in org.apache.any23.extractor.csv
 
CSVExtractorFactory() - Constructor for class org.apache.any23.extractor.csv.CSVExtractorFactory
 
CURIE_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 

D

DATATYPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
Date - org.apache.any23.extractor.microdata.ItemPropValue.Type
 
DEFAULT_HTTP_CLIENT_USER_AGENT - Static variable in class org.apache.any23.Any23
Default HTTP User Agent defined in default configuration.
DefaultDOMDocument - Class in org.apache.any23.validator
This class wraps the DOM document.
DefaultDOMDocument(URI, Document) - Constructor for class org.apache.any23.validator.DefaultDOMDocument
 
DefaultHTTPClient - Class in org.apache.any23.http
Opens an InputStream on an HTTP IRI.
DefaultHTTPClient() - Constructor for class org.apache.any23.http.DefaultHTTPClient
 
DefaultHTTPClientConfiguration - Class in org.apache.any23.http
Default implementation of HTTPClientConfiguration.
DefaultHTTPClientConfiguration() - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
Constructor.
DefaultHTTPClientConfiguration(String) - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
Constructor.
DefaultHTTPClientConfiguration(String, int, int, String) - Constructor for class org.apache.any23.http.DefaultHTTPClientConfiguration
Constructor.
DefaultRuleContext - Class in org.apache.any23.validator
Default implementation of RuleContext.
DefaultRuleContext() - Constructor for class org.apache.any23.validator.DefaultRuleContext
 
DefaultValidationReport - Class in org.apache.any23.validator
Default implementation of ValidationReport.
DefaultValidationReport(List<ValidationReport.Issue>, List<ValidationReport.RuleActivation>, List<ValidationReport.Error>) - Constructor for class org.apache.any23.validator.DefaultValidationReport
 
DefaultValidationReportBuilder - Class in org.apache.any23.validator
Default implementation of ValidationReportBuilder.
DefaultValidationReportBuilder() - Constructor for class org.apache.any23.validator.DefaultValidationReportBuilder
 
DefaultValidator - Class in org.apache.any23.validator
Default implementation of Validator.
DefaultValidator() - Constructor for class org.apache.any23.validator.DefaultValidator
 
deferProperties(String...) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
Given a document and a list of itemprop names this method will return such itemprops.
DiscoveryUtils - Class in org.apache.any23.util
This class provides utility methods for discovering classes in packages.
disownProperty(ItemProp) - Method in class org.apache.any23.extractor.microdata.ItemScope
 
DocumentReport - Class in org.apache.any23.extractor.html
Represents the validationReportBuilder generated by a the TagSoupParser when a document is retrieved and validated.
DocumentReport(ValidationReport, Document) - Constructor for class org.apache.any23.extractor.html.DocumentReport
 
documentToInputStream(Document) - Static method in class org.apache.any23.extractor.html.DomUtils
Given a Document this method will return an input stream representing that document.
documentToInputStream(Document) - Static method in class org.apache.any23.util.StreamUtils
Converts a Document to an InputStream
DOMDocument - Interface in org.apache.any23.validator
This interface models a document to be processed by the Validator.
DomUtils - Class in org.apache.any23.extractor.html
This class provides utility methods for DOM manipulation.
dumpContent(File, String) - Static method in class org.apache.any23.util.FileUtils
Dumps the given string within a file.
dumpContent(File, Throwable) - Static method in class org.apache.any23.util.FileUtils
Dumps the stack trace of the given exception into the specified file.

E

ELEMENT_LOCATION - Static variable in class org.apache.any23.extractor.html.TagSoupParser
 
ElementsProcessor - Class in org.apache.any23.extractor.yaml
Converts Object into RDF graph encoded to ElementsProcessor.ModelHolder.
ElementsProcessor.ModelHolder - Class in org.apache.any23.extractor.yaml
A model holder describes the two required parameters which makes a model useful in further processing: a root node and model itself.
EMBEDDED_PROPERTY_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
 
EmbeddedJSONLDExtractor - Class in org.apache.any23.extractor.html
This extractor represents the HTML script tags used to embed blocks of data in documents.
EmbeddedJSONLDExtractor() - Constructor for class org.apache.any23.extractor.html.EmbeddedJSONLDExtractor
 
EmbeddedJSONLDExtractorFactory - Class in org.apache.any23.extractor.html
 
EmbeddedJSONLDExtractorFactory() - Constructor for class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
 
EmptyValidationReport - Class in org.apache.any23.validator
An implementation of ValidationReportBuilder with no data.
END_SCRIPT - Static variable in class org.apache.any23.extractor.html.MicroformatExtractor
 
endDocument(IRI) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
endDocument(IRI) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
endDocument(IRI) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
endDocument(IRI) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
endDocument(IRI) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
endDocument(IRI) - Method in class org.apache.any23.writer.CountingTripleHandler
 
endDocument(IRI) - Method in class org.apache.any23.writer.JSONWriter
Deprecated.
 
endDocument(IRI) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
endDocument(IRI) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
endDocument(IRI) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
endDocument(IRI) - Method in class org.apache.any23.writer.RepositoryWriter
 
endDocument(IRI) - Method in class org.apache.any23.writer.TripleWriterHandler
Writers may override this method to handle an "endDocument" extraction event.
endDocument(IRI) - Method in class org.apache.any23.writer.URIListWriter
 
endRDF() - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
EntityBasedMicroformatExtractor - Class in org.apache.any23.extractor.html
Base class for microformat extractors based on entities.
EntityBasedMicroformatExtractor() - Constructor for class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
 
equals(Object) - Method in class org.apache.any23.extractor.microdata.Item
 
equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemProp
 
equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
equals(Object) - Method in class org.apache.any23.extractor.microdata.ItemScope
 
Error(Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.Error
 
ERROR - org.apache.any23.validator.ValidationReport.IssueLevel
 
ERRORED_META_NODES - Static variable in class org.apache.any23.validator.rule.MetaNameMisuseRule
 
escapeAsJSONString(String) - Static method in class org.apache.any23.util.StringUtils
Escapes the in string as JSON string to let it being embeddable within a string field.
escapeDoubleQuotes(String) - Static method in class org.apache.any23.util.StringUtils
Escapes all the unescaped double quotes when needed.
ExampleInputOutput - Class in org.apache.any23.extractor
A reporter for example input and output of an extractor.
ExampleInputOutput(String) - Constructor for class org.apache.any23.extractor.ExampleInputOutput
 
ExampleInputOutput(ExtractorFactory<?>) - Constructor for class org.apache.any23.extractor.ExampleInputOutput
 
execute(Rule, RuleContext, DOMDocument) - Method in interface org.apache.any23.validator.Fix
Executes this fix over a document.
execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.MetaNameMisuseFix
 
execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.MissingItemscopeAttributeValueFix
 
execute(Rule, RuleContext, DOMDocument) - Method in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
 
extract() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
 
extract() - Method in class org.apache.any23.extractor.html.HCalendarExtractor
 
extract() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Performs the extraction of the data and writes them to the model.
extract(File, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given file sending the generated events to the specified outputHandler.
extract(String, String, String, String, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction on the in string associated to the documentIRI IRI, declaring contentType and encoding.
extract(String, String, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction on the in string associated to the documentIRI IRI, sending the generated events to the specified outputHandler.
extract(String, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given documentIRI sending the generated events to the specified outputHandler.
extract(ExtractionParameters, String, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given documentIRI sending the generated events to the specified outputHandler.
extract(ExtractionParameters, DocumentSource, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
extract(ExtractionParameters, DocumentSource, TripleHandler, String) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
extract(DocumentSource, TripleHandler) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
extract(DocumentSource, TripleHandler, String) - Method in class org.apache.any23.Any23
Performs metadata extraction from the content of the given in document source, sending the generated events to the specified outputHandler.
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.AdrExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
Extracts an entity from a DOM node.
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.GeoExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HCardExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HListingExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HRecipeExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HResumeExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.HReviewExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractor
 
extractEntity(Node, ExtractionResult) - Method in class org.apache.any23.extractor.html.SpeciesExtractor
Extracts an entity from a DOM node.
extractEntityAsEmbeddedProperty(HTMLDocument, BNode, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
 
extractEntityAsEmbeddedProperty(HTMLDocument, BNode, ExtractionResult) - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
 
ExtractionContextBlocker - Class in org.apache.any23.filter
A wrapper around a TripleHandler that can block and unblock calls to the handler, either for the entire document, or for individual ExtractionContexts.
ExtractionContextBlocker(TripleHandler) - Constructor for class org.apache.any23.filter.ExtractionContextBlocker
 
ExtractionReport - Class in org.apache.any23
This class contains some statistics and general information about an extraction.
ExtractionReport(List<Extractor>, String, String, ValidationReport, Map<String, Collection<IssueReport.Issue>>) - Constructor for class org.apache.any23.ExtractionReport
 
ExtractionResultImpl - Class in org.apache.any23.extractor
A default implementation of ExtractionResult; it receives extraction output from one Extractor working on one document, and passes the output on to a TripleHandler.
ExtractionResultImpl(ExtractionContext, Extractor<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.ExtractionResultImpl
 
ExtractorRegistryImpl - Class in org.apache.any23.extractor
Singleton class acting as a register for all the various Extractor.
ExtractorRegistryImpl() - Constructor for class org.apache.any23.extractor.ExtractorRegistryImpl
Public constructor for ExtractorRegistryImpl.
extractors() - Method in annotation type org.apache.any23.extractor.html.annotations.Includes
 
extractors() - Method in annotation type org.apache.any23.extractor.html.microformats2.annotations.Includes
 
extractPrefixSections(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
Given a prefix declaration returns a list of prefixID:prefixURL strings normalizing blanks where present.
extractRelTag(String) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Extracts the href specific rel-tag string.
extractRelTag(NamedNodeMap) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Extracts the href specific rel-tag string.
extractRelTagNodes() - Method in class org.apache.any23.extractor.html.HTMLDocument
Extracts all the rel tag nodes.

F

FAMILY_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
 
FIELDS - Static variable in class org.apache.any23.extractor.html.HCardName
 
FileDocumentSource - Class in org.apache.any23.source
File implementation of DocumentSource.
FileDocumentSource(File) - Constructor for class org.apache.any23.source.FileDocumentSource
 
FileDocumentSource(File, String) - Constructor for class org.apache.any23.source.FileDocumentSource
 
FileUtils - Class in org.apache.any23.util
Utility class for handling files.
find(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
find(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Gets the string value of an XPath expression.
findAll(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
findAll(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Returns a NodeList composed of all the nodes that match an XPath expression, which must be valid.
findAllByAttributeContains(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
 
findAllByAttributeName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Finds all nodes that have a declared attribute.
findAllByClassName(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Finds all the nodes by class name.
findAllByClassName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Finds all nodes that have a declared class.
findAllByTag(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
 
findAllByTagAndClassName(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
 
findMicroformattedObjectNode(String, String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
findMicroformattedValue(String, String, String, String, String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
findNodeById(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
findNodeById(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Mimics the JS DOM API, or prototype's $()
Fix - Interface in org.apache.any23.validator
This interface models a fix for an issue.
fixAbsoluteIRI(String) - Static method in class org.apache.any23.rdf.RDFUtils
Fixes typical errors in an absolute org.eclipse.rdf4j.model.IRI, such as unescaped spaces.
FixError(Fix, Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.FixError
 
fixIRI(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
fixIRIWithException(String) - Static method in class org.apache.any23.rdf.RDFUtils
Tries to fix a potentially broken relative or absolute URI.
fixLink(String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
fixLink(String, String) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
fixLink(String, String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
Helper method to conditionally add a schema to a URI unless it's there, or null if link is empty.
fixWhiteSpace(String) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
formatDateTime(Date) - Static method in class org.apache.any23.extractor.microdata.ItemPropValue
 
fromStrings(Collection<String>) - Static method in class org.apache.any23.http.AcceptHeaderBuilder
 
FunctionalSyntaxExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor handling OWL2 Functional-Style Syntax format.
FunctionalSyntaxExtractor() - Constructor for class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractor
 
FunctionalSyntaxExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractor
 
FunctionalSyntaxExtractorFactory - Class in org.apache.any23.extractor.rdf
 
FunctionalSyntaxExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
 

G

GeoExtractor - Class in org.apache.any23.extractor.html
Extractor for the Geo microformat.
GeoExtractor() - Constructor for class org.apache.any23.extractor.html.GeoExtractor
 
GeoExtractorFactory - Class in org.apache.any23.extractor.html
 
GeoExtractorFactory() - Constructor for class org.apache.any23.extractor.html.GeoExtractorFactory
 
get() - Static method in class org.apache.any23.rdf.PopularPrefixes
 
getAcceptHeader() - Method in class org.apache.any23.http.AcceptHeaderBuilder
Builds and returns an accept header.
getAcceptHeader() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
getAcceptHeader() - Method in interface org.apache.any23.http.HTTPClientConfiguration
 
getActualDocumentIRI() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getActualDocumentIRI() - Method in interface org.apache.any23.http.HTTPClient
Returns the actual IRI from which the document was fetched.
getAllNames() - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
Returns the names of all registered extractors, sorted alphabetically.
getAllRules() - Method in class org.apache.any23.validator.DefaultValidator
 
getAllRules() - Method in interface org.apache.any23.validator.Validator
Returns all the registered rules.
getAsDate() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsFloat() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsInteger() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsLink() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsNested() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getAsPlainLiteral(Node, String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
getAsXMLLiteral(Node) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.AdrExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
Returns the base class name for the extractor.
getBaseClassName() - Method in class org.apache.any23.extractor.html.GeoExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HCardExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HListingExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HResumeExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.HReviewExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractor
 
getBaseClassName() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
Returns the base class name for the extractor.
getBeginColumnNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
 
getBeginLineNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
 
getBlankNodeFor(Node) - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
 
getBNode(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a BNode with an MD5 digest as part of the ID.
getCause() - Method in class org.apache.any23.validator.ValidationReport.Error
 
getChilds() - Method in class org.apache.any23.writer.CompositeTripleHandler
 
getClassesInPackage(String) - Static method in class org.apache.any23.util.DiscoveryUtils
Scans all classes accessible from the context class loader which belong to the given package and sub-packages.
getClassesInPackage(String, Class<?>) - Static method in class org.apache.any23.util.DiscoveryUtils
Scans all classes accessible from the context class loader which belong to the given package and sub-packages and filter them by ones implementing the specified interface iface.
getConnectionTimeout() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getContent() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getContentLength() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getContentLength() - Method in interface org.apache.any23.http.HTTPClient
 
getContentLength() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
getContentLength() - Method in class org.apache.any23.source.FileDocumentSource
 
getContentLength() - Method in class org.apache.any23.source.HTTPDocumentSource
 
getContentLength() - Method in class org.apache.any23.source.StringDocumentSource
 
getContentType() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getContentType() - Method in interface org.apache.any23.http.HTTPClient
The value of the Content-Type header reported by the server.
getContentType() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
getContentType() - Method in class org.apache.any23.source.FileDocumentSource
 
getContentType() - Method in class org.apache.any23.source.HTTPDocumentSource
 
getContentType() - Method in class org.apache.any23.source.StringDocumentSource
 
getCount() - Method in class org.apache.any23.writer.CountingTripleHandler
 
getCurrentExtractionResult() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Returns the ExtractionResult associated to the extraction session.
getData(String) - Method in class org.apache.any23.validator.DefaultRuleContext
 
getData(String) - Method in interface org.apache.any23.validator.RuleContext
Retrieves a registered object.
getDefaultLanguage() - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the document default language.
getDefaultLiteralLanguage() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
getDefaultTimeout() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
getDefaultTimeout() - Method in interface org.apache.any23.http.HTTPClientConfiguration
 
getDescription() - Method in class org.apache.any23.extractor.calendar.ICalExtractor
 
getDescription() - Method in class org.apache.any23.extractor.calendar.JCalExtractor
 
getDescription() - Method in class org.apache.any23.extractor.calendar.XCalExtractor
 
getDescription() - Method in class org.apache.any23.extractor.csv.CSVExtractor
getDescription() - Method in class org.apache.any23.extractor.html.AdrExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.GeoExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HCalendarExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HCardExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HeadLinkExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HListingExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HResumeExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HReviewExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.HTMLMetaExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.ICBMExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.LicenseExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
Returns the description of this extractor.
getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
Returns the description of this extractor.
getDescription() - Method in class org.apache.any23.extractor.html.TitleExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractor
 
getDescription() - Method in class org.apache.any23.extractor.html.XFNExtractor
 
getDescription() - Method in class org.apache.any23.extractor.microdata.MicrodataExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.NQuadsExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.NTriplesExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.TriXExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdf.TurtleExtractor
 
getDescription() - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
getDescription() - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
 
getDescription() - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 
getDescription() - Method in class org.apache.any23.extractor.yaml.YAMLExtractor
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.calendar.ICalExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.calendar.JCalExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.calendar.XCalExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.csv.CSVExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.AdrExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.GeoExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HCalendarExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HCardExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HeadLinkExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HListingExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HRecipeExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HResumeExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HReviewExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.ICBMExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.LicenseExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.SpeciesExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.TitleExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.html.XFNExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.TriXExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdf.TurtleExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.xpath.XPathExtractorFactory
 
getDescriptionInstance() - Static method in class org.apache.any23.extractor.yaml.YAMLExtractorFactory
 
getDetectedItemScopes() - Method in class org.apache.any23.extractor.microdata.MicrodataParserReport
 
getDetectedMimeType() - Method in class org.apache.any23.ExtractionReport
 
getDetectedMIMEType() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Returns the detected mimetype for the given DocumentSource.
getDocument() - Method in class org.apache.any23.extractor.html.DocumentReport
 
getDocument() - Method in class org.apache.any23.extractor.html.HTMLDocument
 
getDocumentBase(URL, Document) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
getDocumentIRI() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
getDocumentIRI() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
getDocumentIRI() - Method in class org.apache.any23.source.FileDocumentSource
 
getDocumentIRI() - Method in class org.apache.any23.source.HTTPDocumentSource
 
getDocumentIRI() - Method in class org.apache.any23.source.StringDocumentSource
 
getDocumentIRI() - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getDocumentIRI() - Method in interface org.apache.any23.validator.DOMDocument
 
getDOM() - Method in class org.apache.any23.extractor.html.TagSoupParser
Returns the DOM of the given document IRI.
getEncoding() - Method in class org.apache.any23.ExtractionReport
 
getEndColumnNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
 
getEndLineNumber() - Method in class org.apache.any23.extractor.html.TagSoupParser.ElementLocation
 
getErrorLocationBeginCol() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrorLocationBeginRow() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrorLocationEndCol() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrorLocationEndRow() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrorMode() - Method in class org.apache.any23.extractor.microdata.MicrodataParser
 
getErrorPath() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
getErrors() - Method in class org.apache.any23.extractor.microdata.MicrodataParser
 
getErrors() - Method in class org.apache.any23.extractor.microdata.MicrodataParserReport
 
getErrors() - Method in class org.apache.any23.validator.DefaultValidationReport
 
getErrors() - Method in class org.apache.any23.validator.EmptyValidationReport
 
getErrors() - Method in interface org.apache.any23.validator.ValidationReport
Returns the list of detected errors.
getExampleInput() - Method in class org.apache.any23.extractor.ExampleInputOutput
 
getExampleInput() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getExampleIRI() - Method in class org.apache.any23.extractor.ExampleInputOutput
 
getExampleOutput() - Method in class org.apache.any23.extractor.ExampleInputOutput
 
getExtractionContext() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getExtractionContext() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
getExtractor() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getExtractor() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
getExtractorGroup() - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
 
getExtractorGroup(List<String>) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
Returns an ExtractorGroup containing the ExtractorFactory mathing the names provided as input.
getExtractorIssues(String) - Method in class org.apache.any23.ExtractionReport
 
getExtractorLabel() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getExtractorName() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getExtractorNames() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
getExtractorToIssues() - Method in class org.apache.any23.extractor.SingleDocumentExtractionReport
 
getFactory(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
Retrieves a ExtractorFactory given its name
getField(String) - Method in class org.apache.any23.extractor.html.HCardName
 
getFields(String) - Method in class org.apache.any23.extractor.html.HCardName
 
getFixes(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
 
getFixes(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
Returns all fixes registered for the give rule.
getFormatByExtension(String) - Static method in class org.apache.any23.rdf.RDFUtils
Returns a parser type from the given extension.
getFormats() - Static method in class org.apache.any23.rdf.RDFUtils
Returns all the available RDFFormats.
getFullName() - Method in class org.apache.any23.extractor.html.HCardName
 
getFunctionalSyntaxParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured FunctionalSyntaxParser.
getGraph() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
getHRName() - Method in interface org.apache.any23.validator.Fix
 
getHRName() - Method in class org.apache.any23.validator.rule.AboutNotURIRule
 
getHRName() - Method in interface org.apache.any23.validator.Rule
 
getHRName() - Method in class org.apache.any23.validator.rule.MetaNameMisuseFix
 
getHRName() - Method in class org.apache.any23.validator.rule.MetaNameMisuseRule
 
getHRName() - Method in class org.apache.any23.validator.rule.MissingItemscopeAttributeValueFix
 
getHRName() - Method in class org.apache.any23.validator.rule.MissingItemscopeAttributeValueRule
 
getHRName() - Method in class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
 
getHRName() - Method in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
 
getHTMLDocument() - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
getHTTPClient() - Method in class org.apache.any23.Any23
Returns the current HTTPClient implementation.
getHTTPUserAgent() - Method in class org.apache.any23.Any23
Returns the HTTP Header User Agent, see RFC 2616-14.43.
getId() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getIdentifier() - Method in class org.apache.any23.writer.JSONLDWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.JSONWriterFactory
Deprecated.
 
getIdentifier() - Method in class org.apache.any23.writer.NQuadsWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.NTriplesWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.TriXWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.TurtleWriterFactory
 
getIdentifier() - Method in class org.apache.any23.writer.URIListWriterFactory
 
getIndexInParent(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Given a node this method returns the index corresponding to such node within the list of the children of its parent node.
getInstance() - Static method in class org.apache.any23.extractor.ExtractorRegistryImpl
 
getInstance() - Static method in class org.apache.any23.extractor.rdf.RDFParserFactory
 
getInstance() - Static method in class org.apache.any23.extractor.yaml.ElementsProcessor
 
getInstance() - Static method in class org.apache.any23.validator.EmptyValidationReport
 
getInternalValue() - Method in class org.apache.any23.extractor.xpath.Term
 
getIssueReport() - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
getIssues() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getIssues() - Method in class org.apache.any23.validator.DefaultValidationReport
 
getIssues() - Method in class org.apache.any23.validator.EmptyValidationReport
 
getIssues() - Method in interface org.apache.any23.validator.ValidationReport
Returns the list of detected issues.
getIssuesCount() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getItemId() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getItemPropNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the itemProps detected within the given root node.
getItemProps(Node, boolean) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the itemprops for the given itemscope node.
getItemScope(Node) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns the ItemScope instance described within the specified node.
getItemScopeNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the itemScopes detected within the given root node.
getJSONLDParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured SesameJSONLDParser.
getKey(ExtractorFactory) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
 
getLevel() - Method in class org.apache.any23.validator.ValidationReport.Issue
 
getManchesterSyntaxParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured ManchesterSyntaxParser.
getMapping(String) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
Returns a IRI mapping for a given prefix.
getMatchingExtractors() - Method in class org.apache.any23.ExtractionReport
 
getMatchingExtractors() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
 
getMaxConnections() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
getMaxConnections() - Method in interface org.apache.any23.http.HTTPClientConfiguration
 
getMessage() - Method in class org.apache.any23.validator.ValidationReport.Error
 
getMessage() - Method in class org.apache.any23.validator.ValidationReport.Issue
 
getMicrodata(Document) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the Microdata items detected within the given document, works in full report mode.
getMicrodata(Document, MicrodataParser.ErrorMode) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns all the Microdata items detected within the given document.
getMicrodataAsJSON(Document, PrintStream) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns a JSON containing the list of all extracted Microdata, as described at Microdata JSON Specification.
getModel() - Method in class org.apache.any23.extractor.yaml.ElementsProcessor.ModelHolder
 
getName() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
getName() - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
 
getName() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
getName() - Method in class org.apache.any23.extractor.xpath.Variable
 
getName() - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
 
getNode(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getNode(String) - Method in interface org.apache.any23.validator.DOMDocument
Returns the node addressed by the given XPath, if more then one an exception will be raised.
getNodeLocation(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Returns the row/col location of the given node.
getNodes(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getNodes(String) - Method in interface org.apache.any23.validator.DOMDocument
Returns the list of nodes addressed by the given XPath.
getNodesWithAttribute(String) - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getNodesWithAttribute(String) - Method in interface org.apache.any23.validator.DOMDocument
Returns all the nodes declaring an attribute with the specified name.
getNQuadsParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured NQuadsParser.
getNTriplesParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured NTriplesParser.
getObject() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getObject() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
getOrganization() - Method in class org.apache.any23.extractor.html.HCardName
 
getOrganizationUnit() - Method in class org.apache.any23.extractor.html.HCardName
 
getOrigin() - Method in class org.apache.any23.validator.ValidationReport.FixError
 
getOrigin() - Method in class org.apache.any23.validator.ValidationReport.Issue
 
getOrigin() - Method in class org.apache.any23.validator.ValidationReport.RuleError
 
getOriginalDocument() - Method in class org.apache.any23.validator.DefaultDOMDocument
 
getOriginalDocument() - Method in interface org.apache.any23.validator.DOMDocument
Returns the original document.
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
Deprecated.
since 2.4. This extractor no longer wraps an RDF4J RDFParser. Do not use this method.
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.NQuadsExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.NTriplesExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFXMLExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.TriXExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.TurtleExtractor
 
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
Deprecated.
since 2.4. This extractor no longer wraps an RDF4J RDFParser. Do not use this method.
getParser(ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
Deprecated.
since 2.4. This extractor no longer wraps an RDF4J RDFParser. Do not use this method.
getParser(RDFFormat) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a new RDFParser instance.
getParserEncoding() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
 
getPath() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getPath() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
getPathToLocalRoot() - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the sequence of ancestors from the document root to the local root (document).
getPluralTextField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns a plural text field.
getPluralUrlField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the list of URLs associated to the fields marked with class className.
getPredicate() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
getPrefixes() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getProperties() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getProperty() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getPropertyPaths() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getPropertyPaths() - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
Returns all the collected property paths.
getPropertyValue(Node) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
Reads the value of a itemprop node.
getRDFa10Parser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured RDFaParser, set to RDFa-1.0 compatibility mode.
getRDFa11Parser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured RDFaParser, set to RDFa-1.1 compatibility mode.
getRDFXMLParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured RDFXMLParser.
getReason() - Method in exception org.apache.any23.extractor.yaml.YAMLProcessorException
 
getRefs() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getReport() - Method in class org.apache.any23.extractor.html.DocumentReport
 
getReport() - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
getReport() - Method in interface org.apache.any23.validator.ValidationReportBuilder
 
getResourceRoots() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
getResourceRoots() - Method in interface org.apache.any23.extractor.TagSoupExtractionResult
Returns all the collected property roots.
getRoot() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
getRoot() - Method in class org.apache.any23.extractor.yaml.ElementsProcessor.ModelHolder
 
getRuleActivations() - Method in class org.apache.any23.validator.DefaultValidationReport
 
getRuleActivations() - Method in class org.apache.any23.validator.EmptyValidationReport
 
getRuleActivations() - Method in interface org.apache.any23.validator.ValidationReport
Returns the list of activated rules.
getRuleStr() - Method in class org.apache.any23.validator.ValidationReport.RuleActivation
 
getSingularTextField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns a singular text field.
getSingularUrlField(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the URL associated to the field marked with class className.
getSoTimeout() - Method in class org.apache.any23.http.DefaultHTTPClient
 
getSubject() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
getSubject() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
getSupportedMIMETypes() - Method in class org.apache.any23.extractor.SimpleExtractorFactory
 
getSupportedSettings() - Method in class org.apache.any23.writer.JSONLDWriterFactory
 
getSupportedSettings() - Method in class org.apache.any23.writer.JSONWriterFactory
Deprecated.
 
getSupportedSettings() - Method in class org.apache.any23.writer.NQuadsWriterFactory
 
getSupportedSettings() - Method in class org.apache.any23.writer.NTriplesWriterFactory
 
getSupportedSettings() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
 
getSupportedSettings() - Method in class org.apache.any23.writer.TriXWriterFactory
 
getSupportedSettings() - Method in class org.apache.any23.writer.TurtleWriterFactory
 
getSupportedSettings() - Method in class org.apache.any23.writer.URIListWriterFactory
 
getText() - Method in class org.apache.any23.extractor.html.HTMLDocument
Returns the text contained inside a node if leaf, null otherwise.
getTopLevelItemScopeNodes(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Returns only the itemScopes that are top level items.
getTotalDocuments() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
getTotalTriples() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
getTripleFormat() - Method in class org.apache.any23.writer.JSONLDWriterFactory
 
getTripleFormat() - Method in class org.apache.any23.writer.JSONWriterFactory
Deprecated.
 
getTripleFormat() - Method in class org.apache.any23.writer.NQuadsWriterFactory
 
getTripleFormat() - Method in class org.apache.any23.writer.NTriplesWriterFactory
 
getTripleFormat() - Method in class org.apache.any23.writer.RDFXMLWriterFactory
 
getTripleFormat() - Method in class org.apache.any23.writer.TriXWriterFactory
 
getTripleFormat() - Method in class org.apache.any23.writer.TurtleWriterFactory
 
getTripleFormat() - Method in class org.apache.any23.writer.URIListWriterFactory
 
getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.JSONLDWriterFactory
 
getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.JSONWriterFactory
Deprecated.
 
getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.NQuadsWriterFactory
 
getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.NTriplesWriterFactory
 
getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.RDFXMLWriterFactory
 
getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.TriXWriterFactory
 
getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.TurtleWriterFactory
 
getTripleWriter(OutputStream, Settings) - Method in class org.apache.any23.writer.URIListWriterFactory
 
getTriXParser(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured TriXParser.
getTurtleParserInstance(boolean, boolean, ExtractionContext, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.RDFParserFactory
Returns a new instance of a configured TurtleParser.
getType() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
getType() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
getUriRegex() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
getUserAgent() - Method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
getUserAgent() - Method in interface org.apache.any23.http.HTTPClientConfiguration
 
getValidatedDOM(boolean) - Method in class org.apache.any23.extractor.html.TagSoupParser
Returns the validated DOM and applies fixes on it if applyFix is set to true.
getValidationReport() - Method in class org.apache.any23.ExtractionReport
 
getValidationReport() - Method in class org.apache.any23.extractor.SingleDocumentExtractionReport
 
getValue() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
getValue(Map<String, String>) - Method in class org.apache.any23.extractor.xpath.Term
Returns the value represented by this Term given the varMapping, the #isVar and #internalValue parameters.
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateGraph
 
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateObject
 
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplatePredicate
 
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.TemplateSubject
 
getValueInternal(String) - Method in class org.apache.any23.extractor.xpath.Term
 
getWriter(RDFFormat, OutputStream) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a new RDFWriter instance.
getWriter(RDFFormat, Writer) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a new RDFWriter instance.
getxPath() - Method in class org.apache.any23.extractor.xpath.Variable
 
getXpath() - Method in class org.apache.any23.extractor.microdata.Item
 
getXPathForNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Does a reverse walking of the DOM tree to generate a unique XPath expression leading to this node.
getXPathListForNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Returns a list of tag names representing the path from the document root to the given node n.
getXSDDate(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
This method allows to obtain an XML Schema compliant date providing a textual representation of a date and textual a pattern for parsing it.
GIVEN_NAME - Static variable in class org.apache.any23.extractor.html.HCardName
 

H

HAdrExtractor - Class in org.apache.any23.extractor.html.microformats2
Extractor for the h-adr microformat.
HAdrExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HAdrExtractor
 
HAdrExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
 
HAdrExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
 
handleComment(String) - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
handleNamespace(String, String) - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
handleStatement(Statement) - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
hasAnyField() - Method in class org.apache.any23.extractor.html.HCardName
 
hasAttribute(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Checks the presence of an attribute in the given node.
hasAttribute(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Checks the presence of an attribute value in attributes that contain whitespace-separated lists of values.
hasClassName(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Tells if an element has a class name not checking the parents in the hierarchy mimicking the CSS .foo match.
hasField(String) - Method in class org.apache.any23.extractor.html.HCardName
 
hashCode() - Method in class org.apache.any23.extractor.microdata.Item
 
hashCode() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
hashCode() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
hashCode() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
hasIssues() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
hasMatchingExtractors() - Method in class org.apache.any23.ExtractionReport
 
hasMatchingExtractors() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Check whether the given DocumentSource content activates of not at least an extractor.
HCalendarExtractor - Class in org.apache.any23.extractor.html
Extractor for the hCalendar microformat.
HCalendarExtractor() - Constructor for class org.apache.any23.extractor.html.HCalendarExtractor
 
HCalendarExtractorFactory - Class in org.apache.any23.extractor.html
 
HCalendarExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HCalendarExtractorFactory
 
HCardExtractor - Class in org.apache.any23.extractor.html
Extractor for the hCard microformat.
HCardExtractor - Class in org.apache.any23.extractor.html.microformats2
Extractor for the h-Card microformat.
HCardExtractor() - Constructor for class org.apache.any23.extractor.html.HCardExtractor
 
HCardExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HCardExtractor
 
HCardExtractorFactory - Class in org.apache.any23.extractor.html
 
HCardExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
 
HCardExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HCardExtractorFactory
 
HCardExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
 
HCardName - Class in org.apache.any23.extractor.html
An HCard name, consisting of various parts.
HCardName() - Constructor for class org.apache.any23.extractor.html.HCardName
 
HEAD_TAG - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
HeadLinkExtractor - Class in org.apache.any23.extractor.html
This Extractor.TagSoupDOMExtractor implementation retrieves the LINKs declared within the HTML/HEAD page header.
HeadLinkExtractor() - Constructor for class org.apache.any23.extractor.html.HeadLinkExtractor
 
HeadLinkExtractorFactory - Class in org.apache.any23.extractor.html
 
HeadLinkExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HeadLinkExtractorFactory
 
HEntryExtractor - Class in org.apache.any23.extractor.html.microformats2
Extractor for the h-entry microformat.
HEntryExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HEntryExtractor
 
HEntryExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
Extractor for the h-entry microformat.
HEntryExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
 
HEventExtractor - Class in org.apache.any23.extractor.html.microformats2
Extractor for the h-event microformat.
HEventExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HEventExtractor
 
HEventExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
 
HEventExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
 
HGeoExtractor - Class in org.apache.any23.extractor.html.microformats2
Extractor for the h-geo microformat.
HGeoExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HGeoExtractor
 
HGeoExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
 
HGeoExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
 
HItemExtractor - Class in org.apache.any23.extractor.html.microformats2
Extractor for the h-item microformat.
HItemExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HItemExtractor
 
HItemExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
 
HItemExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
 
HListingExtractor - Class in org.apache.any23.extractor.html
Extractor for the hListing microformat.
HListingExtractor() - Constructor for class org.apache.any23.extractor.html.HListingExtractor
 
HListingExtractorFactory - Class in org.apache.any23.extractor.html
 
HListingExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HListingExtractorFactory
 
HONORIFIC_PREFIX - Static variable in class org.apache.any23.extractor.html.HCardName
 
HONORIFIC_SUFFIX - Static variable in class org.apache.any23.extractor.html.HCardName
 
HProductExtractor - Class in org.apache.any23.extractor.html.microformats2
Extractor for the h-product microformat.
HProductExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HProductExtractor
 
HProductExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
 
HProductExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
 
HRecipeExtractor - Class in org.apache.any23.extractor.html
Extractor for the hRecipe microformat.
HRecipeExtractor - Class in org.apache.any23.extractor.html.microformats2
Extractor for the hRecipe microformat.
HRecipeExtractor() - Constructor for class org.apache.any23.extractor.html.HRecipeExtractor
 
HRecipeExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
 
HRecipeExtractorFactory - Class in org.apache.any23.extractor.html
 
HRecipeExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
 
HRecipeExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HRecipeExtractorFactory
 
HRecipeExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
 
HREF_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
HREF_TAGS - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
List of tags providing the href property.
HResumeExtractor - Class in org.apache.any23.extractor.html
Extractor for the hResume microformat.
HResumeExtractor - Class in org.apache.any23.extractor.html.microformats2
Extractor for the hResume microformat.
HResumeExtractor() - Constructor for class org.apache.any23.extractor.html.HResumeExtractor
 
HResumeExtractor() - Constructor for class org.apache.any23.extractor.html.microformats2.HResumeExtractor
 
HResumeExtractorFactory - Class in org.apache.any23.extractor.html
 
HResumeExtractorFactory - Class in org.apache.any23.extractor.html.microformats2
 
HResumeExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HResumeExtractorFactory
 
HResumeExtractorFactory() - Constructor for class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
 
HReviewAggregateExtractor - Class in org.apache.any23.extractor.html
Extractor for the hReview-aggregate microformat.
HReviewAggregateExtractor() - Constructor for class org.apache.any23.extractor.html.HReviewAggregateExtractor
 
HReviewAggregateExtractorFactory - Class in org.apache.any23.extractor.html
 
HReviewAggregateExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
 
HReviewExtractor - Class in org.apache.any23.extractor.html
Extractor for the hReview microformat.
HReviewExtractor() - Constructor for class org.apache.any23.extractor.html.HReviewExtractor
 
HReviewExtractorFactory - Class in org.apache.any23.extractor.html
 
HReviewExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HReviewExtractorFactory
 
HTMLDocument - Class in org.apache.any23.extractor.html
A wrapper around the DOM representation of an HTML document.
HTMLDocument(Node) - Constructor for class org.apache.any23.extractor.html.HTMLDocument
Constructor accepting the root node.
HTMLDocument.TextField - Class in org.apache.any23.extractor.html
This class represents a text extracted from the HTML DOM related to the node from which such test has been retrieved.
HTMLMetaExtractor - Class in org.apache.any23.extractor.html
This extractor represents the HTML META tag values according the HTML4 specification.
HTMLMetaExtractor() - Constructor for class org.apache.any23.extractor.html.HTMLMetaExtractor
 
HTMLMetaExtractorFactory - Class in org.apache.any23.extractor.html
 
HTMLMetaExtractorFactory() - Constructor for class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
 
HTTPClient - Interface in org.apache.any23.http
Abstraction for opening an InputStream on an HTTP IRI.
HTTPClientConfiguration - Interface in org.apache.any23.http
Defines the HTTPClient configuration.
HTTPDocumentSource - Class in org.apache.any23.source
Http implementation of DocumentSource.
HTTPDocumentSource(HTTPClient, String) - Constructor for class org.apache.any23.source.HTTPDocumentSource
 

I

ICalExtractor - Class in org.apache.any23.extractor.calendar
 
ICalExtractor() - Constructor for class org.apache.any23.extractor.calendar.ICalExtractor
 
ICalExtractorFactory - Class in org.apache.any23.extractor.calendar
 
ICalExtractorFactory() - Constructor for class org.apache.any23.extractor.calendar.ICalExtractorFactory
 
ICBMExtractor - Class in org.apache.any23.extractor.html
Extractor for "ICBM coordinates" provided as META headers in the head of an HTML page.
ICBMExtractor() - Constructor for class org.apache.any23.extractor.html.ICBMExtractor
 
ICBMExtractorFactory - Class in org.apache.any23.extractor.html
 
ICBMExtractorFactory() - Constructor for class org.apache.any23.extractor.html.ICBMExtractorFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.JSONLDWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.JSONWriterFactory
Deprecated.
 
IDENTIFIER - Static variable in class org.apache.any23.writer.NQuadsWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.NTriplesWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.RDFXMLWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.TriXWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.TurtleWriterFactory
 
IDENTIFIER - Static variable in class org.apache.any23.writer.URIListWriterFactory
 
IgnoreAccidentalRDFa - Class in org.apache.any23.filter
A TripleHandler that suppresses output of the RDFa parser if the document only contains "accidental" RDFa, like stylesheet links and other non-RDFa uses of HTML's
IgnoreAccidentalRDFa(TripleHandler) - Constructor for class org.apache.any23.filter.IgnoreAccidentalRDFa
 
IgnoreAccidentalRDFa(TripleHandler, boolean) - Constructor for class org.apache.any23.filter.IgnoreAccidentalRDFa
Constructor.
IgnoreTitlesOfEmptyDocuments - Class in org.apache.any23.filter
A TripleHandler that suppresses output of the TitleExtractor unless some other triples could be parsed from the document.
IgnoreTitlesOfEmptyDocuments(TripleHandler) - Constructor for class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
implementJavaNaming(String) - Static method in class org.apache.any23.util.StringUtils
Changes string with following convention: Changes '-' -> '_' remove space characters and make first letter word uppercase: 'some string' -> 'someString' If input string does not contains a whitespace than return unchanged.
includes(Class<? extends MicroformatExtractor>, Class<? extends MicroformatExtractor>) - Static method in class org.apache.any23.extractor.html.MicroformatExtractor
This method checks if there is a native nesting relationship between two MicroformatExtractor.
Includes - Annotation Type in org.apache.any23.extractor.html.annotations
This annotation could be used to decorate a MicroformatExtractor to represent which of the other Microformats could it nest.
Includes - Annotation Type in org.apache.any23.extractor.html.microformats2.annotations
This annotation could be used to decorate a MicroformatExtractor to represent which of the other Microformats could it nest.
INFO - org.apache.any23.validator.ValidationReport.IssueLevel
 
init(HTTPClientConfiguration) - Method in class org.apache.any23.http.DefaultHTTPClient
 
init(HTTPClientConfiguration) - Method in interface org.apache.any23.http.HTTPClient
Initializes the HTTP client.
inputStreamToDocument(InputStream) - Static method in class org.apache.any23.util.StreamUtils
 
iri(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a IRI.
iri(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a IRI.
IRI_PATH_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
IRI_PREFIX_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
IRI_SCHEMA_SEPARATOR - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
isAbsoluteIRI(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
isAbsoluteIRI(String) - Static method in class org.apache.any23.rdf.RDFUtils
Checks if href is absolute or not.
isAncestorOf(Node, Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Checks whether a node is ancestor or same of another node.
isAncestorOf(Node, Node, boolean) - Static method in class org.apache.any23.extractor.html.DomUtils
Checks whether a node is ancestor or same of another node.
isAnnotated() - Method in class org.apache.any23.writer.JSONWriter
Deprecated.
 
isAnnotated() - Method in class org.apache.any23.writer.RDFWriterTripleHandler
If true then the produced RDF is annotated with the extractors used to generate the specific statements.
isAnnotated() - Method in class org.apache.any23.writer.URIListWriter
 
isCURIE(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
isCURIEBNode(String) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
isDate() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isDocBlocked() - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
isElementNode(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Verifies if the given target node is an element.
isFloat() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isInteger() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isItemProp(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Check whether a node is an itemProp.
isItemScope(Node) - Static method in class org.apache.any23.extractor.microdata.MicrodataParser
Check whether a node is an itemScope.
isLink() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isLocal() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
isLocal() - Method in class org.apache.any23.source.FileDocumentSource
 
isLocal() - Method in class org.apache.any23.source.HTTPDocumentSource
 
isLocal() - Method in class org.apache.any23.source.StringDocumentSource
 
isMultiField(String) - Method in class org.apache.any23.extractor.html.HCardName
 
isNested() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isNumber() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isOnline(String) - Static method in class org.apache.any23.util.URLUtils
Verifies if the specified URL is reachable online.
isPlain() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
isPrefix(String, String) - Static method in class org.apache.any23.util.StringUtils
Check whether string candidatePrefix is prefix of string container.
isRegisteredName(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
 
isRelativeNode(Node) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
isStopAtFirstError() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
Issue(ValidationReport.IssueLevel, String, Node) - Constructor for class org.apache.any23.validator.ValidationReport.Issue
 
isSuffix(String, String) - Static method in class org.apache.any23.util.StringUtils
Check whether string candidateSuffix is suffix of string container.
isVar() - Method in class org.apache.any23.extractor.xpath.Term
 
isVerifyDataType() - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
isXMLNSDeclared(Document) - Static method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
Item - Class in org.apache.any23.extractor.microdata
Base class for Microdata items.
Item(String) - Constructor for class org.apache.any23.extractor.microdata.Item
Constructor.
ItemProp - Class in org.apache.any23.extractor.microdata
Describes a Microdata item property.
ItemProp(String, String, ItemPropValue) - Constructor for class org.apache.any23.extractor.microdata.ItemProp
Constructor.
ITEMPROP_ATTRIBUTE - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
 
ItemPropValue - Class in org.apache.any23.extractor.microdata
Describes a possible value for a Microdata item property.
ItemPropValue(Object, ItemPropValue.Type) - Constructor for class org.apache.any23.extractor.microdata.ItemPropValue
Constructor.
ItemPropValue.Type - Enum in org.apache.any23.extractor.microdata
Supported types.
ItemScope - Class in org.apache.any23.extractor.microdata
This class describes a Microdata itemscope.
ItemScope(String, ItemProp[], String, String[], String, String) - Constructor for class org.apache.any23.extractor.microdata.ItemScope
Constructor.
ITEMSCOPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
 

J

JCalExtractor - Class in org.apache.any23.extractor.calendar
 
JCalExtractor() - Constructor for class org.apache.any23.extractor.calendar.JCalExtractor
 
JCalExtractorFactory - Class in org.apache.any23.extractor.calendar
 
JCalExtractorFactory() - Constructor for class org.apache.any23.extractor.calendar.JCalExtractorFactory
 
join(String, String...) - Static method in class org.apache.any23.util.StringUtils
Joins the given input sting data list using the specified delimiter.
JSONLDExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor handling JSON-LD format.
JSONLDExtractor() - Constructor for class org.apache.any23.extractor.rdf.JSONLDExtractor
 
JSONLDExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.JSONLDExtractor
Deprecated.
since 2.4. This extractor has never supported these settings. Use JSONLDExtractor() instead.
JSONLDExtractorFactory - Class in org.apache.any23.extractor.rdf
 
JSONLDExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
 
JSONLDWriter - Class in org.apache.any23.writer
Implementation of JSON-LD TripleWriter.
JSONLDWriter(OutputStream) - Constructor for class org.apache.any23.writer.JSONLDWriter
 
JSONLDWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.JSONLDWriter
 
JSONLDWriterFactory - Class in org.apache.any23.writer
 
JSONLDWriterFactory() - Constructor for class org.apache.any23.writer.JSONLDWriterFactory
 
JSONWriter - Class in org.apache.any23.writer
Deprecated.
since 2.3. Use JSONLDWriter instead.
JSONWriter(OutputStream) - Constructor for class org.apache.any23.writer.JSONWriter
Deprecated.
 
JSONWriterFactory - Class in org.apache.any23.writer
Deprecated.
since 2.3. Use JSONLDWriterFactory instead.
JSONWriterFactory() - Constructor for class org.apache.any23.writer.JSONWriterFactory
Deprecated.
 
JsoupUtils - Class in org.apache.any23.extractor.html
 
JsoupUtils() - Constructor for class org.apache.any23.extractor.html.JsoupUtils
 

L

LicenseExtractor - Class in org.apache.any23.extractor.html
Extractor for the rel-license microformat.
LicenseExtractor() - Constructor for class org.apache.any23.extractor.html.LicenseExtractor
 
LicenseExtractorFactory - Class in org.apache.any23.extractor.html
 
LicenseExtractorFactory() - Constructor for class org.apache.any23.extractor.html.LicenseExtractorFactory
 
Link - org.apache.any23.extractor.microdata.ItemPropValue.Type
 
listFilesRecursively(File, FilenameFilter) - Static method in class org.apache.any23.util.FileUtils
Lists the content of a dir applying the specified filter.
literal(boolean) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(byte) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(double) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(float) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(int) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(long) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(short) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(String, String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
literal(String, IRI) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Literal.
LITERAL - org.apache.any23.extractor.xpath.TemplateObject.Type
 
LocalCopyFactory - Interface in org.apache.any23.source
A factory that creates local copies of DocumentSources.
logger - Static variable in class org.apache.any23.Any23
 
LoggingTripleHandler - Class in org.apache.any23.writer
Triple handler decorator useful for logging purposes.
LoggingTripleHandler(TripleHandler, PrintWriter) - Constructor for class org.apache.any23.writer.LoggingTripleHandler
 
LogUtils - Class in org.apache.any23.util
Deprecated.
since 2.3. JUL is no longer used in this project.
LogUtils() - Constructor for class org.apache.any23.util.LogUtils
Deprecated.
 

M

makeIRI() - Static method in class org.apache.any23.rdf.RDFUtils
 
makeIRI(String) - Static method in class org.apache.any23.rdf.RDFUtils
Convert string to either IRI or Literal.
makeIRI(String, IRI) - Static method in class org.apache.any23.rdf.RDFUtils
makeIRI(String, IRI, boolean) - Static method in class org.apache.any23.rdf.RDFUtils
Creates implementation of Resource from given arguments: type and docIRI.
makeIRI(IRI) - Static method in class org.apache.any23.rdf.RDFUtils
ManchesterSyntaxExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor handling Manchester Syntax format.
ManchesterSyntaxExtractor() - Constructor for class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractor
 
ManchesterSyntaxExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractor
 
ManchesterSyntaxExtractorFactory - Class in org.apache.any23.extractor.rdf
 
ManchesterSyntaxExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
 
mark(int) - Method in class org.apache.any23.util.ReaderInputStream
Marks the read limit of the StringReader.
markSupported() - Method in class org.apache.any23.util.ReaderInputStream
 
MathUtils - Class in org.apache.any23.util
Mathematical utility functions.
md5(String) - Static method in class org.apache.any23.util.MathUtils
Create a MD5 weak hash for a given string.
MemCopyFactory - Class in org.apache.any23.source
Creates local copies of DocumentSource by reading them into an in-memory buffer.
MemCopyFactory() - Constructor for class org.apache.any23.source.MemCopyFactory
 
MetaNameMisuseFix - Class in org.apache.any23.validator.rule
Fixes the misuse of the meta name attribute.
MetaNameMisuseFix() - Constructor for class org.apache.any23.validator.rule.MetaNameMisuseFix
 
MetaNameMisuseRule - Class in org.apache.any23.validator.rule
Checks whether the meta attribute name is used to contain a property.
MetaNameMisuseRule() - Constructor for class org.apache.any23.validator.rule.MetaNameMisuseRule
 
MicrodataExtractor - Class in org.apache.any23.extractor.microdata
Default implementation of Microdata extractor, based on Extractor.TagSoupDOMExtractor.
MicrodataExtractor() - Constructor for class org.apache.any23.extractor.microdata.MicrodataExtractor
 
MicrodataExtractorFactory - Class in org.apache.any23.extractor.microdata
 
MicrodataExtractorFactory() - Constructor for class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
 
MicrodataParser - Class in org.apache.any23.extractor.microdata
This class provides utility methods for handling Microdata nodes contained within a DOM document.
MicrodataParser(Document) - Constructor for class org.apache.any23.extractor.microdata.MicrodataParser
 
MicrodataParserException - Exception in org.apache.any23.extractor.microdata
Defines an exception occurring while parsing Microdata.
MicrodataParserException(String, Throwable, Node) - Constructor for exception org.apache.any23.extractor.microdata.MicrodataParserException
 
MicrodataParserException(String, Node) - Constructor for exception org.apache.any23.extractor.microdata.MicrodataParserException
 
MicrodataParserReport - Class in org.apache.any23.extractor.microdata
This class describes the report of the MicrodataParser.
MicrodataParserReport(ItemScope[]) - Constructor for class org.apache.any23.extractor.microdata.MicrodataParserReport
 
MicrodataParserReport(ItemScope[], MicrodataParserException[]) - Constructor for class org.apache.any23.extractor.microdata.MicrodataParserReport
 
MicroformatExtractor - Class in org.apache.any23.extractor.html
The abstract base class for any Microformat specification extractor.
MicroformatExtractor() - Constructor for class org.apache.any23.extractor.html.MicroformatExtractor
 
Microformats2Prefixes - Class in org.apache.any23.extractor.html.microformats2
 
Microformats2Prefixes() - Constructor for class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
 
MIME_TYPE - Static variable in class org.apache.any23.writer.JSONLDWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.JSONWriterFactory
Deprecated.
 
MIME_TYPE - Static variable in class org.apache.any23.writer.NQuadsWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.NTriplesWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.RDFXMLWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.TriXWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.TurtleWriterFactory
 
MIME_TYPE - Static variable in class org.apache.any23.writer.URIListWriterFactory
 
MissingItemscopeAttributeValueFix - Class in org.apache.any23.validator.rule
Fix for the issue described within MissingItemscopeAttributeValueRule
MissingItemscopeAttributeValueFix() - Constructor for class org.apache.any23.validator.rule.MissingItemscopeAttributeValueFix
Default constructor
MissingItemscopeAttributeValueRule - Class in org.apache.any23.validator.rule
This fixes missing attribute values for the 'itemscope' attribute Typically when such a snippet of XHTML is fed through the RDFa11Extractor, and subsequently to Sesame's SesameRDFaParser, it will result in the following behavior.
MissingItemscopeAttributeValueRule() - Constructor for class org.apache.any23.validator.rule.MissingItemscopeAttributeValueRule
Default constructor
MissingOpenGraphNamespaceRule - Class in org.apache.any23.validator.rule
This rule detects the issue of missing Open Graph namespace.
MissingOpenGraphNamespaceRule() - Constructor for class org.apache.any23.validator.rule.MissingOpenGraphNamespaceRule
 
ModelHolder(Value, Model) - Constructor for class org.apache.any23.extractor.yaml.ElementsProcessor.ModelHolder
 
multiply(char, int) - Static method in class org.apache.any23.util.StringUtils
Builds a string composed of the given char c n times.
mv(File, File) - Static method in class org.apache.any23.util.FileUtils
Moves a target file to a new dest location.

N

NAME - Static variable in class org.apache.any23.extractor.csv.CSVExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.AdrExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.GeoExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.HCalendarExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.HCardExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.HeadLinkExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.HListingExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.HRecipeExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.HResumeExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.HReviewExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.ICBMExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.LicenseExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.SpeciesExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.TitleExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.html.XFNExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdf.TriXExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdf.TurtleExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.xpath.XPathExtractorFactory
 
NAME - Static variable in class org.apache.any23.extractor.yaml.YAMLExtractorFactory
 
Nested - org.apache.any23.extractor.microdata.ItemPropValue.Type
 
NICKNAME - Static variable in class org.apache.any23.extractor.html.HCardName
 
NODES_WITH_INVALID_ABOUT - Static variable in class org.apache.any23.validator.rule.AboutNotURIRule
 
nodeToInputStream(Node) - Static method in class org.apache.any23.extractor.html.DomUtils
Convert a w3c dom node to a InputStream
notifyIssue(IssueReport.IssueLevel, String, long, long) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
NQuadsExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor handling N-Quads format.
NQuadsExtractor() - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractor
 
NQuadsExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractor
 
NQuadsExtractorFactory - Class in org.apache.any23.extractor.rdf
 
NQuadsExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
 
NQuadsWriter - Class in org.apache.any23.writer
Implementation of an N-Quads TripleWriter.
NQuadsWriter(OutputStream) - Constructor for class org.apache.any23.writer.NQuadsWriter
 
NQuadsWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.NQuadsWriter
 
NQuadsWriterFactory - Class in org.apache.any23.writer
 
NQuadsWriterFactory() - Constructor for class org.apache.any23.writer.NQuadsWriterFactory
 
NTriplesExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor handling NTriples NTriples format.
NTriplesExtractor() - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractor
Default constructor, with no verification of data types and no stop at first error.
NTriplesExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractor
 
NTriplesExtractorFactory - Class in org.apache.any23.extractor.rdf
 
NTriplesExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
 
NTriplesWriter - Class in org.apache.any23.writer
Implementation of an N-Triples TripleWriter.
NTriplesWriter(OutputStream) - Constructor for class org.apache.any23.writer.NTriplesWriter
 
NTriplesWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.NTriplesWriter
 
NTriplesWriterFactory - Class in org.apache.any23.writer
 
NTriplesWriterFactory() - Constructor for class org.apache.any23.writer.NTriplesWriterFactory
 

O

openContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
openContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
openContext(ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
 
openContext(ExtractionContext) - Method in class org.apache.any23.writer.TripleWriterHandler
Writers may override this method to handle an "openContext" extraction event.
OPENGRAPH_PROTOCOL_NS - Static variable in class org.apache.any23.validator.rule.OpenGraphNamespaceFix
 
OpenGraphNamespaceFix - Class in org.apache.any23.validator.rule
This fixes the missing Open Graph protocol.
OpenGraphNamespaceFix() - Constructor for class org.apache.any23.validator.rule.OpenGraphNamespaceFix
 
openInputStream() - Method in class org.apache.any23.source.ByteArrayDocumentSource
 
openInputStream() - Method in class org.apache.any23.source.FileDocumentSource
 
openInputStream() - Method in class org.apache.any23.source.HTTPDocumentSource
 
openInputStream() - Method in class org.apache.any23.source.StringDocumentSource
 
openInputStream(String) - Method in class org.apache.any23.http.DefaultHTTPClient
Opens an InputStream from a given IRI.
openInputStream(String) - Method in interface org.apache.any23.http.HTTPClient
Opens the input stream for the given target IRI.
openSubResult(ExtractionContext) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
openSubResult(ExtractionContext) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
org.apache.any23 - package org.apache.any23
The Any23 root package defines the Any23 library facade and auxiliary classes.
org.apache.any23.extractor - package org.apache.any23.extractor
This package contains classes and interfaces modeling the Extractor API.
org.apache.any23.extractor.calendar - package org.apache.any23.extractor.calendar
This package contains extractors for various calendar serialization formats.
org.apache.any23.extractor.csv - package org.apache.any23.extractor.csv
This package contains the definition of a specific Extractor for CSV files as specified on RFC 4180.
org.apache.any23.extractor.html - package org.apache.any23.extractor.html
All the various Extractor needed to distill RDF from Microformats in HTML pages are contained in this package.
org.apache.any23.extractor.html.annotations - package org.apache.any23.extractor.html.annotations
This package contains the annotations needed to describe the single nesting relations among different Microformats.
org.apache.any23.extractor.html.microformats2 - package org.apache.any23.extractor.html.microformats2
All the various Extractor needed to distill RDF from Microformats in HTML pages are contained in this package.
org.apache.any23.extractor.html.microformats2.annotations - package org.apache.any23.extractor.html.microformats2.annotations
This package contains the annotations needed to describe the single nesting relations among different Microformats.
org.apache.any23.extractor.microdata - package org.apache.any23.extractor.microdata
This package provides classes for dealing with Microdata extraction.
org.apache.any23.extractor.rdf - package org.apache.any23.extractor.rdf
This package contains the concrete implementations (and utilities classes) to perform the extraction from various RDF serialization formats.
org.apache.any23.extractor.rdfa - package org.apache.any23.extractor.rdfa
This package contains the definition of a specific Extractor for RDFa.
org.apache.any23.extractor.xpath - package org.apache.any23.extractor.xpath
This package provides an XPath extractor able to generate RDF statements from any HTML page on the basis of a set of specified rules and a list of statement templates.
org.apache.any23.extractor.yaml - package org.apache.any23.extractor.yaml
This package provides an YAML extractor able to generate RDF statements from any YAML files.
org.apache.any23.filter - package org.apache.any23.filter
This package defines specific TripleHandlers able to filter document and triples that don't respect particular conditions.
org.apache.any23.http - package org.apache.any23.http
This package contains classes and interfaces to implement a small multi-threaded HTTP Client used to fetch data sources.
org.apache.any23.rdf - package org.apache.any23.rdf
This package provides some utility classes to manage RDF models and prefixes.
org.apache.any23.source - package org.apache.any23.source
Package containing some auxiliary classes to handle different types of InputStream sources.
org.apache.any23.util - package org.apache.any23.util
This package collects a set of internal library utility classes.
org.apache.any23.validator - package org.apache.any23.validator
This package defines a simple rule based validation framework able to detect (Rule) and fix (Fix) common mistakes that can be present within an HTML DOM model and that can prevent the extraction of metadata.
org.apache.any23.validator.rule - package org.apache.any23.validator.rule
A set of predefined validation Rules and associated Fixes.
org.apache.any23.vocab - package org.apache.any23.vocab
 
org.apache.any23.writer - package org.apache.any23.writer
This package collects a set of TripleHandler decorators and specific RDF format TripleWriter implementations.

P

parse(InputStream, String, String) - Static method in class org.apache.any23.extractor.html.JsoupUtils
 
parseDateTime(String) - Static method in class org.apache.any23.extractor.microdata.ItemPropValue
 
parseRDF(String) - Static method in class org.apache.any23.rdf.RDFUtils
Parses the content of the resource file guessing the content format from the extension.
parseRDF(RDFFormat, InputStream) - Static method in class org.apache.any23.rdf.RDFUtils
Parses the content of is input stream with the specified parser p using '' as base org.eclipse.rdf4j.model.IRI.
parseRDF(RDFFormat, InputStream, String) - Static method in class org.apache.any23.rdf.RDFUtils
Parses the content of is input stream with the specified parser p using baseIRI.
parseRDF(RDFFormat, String) - Static method in class org.apache.any23.rdf.RDFUtils
Parses the content of in string with the specified parser p using '' as base org.eclipse.rdf4j.model.IRI.
Plain - org.apache.any23.extractor.microdata.ItemPropValue.Type
 
PopularPrefixes - Class in org.apache.any23.rdf
This class act as a container for various well-known and adopted RDF Vocabulary prefixes.
PopularPrefixes() - Constructor for class org.apache.any23.rdf.PopularPrefixes
 
PREFIX_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
prefixes - Static variable in class org.apache.any23.extractor.yaml.YAMLExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.csv.CSVExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.AdrExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.GeoExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.HCalendarExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.HCardExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.HeadLinkExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.HListingExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.HRecipeExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.HResumeExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.HReviewAggregateExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.HReviewExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.HTMLMetaExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.ICBMExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.LicenseExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HAdrExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HCardExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HEntryExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HEventExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HGeoExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HItemExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HProductExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HRecipeExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.microformats2.HResumeExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.SpeciesExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.TitleExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.html.XFNExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.microdata.MicrodataExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdf.FunctionalSyntaxExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdf.JSONLDExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdf.ManchesterSyntaxExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdf.NQuadsExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdf.NTriplesExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdf.TriXExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdf.TurtleExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
 
PREFIXES - Static variable in class org.apache.any23.extractor.xpath.XPathExtractorFactory
 
PRETTY_PRINT - Static variable in class org.apache.any23.writer.WriterSettings
Directive to writer that output should be printed in a way to maximize human readability.
PRINT_ASCII - Static variable in class org.apache.any23.writer.WriterSettings
Directive to writer that at least the non-ASCII characters should be escaped.
printOut(ExtractionResult, Map<String, String>) - Method in class org.apache.any23.extractor.xpath.QuadTemplate
Prints out this quad template in the given ExtractionResult, using the passed variableAssignment to expand variables.
printReport() - Method in class org.apache.any23.writer.ReportingTripleHandler
 
printReport(PrintStream) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
process(Document, ExtractionResult) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
process(Document, ExtractionResult) - Method in interface org.apache.any23.extractor.xpath.XPathExtractionRule
Processes this extraction rule on the given document.
processDocument(URL, Document, ExtractionResult) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
processList(IRI, List<Object>) - Method in class org.apache.any23.extractor.yaml.ElementsProcessor
 
processMap(IRI, Map<String, Object>, Value) - Method in class org.apache.any23.extractor.yaml.ElementsProcessor
This method processes a map with non bnode root.
PROFILE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
PROPERTY_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
PROPERTY_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
 
PropertyPath(String[], Resource, Resource, BNode, Class<? extends MicroformatExtractor>) - Constructor for class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
putData(String, Object) - Method in class org.apache.any23.validator.DefaultRuleContext
 
putData(String, T) - Method in interface org.apache.any23.validator.RuleContext
Puts a data within the context.

Q

quad(String, String, String, String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a statement of type: toValue(s), toValue(p), toValue(o), toValue(g)
quad(Resource, IRI, Value, Resource) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Statement.
QuadTemplate - Class in org.apache.any23.extractor.xpath
This class models a NQuads template, that is a quadruple in which any component can be a variable.
QuadTemplate(TemplateSubject, TemplatePredicate, TemplateObject) - Constructor for class org.apache.any23.extractor.xpath.QuadTemplate
Constructor for template with no graph.
QuadTemplate(TemplateSubject, TemplatePredicate, TemplateObject, TemplateGraph) - Constructor for class org.apache.any23.extractor.xpath.QuadTemplate
Constructor.

R

RDFa11Extractor - Class in org.apache.any23.extractor.rdfa
Extractor implementation for RDFa 1.1 specification.
RDFa11Extractor() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Extractor
 
RDFa11Extractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Extractor
Deprecated.
since 2.4. This extractor has never supported these settings. Use RDFa11Extractor() instead.
RDFa11ExtractorFactory - Class in org.apache.any23.extractor.rdfa
 
RDFa11ExtractorFactory() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11ExtractorFactory
 
RDFa11Parser - Class in org.apache.any23.extractor.rdfa
RDFa11Parser() - Constructor for class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
RDFa11ParserException - Exception in org.apache.any23.extractor.rdfa
Deprecated.
since 2.3 the Rio implementations are used to parse RDFa.
RDFa11ParserException(String) - Constructor for exception org.apache.any23.extractor.rdfa.RDFa11ParserException
Deprecated.
 
RDFa11ParserException(String, Throwable) - Constructor for exception org.apache.any23.extractor.rdfa.RDFa11ParserException
Deprecated.
 
RDFaExtractor - Class in org.apache.any23.extractor.rdfa
Extractor implementation for RDFa 1.0 specification.
RDFaExtractor() - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractor
 
RDFaExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractor
Deprecated.
since 2.4. This extractor has never supported these settings. Use RDFaExtractor() instead.
RDFaExtractorFactory - Class in org.apache.any23.extractor.rdfa
 
RDFaExtractorFactory() - Constructor for class org.apache.any23.extractor.rdfa.RDFaExtractorFactory
 
RDFHandlerAdapter - Class in org.apache.any23.extractor.rdf
An RDFHandler that relays statements and prefix definitions to an ExtractionResult.
RDFHandlerAdapter(ExtractionResult) - Constructor for class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
RDFParserFactory - Class in org.apache.any23.extractor.rdf
This factory provides a common logic for creating and configuring correctly any RDF parser used within the library.
RDFParserFactory() - Constructor for class org.apache.any23.extractor.rdf.RDFParserFactory
 
RDFSchemaUtils - Class in org.apache.any23.vocab
This class provides a set of methods for generating RDF Schema.
RDFUtils - Class in org.apache.any23.rdf
Basic class providing a set of utility methods when dealing with RDF.
RDFWriterTripleHandler - Class in org.apache.any23.writer
A TripleHandler that writes triples to a Sesame RDFWriter, eg for serialization using one of Sesame's writers.
RDFXMLExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor able to perform the extraction on RDF/XML documents.
RDFXMLExtractor() - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractor
Default constructor, with no verification of data types and not stop at first error.
RDFXMLExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractor
Constructor, allows to specify the validation and error handling policies.
RDFXMLExtractorFactory - Class in org.apache.any23.extractor.rdf
 
RDFXMLExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.RDFXMLExtractorFactory
 
RDFXMLWriter - Class in org.apache.any23.writer
RDF/XML TripleWriter implementation.
RDFXMLWriter(OutputStream) - Constructor for class org.apache.any23.writer.RDFXMLWriter
 
RDFXMLWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.RDFXMLWriter
 
RDFXMLWriterFactory - Class in org.apache.any23.writer
 
RDFXMLWriterFactory() - Constructor for class org.apache.any23.writer.RDFXMLWriterFactory
 
read() - Method in class org.apache.any23.extractor.html.SpanCloserInputStream
 
read() - Method in class org.apache.any23.util.ReaderInputStream
Reads from the Reader, returning the same value.
read(byte[], int, int) - Method in class org.apache.any23.util.ReaderInputStream
Reads from the Reader into a byte array
readAttribute(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
Read an attribute avoiding NullPointerExceptions, if the attr is missing it just returns an empty string.
readAttribute(Node, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Reads the value of an attribute, returning the empty string if not present.
readAttribute(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Reads the value of the specified attribute, returning the defaultValue string if not present.
readAttributeWithPrefix(Node, String, String) - Static method in class org.apache.any23.extractor.html.DomUtils
Reads the value of the first attribute which name matches with the specified attributePrefix.
ReaderInputStream - Class in org.apache.any23.util
Adapts a Reader as an InputStream.
ReaderInputStream(Reader) - Constructor for class org.apache.any23.util.ReaderInputStream
Construct a ReaderInputStream for the specified Reader.
ReaderInputStream(Reader, String) - Constructor for class org.apache.any23.util.ReaderInputStream
Construct a ReaderInputStream for the specified Reader, with the specified encoding.
readFileContent(File) - Static method in class org.apache.any23.util.FileUtils
Returns the content of a file a single string.
readFileLines(File) - Static method in class org.apache.any23.util.FileUtils
Returns all the lines of a file.
readNodeContent(Node, boolean) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Reads the text content of the given node and returns it.
readResourceContent(Class, String) - Static method in class org.apache.any23.util.FileUtils
Reads a resource file and returns the content as a string.
readResourceContent(String) - Static method in class org.apache.any23.util.FileUtils
Reads a resource file and returns the content as a string.
readStream() - Method in class org.apache.any23.source.FileDocumentSource
 
readTextField(Node) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Reads a text field from the given node adding the content to the given res list.
readUrlField(List<HTMLDocument.TextField>, Node) - Static method in class org.apache.any23.extractor.html.HTMLDocument
Reads an URL field from the given node adding the content to the given res list.
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
 
receiveNamespace(String, String, ExtractionContext) - Method in class org.apache.any23.writer.TripleWriterHandler
Writers may override this method to handle a "receiveNamespace" extraction event.
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.CountingTripleHandler
 
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.RepositoryWriter
 
receiveTriple(Resource, IRI, Value, IRI, ExtractionContext) - Method in class org.apache.any23.writer.TripleWriterHandler
Writers may override this method to handle a "receiveTriple" extraction event.
register(ExtractorFactory<?>) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
Registers an ExtractorFactory.
REL_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
remove(QuadTemplate) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
Removes a quad template from the rule.
remove(QuadTemplate) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
remove(Variable) - Method in interface org.apache.any23.extractor.xpath.TemplateXPathExtractionRule
Removes a variable from the template.
remove(Variable) - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
remove(XPathExtractionRule) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 
removeData(String) - Method in class org.apache.any23.validator.DefaultRuleContext
 
removeData(String) - Method in interface org.apache.any23.validator.RuleContext
Removes a data from the context.
removeRule(Class<? extends Rule>) - Method in class org.apache.any23.validator.DefaultValidator
 
removeRule(Class<? extends Rule>) - Method in interface org.apache.any23.validator.Validator
Allows to remove a rule from the validator and all the related Fixes.
report() - Method in class org.apache.any23.writer.BenchmarkTripleHandler
Returns the report as a human readable string.
reportFixError(Fix, Exception, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
reportFixError(Fix, Exception, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Reports an error occurred while executing a Fix.
ReportingTripleHandler - Class in org.apache.any23.writer
A TripleHandler that collects various information about the extraction process, such as the extractors used and the total number of triples.
ReportingTripleHandler(TripleHandler) - Constructor for class org.apache.any23.writer.ReportingTripleHandler
 
reportIssue(ValidationReport.IssueLevel, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
reportIssue(ValidationReport.IssueLevel, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Reports a detected issue.
reportIssue(ValidationReport.IssueLevel, String, Node) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
reportIssue(ValidationReport.IssueLevel, String, Node) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Reports an issue detected on a specified node.
reportRuleError(Rule, Exception, String) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
reportRuleError(Rule, Exception, String) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Reports an error occurred while executing a Rule.
RepositoryWriter - Class in org.apache.any23.writer
An RDF4J repository triple writer.
RepositoryWriter(RepositoryConnection) - Constructor for class org.apache.any23.writer.RepositoryWriter
 
RepositoryWriter(RepositoryConnection, Resource) - Constructor for class org.apache.any23.writer.RepositoryWriter
 
reset() - Method in class org.apache.any23.extractor.html.HCardName
Resets the content of the HName fields.
reset() - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
Resets the parser to the original state.
reset() - Method in class org.apache.any23.util.ReaderInputStream
Resets the StringReader.
reset() - Method in class org.apache.any23.writer.CountingTripleHandler
 
resetExtractor() - Method in class org.apache.any23.extractor.html.AdrExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.EntityBasedMicroformatExtractor
Resets the internal status of the extractor to prepare it to a new extraction section.
resetExtractor() - Method in class org.apache.any23.extractor.html.GeoExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HCardExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HListingExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HRecipeExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HResumeExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HReviewAggregateExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.HReviewExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HAdrExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HCardExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HEntryExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HEventExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HGeoExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HItemExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HProductExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HRecipeExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.microformats2.HResumeExtractor
 
resetExtractor() - Method in class org.apache.any23.extractor.html.SpeciesExtractor
Resets the internal status of the extractor to prepare it to a new extraction section.
resolveCIRIeOrIRIList(Node, String, boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
Resolves a whitelist separated list of CURIE or URI.
resolveCURIEOrIRI(String, boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
Resolves a CURIE or IRI string.
resolveIRI(String) - Method in class org.apache.any23.extractor.html.HTMLDocument
 
resolveIRI(String) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
Resolves a IRI string as IRI.
resolveIRI(String, URI) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
Fixes typical errors in IRIs, and resolves relative IRIs against a base IRI.
RESOURCE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
ResourceRoot(String[], Resource, Class<? extends MicroformatExtractor>) - Constructor for class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
REV_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
Rule - Interface in org.apache.any23.validator
Models a rule for an issue detection.
RuleActivation(Rule) - Constructor for class org.apache.any23.validator.ValidationReport.RuleActivation
 
RuleContext<T> - Interface in org.apache.any23.validator
This context is used to pass data from a Rule to a Fix.
RuleError(Rule, Exception, String) - Constructor for class org.apache.any23.validator.ValidationReport.RuleError
 
run() - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Triggers the execution of all the Extractor registered to this class using the default extraction parameters.
run(ExtractionParameters) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Triggers the execution of all the Extractor registered to this class using the specified extraction parameters.
run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.csv.CSVExtractor
run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
 
run(ExtractionParameters, ExtractionContext, InputStream, ExtractionResult) - Method in class org.apache.any23.extractor.yaml.YAMLExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.EmbeddedJSONLDExtractor
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.HeadLinkExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.HTMLMetaExtractor
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.ICBMExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.LicenseExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.TitleExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.TurtleHTMLExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.html.XFNExtractor
 
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.microdata.MicrodataExtractor
This extraction performs the Microdata to RDF conversion algorithm.
run(ExtractionParameters, ExtractionContext, Document, ExtractionResult) - Method in class org.apache.any23.extractor.xpath.XPathExtractor
 

S

SerializationException - Exception in org.apache.any23.validator
Defines an error raised from the ValidationReportSerializer.
SerializationException(String) - Constructor for exception org.apache.any23.validator.SerializationException
 
SerializationException(String, Throwable) - Constructor for exception org.apache.any23.validator.SerializationException
 
serialize(ValidationReport, OutputStream) - Method in interface org.apache.any23.validator.ValidationReportSerializer
Serializes the validation report on the given output stream.
serialize(ValidationReport, OutputStream) - Method in class org.apache.any23.validator.XMLValidationReportSerializer
 
serializeGetterValue(Object, Method, PrintStream) - Method in class org.apache.any23.validator.XMLValidationReportSerializer
 
serializeToXML(Node, boolean) - Static method in class org.apache.any23.extractor.html.DomUtils
Given a DOM Node produces the XML serialization omitting the XML declaration.
serializeVocabularies(RDFFormat, PrintStream) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serializes all the vocabularies to NQuads over the given output stream.
serializeVocabulary(Vocabulary, RDFFormat) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serialized the given vocabulary to NQuads and return them as string.
serializeVocabulary(Vocabulary, RDFFormat, boolean, PrintStream) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serializes the given vocabulary to NQuads over the given output stream.
serializeVocabulary(Vocabulary, RDFWriter) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serializes the given vocabulary to triples over the given writer.
serializeVocabulary(IRI, IRI[], IRI[], Map<IRI, String>, RDFWriter) - Static method in class org.apache.any23.vocab.RDFSchemaUtils
Serializes a vocabulary composed of the given namespace, resources and properties.
setAnnotated(boolean) - Method in class org.apache.any23.writer.JSONWriter
Deprecated.
 
setAnnotated(boolean) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
Sets the annotation flag.
setAnnotated(boolean) - Method in class org.apache.any23.writer.URIListWriter
 
setCacheFactory(LocalCopyFactory) - Method in class org.apache.any23.Any23
Allows to set a LocalCopyFactory instance.
setContentLength(long) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
setContentLength(long) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
setContentLength(long) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
setContentLength(long) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.CountingTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
setContentLength(long) - Method in class org.apache.any23.writer.RepositoryWriter
 
setContentLength(long) - Method in class org.apache.any23.writer.TripleWriterHandler
Writers may override this method to handle a "setContentLength" extraction event.
setCurrentExtractionResult(ExtractionResult) - Method in class org.apache.any23.extractor.html.MicroformatExtractor
 
setDefaultLogging() - Static method in class org.apache.any23.util.LogUtils
Deprecated.
 
setErrorMode(MicrodataParser.ErrorMode) - Method in class org.apache.any23.extractor.microdata.MicrodataParser
 
setErrorNode(Node) - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
setField(String, HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
 
setFullName(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
 
setHTTPClient(HTTPClient) - Method in class org.apache.any23.Any23
Allows to set the HTTPClient implementation used to retrieve contents.
setHTTPUserAgent(String) - Method in class org.apache.any23.Any23
Sets the HTTP Header User Agent, see RFC 2616-14.43.
setIssueReport(IssueReport) - Method in class org.apache.any23.rdf.Any23ValueFactoryWrapper
 
setLocalCopyFactory(LocalCopyFactory) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Sets the internal factory for generating the document local copy, if null the MemCopyFactory will be used.
setMIMETypeDetector(MIMETypeDetector) - Method in class org.apache.any23.Any23
Allows to set an instance of MIMETypeDetector.
setMIMETypeDetector(MIMETypeDetector) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Sets the internal mime type detector, if null mimetype detection will be skipped and all extractors will be activated.
setOrganization(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
 
setOrganizationUnit(HTMLDocument.TextField) - Method in class org.apache.any23.extractor.html.HCardName
 
setParserEncoding(String) - Method in class org.apache.any23.extractor.SingleDocumentExtraction
Sets the document parser encoding.
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.csv.CSVExtractor
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
Deprecated.
since 2.4. This extractor has never supported this setting. Do not use.
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
Deprecated.
since 2.4. This extractor has never supported this setting. Do not use.
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
Deprecated.
since 2.4. This extractor has never supported this setting. Do not use.
setStopAtFirstError(boolean) - Method in class org.apache.any23.extractor.yaml.YAMLExtractor
 
setVerboseLogging() - Static method in class org.apache.any23.util.LogUtils
Deprecated.
 
setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdf.BaseRDFExtractor
 
setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdf.JSONLDExtractor
Deprecated.
since 2.4. This extractor has never supported this setting. Do not use.
setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFa11Extractor
Deprecated.
since 2.4. This extractor has never supported this setting. Do not use.
setVerifyDataType(boolean) - Method in class org.apache.any23.extractor.rdfa.RDFaExtractor
Deprecated.
since 2.4. This extractor has never supported this setting. Do not use.
SimpleExtractorFactory<T extends org.apache.any23.extractor.Extractor<?>> - Class in org.apache.any23.extractor
This class is a simple and default-like implementation of ExtractorFactory.
SimpleExtractorFactory(String, Prefixes) - Constructor for class org.apache.any23.extractor.SimpleExtractorFactory
 
SimpleExtractorFactory(String, Prefixes, Collection<String>, String) - Constructor for class org.apache.any23.extractor.SimpleExtractorFactory
 
SingleDocumentExtraction - Class in org.apache.any23.extractor
This class acts as a facade where all extractors (for a given MIMEType) can be called on a single document.
SingleDocumentExtraction(Configuration, DocumentSource, ExtractorFactory<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
Builds an extractor by the specification of document source, extractors factory and output triple handler.
SingleDocumentExtraction(Configuration, DocumentSource, ExtractorGroup, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
Builds an extractor by the specification of document source, list of extractors and output triple handler.
SingleDocumentExtraction(DocumentSource, ExtractorFactory<?>, TripleHandler) - Constructor for class org.apache.any23.extractor.SingleDocumentExtraction
Builds an extractor by the specification of document source, extractors factory and output triple handler, using the DefaultConfiguration.
SingleDocumentExtractionReport - Class in org.apache.any23.extractor
This class provides the report for a SingleDocumentExtraction run.
SingleDocumentExtractionReport(ValidationReport, Map<String, Collection<IssueReport.Issue>>) - Constructor for class org.apache.any23.extractor.SingleDocumentExtractionReport
 
singleton() - Static method in class org.apache.any23.http.DefaultHTTPClientConfiguration
 
source() - Method in class org.apache.any23.extractor.html.HTMLDocument.TextField
 
SPACE_SEPARATOR - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
 
SpanCloserInputStream - Class in org.apache.any23.extractor.html
Extension of InputStream meant to detect and replace any occurrence of inline span:
SpanCloserInputStream(InputStream) - Constructor for class org.apache.any23.extractor.html.SpanCloserInputStream
 
SpeciesExtractor - Class in org.apache.any23.extractor.html
Extractor able to extract the Species Microformat.
SpeciesExtractor() - Constructor for class org.apache.any23.extractor.html.SpeciesExtractor
 
SpeciesExtractorFactory - Class in org.apache.any23.extractor.html
 
SpeciesExtractorFactory() - Constructor for class org.apache.any23.extractor.html.SpeciesExtractorFactory
 
SRC_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
SRC_TAGS - Static variable in class org.apache.any23.extractor.microdata.MicrodataParser
List of tags providing the src property.
startDocument(IRI) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
startDocument(IRI) - Method in class org.apache.any23.filter.IgnoreAccidentalRDFa
 
startDocument(IRI) - Method in class org.apache.any23.filter.IgnoreTitlesOfEmptyDocuments
 
startDocument(IRI) - Method in class org.apache.any23.writer.BenchmarkTripleHandler
 
startDocument(IRI) - Method in class org.apache.any23.writer.CompositeTripleHandler
 
startDocument(IRI) - Method in class org.apache.any23.writer.CountingTripleHandler
 
startDocument(IRI) - Method in class org.apache.any23.writer.JSONWriter
Deprecated.
 
startDocument(IRI) - Method in class org.apache.any23.writer.LoggingTripleHandler
 
startDocument(IRI) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
startDocument(IRI) - Method in class org.apache.any23.writer.ReportingTripleHandler
 
startDocument(IRI) - Method in class org.apache.any23.writer.RepositoryWriter
 
startDocument(IRI) - Method in class org.apache.any23.writer.TripleWriterHandler
Writers may override this method to handle a "startDocument" extraction event.
startRDF() - Method in class org.apache.any23.extractor.rdf.RDFHandlerAdapter
 
StreamUtils - Class in org.apache.any23.util
Contains general utility functions for handling streams.
StringDocumentSource - Class in org.apache.any23.source
String implementation of DocumentSource.
StringDocumentSource(String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
 
StringDocumentSource(String, String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
 
StringDocumentSource(String, String, String, String) - Constructor for class org.apache.any23.source.StringDocumentSource
 
StringUtils - Class in org.apache.any23.util
This class provides a set of string utility methods.
SUBJECT_ATTRIBUTES - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 

T

TagSoupExtractionResult - Interface in org.apache.any23.extractor
This interface models a specific ExtractionResult able to collect property roots generated by HTML Microformat extractions.
TagSoupExtractionResult.PropertyPath - Class in org.apache.any23.extractor
Defines a property path object.
TagSoupExtractionResult.ResourceRoot - Class in org.apache.any23.extractor
Defines a property root object.
TagSoupParser - Class in org.apache.any23.extractor.html
Parses an InputStream into an HTML DOM tree.
TagSoupParser(InputStream, String) - Constructor for class org.apache.any23.extractor.html.TagSoupParser
 
TagSoupParser(InputStream, String, String) - Constructor for class org.apache.any23.extractor.html.TagSoupParser
 
TagSoupParser.ElementLocation - Class in org.apache.any23.extractor.html
Describes a DOM Element location.
TemplateGraph - Class in org.apache.any23.extractor.xpath
Represents an Quad graph IRI template.
TemplateGraph(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateGraph
Constructor.
TemplateObject - Class in org.apache.any23.extractor.xpath
Represents a Quad object template.
TemplateObject(TemplateObject.Type, String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateObject
Constructor.
TemplateObject.Type - Enum in org.apache.any23.extractor.xpath
Supported object types.
TemplatePredicate - Class in org.apache.any23.extractor.xpath
Represents a Quad predicate template.
TemplatePredicate(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplatePredicate
Constructor.
TemplateSubject - Class in org.apache.any23.extractor.xpath
Represents a Quad subject template.
TemplateSubject(TemplateSubject.Type, String, boolean) - Constructor for class org.apache.any23.extractor.xpath.TemplateSubject
Constructor.
TemplateSubject.Type - Enum in org.apache.any23.extractor.xpath
Supported subject types.
TemplateXPathExtractionRule - Interface in org.apache.any23.extractor.xpath
Defines an XPathExtractionRule able to expand a QuadTemplate set based on a given template.
TemplateXPathExtractionRuleImpl - Class in org.apache.any23.extractor.xpath
Default implementation of XPathExtractionRule.
TemplateXPathExtractionRuleImpl(String, String) - Constructor for class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
Term<T extends org.eclipse.rdf4j.model.Value> - Class in org.apache.any23.extractor.xpath
Represents a generic template term.
Term(String, boolean) - Constructor for class org.apache.any23.extractor.xpath.Term
Constructor.
TextField(String, Node) - Constructor for class org.apache.any23.extractor.html.HTMLDocument.TextField
 
TIME_PROPERTY_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
 
TitleExtractor - Class in org.apache.any23.extractor.html
Extracts the value of the <title> element of an HTML or XHTML page.
TitleExtractor() - Constructor for class org.apache.any23.extractor.html.TitleExtractor
 
TitleExtractorFactory - Class in org.apache.any23.extractor.html
 
TitleExtractorFactory() - Constructor for class org.apache.any23.extractor.html.TitleExtractorFactory
 
toByteArray(InputStream) - Static method in class org.apache.any23.source.MemCopyFactory
 
toJSON() - Method in class org.apache.any23.extractor.microdata.Item
 
toJSON() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
toJSON() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
toJSON() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
toJSON() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
toString() - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
toString() - Method in class org.apache.any23.extractor.microdata.ItemProp
 
toString() - Method in class org.apache.any23.extractor.microdata.ItemPropValue
 
toString() - Method in class org.apache.any23.extractor.microdata.ItemScope
 
toString() - Method in exception org.apache.any23.extractor.microdata.MicrodataParserException
 
toString() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.PropertyPath
 
toString() - Method in class org.apache.any23.extractor.TagSoupExtractionResult.ResourceRoot
 
toString() - Method in class org.apache.any23.extractor.xpath.QuadTemplate
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplateGraph
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplateObject
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplatePredicate
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplateSubject
 
toString() - Method in class org.apache.any23.extractor.xpath.TemplateXPathExtractionRuleImpl
 
toString() - Method in class org.apache.any23.extractor.xpath.Term
 
toString() - Method in class org.apache.any23.validator.DefaultValidationReport
 
toString() - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
toString() - Method in class org.apache.any23.validator.EmptyValidationReport
 
toString() - Method in class org.apache.any23.validator.ValidationReport.Error
 
toString() - Method in class org.apache.any23.validator.ValidationReport.FixError
 
toString() - Method in class org.apache.any23.validator.ValidationReport.Issue
 
toString() - Method in class org.apache.any23.validator.ValidationReport.RuleActivation
 
toString() - Method in class org.apache.any23.validator.ValidationReport.RuleError
 
toValue(String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Value.
toXSDDateTime(Date) - Static method in class org.apache.any23.rdf.RDFUtils
Prints a date to the XSD datetime format.
traceRuleActivation(Rule) - Method in class org.apache.any23.validator.DefaultValidationReportBuilder
 
traceRuleActivation(Rule) - Method in interface org.apache.any23.validator.ValidationReportBuilder
Traces that a rule has been applied.
triple(String, String, String) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a statement of type: toValue(s), toValue(p), toValue(o)
triple(Resource, IRI, Value) - Static method in class org.apache.any23.rdf.RDFUtils
Creates a Statement.
TripleWriterHandler - Class in org.apache.any23.writer
This class connects a TripleHandler to a TripleWriter by writing received data.
TripleWriterHandler() - Constructor for class org.apache.any23.writer.TripleWriterHandler
 
TriXExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor to perform extraction on TriX documents.
TriXExtractor() - Constructor for class org.apache.any23.extractor.rdf.TriXExtractor
Default constructor, with no verification of data types and not stop at first error.
TriXExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.TriXExtractor
Constructor, allows to specify the validation and error handling policies.
TriXExtractorFactory - Class in org.apache.any23.extractor.rdf
 
TriXExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.TriXExtractorFactory
 
TriXWriter - Class in org.apache.any23.writer
TriX TripleWriter implementation.
TriXWriter(OutputStream) - Constructor for class org.apache.any23.writer.TriXWriter
 
TriXWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.TriXWriter
 
TriXWriterFactory - Class in org.apache.any23.writer
 
TriXWriterFactory() - Constructor for class org.apache.any23.writer.TriXWriterFactory
 
TurtleExtractor - Class in org.apache.any23.extractor.rdf
Concrete implementation of Extractor.ContentExtractor able to perform the extraction on Turtle documents.
TurtleExtractor() - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractor
Default constructor, with no verification of data types and no stop at first error.
TurtleExtractor(boolean, boolean) - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractor
Constructor, allows to specify the validation and error handling policies.
TurtleExtractorFactory - Class in org.apache.any23.extractor.rdf
 
TurtleExtractorFactory() - Constructor for class org.apache.any23.extractor.rdf.TurtleExtractorFactory
 
TurtleHTMLExtractor - Class in org.apache.any23.extractor.html
Extractor for Turtle/N3 format embedded within HTML script tags.
TurtleHTMLExtractor() - Constructor for class org.apache.any23.extractor.html.TurtleHTMLExtractor
 
TurtleHTMLExtractorFactory - Class in org.apache.any23.extractor.html
 
TurtleHTMLExtractorFactory() - Constructor for class org.apache.any23.extractor.html.TurtleHTMLExtractorFactory
 
TurtleWriter - Class in org.apache.any23.writer
N3 notation TripleWriter implementation.
TurtleWriter(OutputStream) - Constructor for class org.apache.any23.writer.TurtleWriter
Constructor.
TurtleWriter(OutputStream, Settings) - Constructor for class org.apache.any23.writer.TurtleWriter
 
TurtleWriterFactory - Class in org.apache.any23.writer
 
TurtleWriterFactory() - Constructor for class org.apache.any23.writer.TurtleWriterFactory
 
TYPE_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
TYPEOF_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 

U

unblockContext(ExtractionContext) - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
unblockDocument() - Method in class org.apache.any23.filter.ExtractionContextBlocker
 
unregister(String) - Method in class org.apache.any23.extractor.ExtractorRegistryImpl
Unregisters the ExtractorFactory with the given name.
updateIRIMapping(Node) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
Updates the IRI mapping with the XMLNS attributes declared in the current node.
updateVocabulary(Node) - Method in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
Updates the vocabulary context with possible @vocab declarations.
URI - org.apache.any23.extractor.xpath.TemplateObject.Type
 
URI - org.apache.any23.extractor.xpath.TemplateSubject.Type
 
URIListWriter - Class in org.apache.any23.writer
This writer simply produces a list of unique IRI present in the subject or in the object of every single extracted RDF Statement.
URIListWriter(OutputStream) - Constructor for class org.apache.any23.writer.URIListWriter
 
URIListWriterFactory - Class in org.apache.any23.writer
 
URIListWriterFactory() - Constructor for class org.apache.any23.writer.URIListWriterFactory
 
URL_PROPERTY_PREFIX - Static variable in class org.apache.any23.extractor.html.microformats2.Microformats2Prefixes
 
URLUtils - Class in org.apache.any23.util
Contains general utility functions for handling URLs.

V

validate(URI, Document, boolean) - Method in class org.apache.any23.validator.DefaultValidator
 
validate(URI, Document, boolean) - Method in interface org.apache.any23.validator.Validator
Performs a validation - fixing of the provided document.
validate(DOMDocument, boolean) - Method in class org.apache.any23.validator.DefaultValidator
 
validate(DOMDocument, boolean) - Method in interface org.apache.any23.validator.Validator
Performs a validation - fixing of the provided document.
ValidationReport - Interface in org.apache.any23.validator
This class contains the report of a validation performed by the Validator class.
ValidationReport.Error - Class in org.apache.any23.validator
An error occurred while performing the validation process.
ValidationReport.FixError - Class in org.apache.any23.validator
An error occurred while executing a fix.
ValidationReport.Issue - Class in org.apache.any23.validator
An issue found during the validation process.
ValidationReport.IssueLevel - Enum in org.apache.any23.validator
Defines the different issue levels.
ValidationReport.RuleActivation - Class in org.apache.any23.validator
This class describes the activation of a rule.
ValidationReport.RuleError - Class in org.apache.any23.validator
An error occurred while executing a rule.
ValidationReportBuilder - Interface in org.apache.any23.validator
The report interface is used to generate diagnostics about validation.
ValidationReportSerializer - Interface in org.apache.any23.validator
Defines a serializer for validation reports.
Validator - Interface in org.apache.any23.validator
The validator class allows to perform validation - correction of related to HTML Document instances.
ValidatorException - Exception in org.apache.any23.validator
Models any exception raised by the Validator.
ValidatorException(String) - Constructor for exception org.apache.any23.validator.ValidatorException
 
ValidatorException(String, Throwable) - Constructor for exception org.apache.any23.validator.ValidatorException
 
value() - Method in class org.apache.any23.extractor.html.HTMLDocument.TextField
 
value() - Method in annotation type org.apache.any23.validator.XMLValidationReportSerializer.NodeName
 
valueFactory - Variable in class org.apache.any23.extractor.html.MicroformatExtractor
 
valueOf(String) - Static method in enum org.apache.any23.extractor.microdata.ItemPropValue.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.any23.extractor.xpath.TemplateObject.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.any23.extractor.xpath.TemplateSubject.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.any23.validator.ValidationReport.IssueLevel
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.apache.any23.extractor.microdata.ItemPropValue.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.any23.extractor.xpath.TemplateObject.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.any23.extractor.xpath.TemplateSubject.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.any23.validator.ValidationReport.IssueLevel
Returns an array containing the constants of this enum type, in the order they are declared.
Variable - Class in org.apache.any23.extractor.xpath
Represents a variable in TemplateXPathExtractionRule.
Variable(String, String) - Constructor for class org.apache.any23.extractor.xpath.Variable
 
VERSION - Static variable in class org.apache.any23.Any23
Any23 core library version.
vf - Variable in class org.apache.any23.extractor.yaml.ElementsProcessor
 
VOCAB_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 

W

WARNING - org.apache.any23.validator.ValidationReport.IssueLevel
 
writeNamespace(String, String) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
writeNamespace(String, String) - Method in class org.apache.any23.writer.JSONWriter
Deprecated.
 
writeNamespace(String, String) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
writeNamespace(String, String) - Method in class org.apache.any23.writer.URIListWriter
 
WriterSettings - Class in org.apache.any23.writer
This class encapsulates commonly supported settings for TripleWriter implementations.
writeTriple(Resource, IRI, Value) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
writeTriple(Resource, IRI, Value, IRI) - Method in class org.apache.any23.extractor.ExtractionResultImpl
 
writeTriple(Resource, IRI, Value, Resource) - Method in class org.apache.any23.writer.JSONWriter
Deprecated.
 
writeTriple(Resource, IRI, Value, Resource) - Method in class org.apache.any23.writer.RDFWriterTripleHandler
 
writeTriple(Resource, IRI, Value, Resource) - Method in class org.apache.any23.writer.URIListWriter
 

X

XCalExtractor - Class in org.apache.any23.extractor.calendar
 
XCalExtractor() - Constructor for class org.apache.any23.extractor.calendar.XCalExtractor
 
XCalExtractorFactory - Class in org.apache.any23.extractor.calendar
 
XCalExtractorFactory() - Constructor for class org.apache.any23.extractor.calendar.XCalExtractorFactory
 
XFNExtractor - Class in org.apache.any23.extractor.html
Extractor for the XFN microformat.
XFNExtractor() - Constructor for class org.apache.any23.extractor.html.XFNExtractor
 
XFNExtractorFactory - Class in org.apache.any23.extractor.html
 
XFNExtractorFactory() - Constructor for class org.apache.any23.extractor.html.XFNExtractorFactory
 
XML_LANG_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
XML_LITERAL_DATATYPE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
XMLNS_ATTRIBUTE - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
XMLNS_DEFAULT - Static variable in class org.apache.any23.extractor.rdfa.RDFa11Parser
Deprecated.
 
XMLValidationReportSerializer - Class in org.apache.any23.validator
Default implementation of ValidationReportSerializer for XML.
XMLValidationReportSerializer() - Constructor for class org.apache.any23.validator.XMLValidationReportSerializer
 
XMLValidationReportSerializer.NodeName - Annotation Type in org.apache.any23.validator
Allows to specify a custom node name.
XPathExtractionRule - Interface in org.apache.any23.extractor.xpath
Defines an extraction rule for the XPathExtractor.
XPathExtractor - Class in org.apache.any23.extractor.xpath
Implementation of an Extractor.TagSoupDOMExtractor able to apply XPathExtractionRules and generate quads.
XPathExtractor() - Constructor for class org.apache.any23.extractor.xpath.XPathExtractor
 
XPathExtractor(List<XPathExtractionRule>) - Constructor for class org.apache.any23.extractor.xpath.XPathExtractor
 
XPathExtractorFactory - Class in org.apache.any23.extractor.xpath
 
XPathExtractorFactory() - Constructor for class org.apache.any23.extractor.xpath.XPathExtractorFactory
 
XSLTStylesheet - Class in org.apache.any23.extractor.rdfa
An XSLT stylesheet loaded from an InputStream, can be applied to DOM trees and writes the result to a Writer.
XSLTStylesheet(InputStream) - Constructor for class org.apache.any23.extractor.rdfa.XSLTStylesheet
 
XSLTStylesheetException - Exception in org.apache.any23.extractor.rdfa
Defines an exception raised by the XSLTStylesheet class.
XSLTStylesheetException(String) - Constructor for exception org.apache.any23.extractor.rdfa.XSLTStylesheetException
 
XSLTStylesheetException(String, Throwable) - Constructor for exception org.apache.any23.extractor.rdfa.XSLTStylesheetException
 

Y

YAMLExtractor - Class in org.apache.any23.extractor.yaml
 
YAMLExtractor() - Constructor for class org.apache.any23.extractor.yaml.YAMLExtractor
 
YAMLExtractorFactory - Class in org.apache.any23.extractor.yaml
 
YAMLExtractorFactory() - Constructor for class org.apache.any23.extractor.yaml.YAMLExtractorFactory
 
YAMLProcessorException - Exception in org.apache.any23.extractor.yaml
 
YAMLProcessorException() - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
 
YAMLProcessorException(Object) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
 
YAMLProcessorException(Object, String) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
 
YAMLProcessorException(Object, String, Throwable) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
 
YAMLProcessorException(Object, String, Throwable, boolean, boolean) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
 
YAMLProcessorException(Object, Throwable) - Constructor for exception org.apache.any23.extractor.yaml.YAMLProcessorException
 
A B C D E F G H I J L M N O P Q R S T U V W X Y 
All Classes All Packages