All Classes Interface Summary Class Summary Enum Summary Exception Summary Annotation Types Summary
Class |
Description |
AboutNotURIRule |
This rule is able to detect whether an about value is a valid URL or otherwise is a valid relative URL.
|
AcceptHeaderBuilder |
Concatenates a collection of MIME specs in "type/subtype;q=x.x" notation into an HTTP Accept header value, and
removes duplicates and types covered by wildcards.
|
AdrExtractor |
Extractor for the adr microformat.
|
AdrExtractorFactory |
|
Any23 |
A facade with convenience methods for typical Any23 extraction operations.
|
Any23ValueFactoryWrapper |
Any23 specialization of the ValueFactory .
|
BaseRDFExtractor |
Base class for a generic RDF Extractor.ContentExtractor .
|
BenchmarkTripleHandler |
TripleHandler decorator useful to perform benchmarking.
|
ByteArrayDocumentSource |
ByteArray implementation of DocumentSource .
|
CompositeTripleHandler |
A TripleHandler multi decorator, that wraps zero or more other triple handlers and dispatches all events to
each of them.
|
CountingTripleHandler |
A simple TripleHandler that merely counts the number of triples it has received.
|
CSVExtractor |
This extractor produces RDF from a CSV file .
|
CSVExtractorFactory |
|
DefaultDOMDocument |
This class wraps the DOM document.
|
DefaultHTTPClient |
|
DefaultHTTPClientConfiguration |
|
DefaultRuleContext |
|
DefaultValidationReport |
|
DefaultValidationReportBuilder |
|
DefaultValidator |
|
DiscoveryUtils |
This class provides utility methods for discovering classes in packages.
|
DocumentReport |
Represents the validationReportBuilder generated by a the TagSoupParser when a document is retrieved and
validated.
|
DOMDocument |
This interface models a document to be processed by the Validator .
|
DomUtils |
This class provides utility methods for DOM manipulation.
|
ElementsProcessor |
|
ElementsProcessor.ModelHolder |
A model holder describes the two required parameters which makes a model useful in further processing: a root
node and model itself.
|
EmbeddedJSONLDExtractor |
This extractor represents the HTML script tags used to embed blocks of data in documents.
|
EmbeddedJSONLDExtractorFactory |
|
EmptyValidationReport |
|
EntityBasedMicroformatExtractor |
Base class for microformat extractors based on entities.
|
ExampleInputOutput |
A reporter for example input and output of an extractor.
|
ExtractionContextBlocker |
A wrapper around a TripleHandler that can block and unblock calls to the handler, either for the entire
document, or for individual ExtractionContext s.
|
ExtractionReport |
This class contains some statistics and general information about an extraction.
|
ExtractionResultImpl |
A default implementation of ExtractionResult ; it receives extraction output from one Extractor
working on one document, and passes the output on to a TripleHandler .
|
ExtractorRegistryImpl |
Singleton class acting as a register for all the various Extractor .
|
FileDocumentSource |
File implementation of DocumentSource .
|
FileUtils |
Utility class for handling files.
|
Fix |
This interface models a fix for an issue.
|
FunctionalSyntaxExtractor |
|
FunctionalSyntaxExtractorFactory |
|
GeoExtractor |
Extractor for the Geo microformat.
|
GeoExtractorFactory |
|
HAdrExtractor |
Extractor for the h-adr microformat.
|
HAdrExtractorFactory |
|
HCalendarExtractor |
|
HCalendarExtractorFactory |
|
HCardExtractor |
Extractor for the hCard microformat.
|
HCardExtractor |
Extractor for the h-Card microformat.
|
HCardExtractorFactory |
|
HCardExtractorFactory |
|
HCardName |
An HCard name, consisting of various parts.
|
HeadLinkExtractor |
This Extractor.TagSoupDOMExtractor implementation retrieves the LINK s
declared within the HTML/HEAD page header.
|
HeadLinkExtractorFactory |
|
HEntryExtractor |
Extractor for the h-entry microformat.
|
HEntryExtractorFactory |
Extractor for the h-entry microformat.
|
HEventExtractor |
Extractor for the h-event microformat.
|
HEventExtractorFactory |
|
HGeoExtractor |
Extractor for the h-geo microformat.
|
HGeoExtractorFactory |
|
HItemExtractor |
Extractor for the h-item microformat.
|
HItemExtractorFactory |
|
HListingExtractor |
|
HListingExtractorFactory |
|
HProductExtractor |
|
HProductExtractorFactory |
|
HRecipeExtractor |
Extractor for the hRecipe microformat.
|
HRecipeExtractor |
Extractor for the hRecipe microformat.
|
HRecipeExtractorFactory |
|
HRecipeExtractorFactory |
|
HResumeExtractor |
Extractor for the hResume microformat.
|
HResumeExtractor |
Extractor for the hResume microformat.
|
HResumeExtractorFactory |
|
HResumeExtractorFactory |
|
HReviewAggregateExtractor |
|
HReviewAggregateExtractorFactory |
|
HReviewExtractor |
Extractor for the hReview microformat.
|
HReviewExtractorFactory |
|
HTMLDocument |
A wrapper around the DOM representation of an HTML document.
|
HTMLDocument.TextField |
This class represents a text extracted from the HTML DOM related to the node from which such test has been
retrieved.
|
HTMLMetaExtractor |
|
HTMLMetaExtractorFactory |
|
HTTPClient |
|
HTTPClientConfiguration |
|
HTTPDocumentSource |
Http implementation of DocumentSource .
|
ICalExtractor |
|
ICalExtractorFactory |
|
ICBMExtractor |
Extractor for "ICBM coordinates" provided as META headers in the head of an HTML page.
|
ICBMExtractorFactory |
|
IgnoreAccidentalRDFa |
A TripleHandler that suppresses output of the RDFa parser if the document only contains "accidental" RDFa,
like stylesheet links and other non-RDFa uses of HTML's
|
IgnoreTitlesOfEmptyDocuments |
A TripleHandler that suppresses output of the TitleExtractor unless some other triples could be
parsed from the document.
|
Includes |
This annotation could be used to decorate a MicroformatExtractor to represent which of the other
Microformats could it nest.
|
Includes |
This annotation could be used to decorate a MicroformatExtractor to represent which of the other
Microformats could it nest.
|
Item |
Base class for Microdata items.
|
ItemProp |
Describes a Microdata item property.
|
ItemPropValue |
Describes a possible value for a Microdata item property.
|
ItemPropValue.Type |
Supported types.
|
ItemScope |
This class describes a Microdata itemscope.
|
JCalExtractor |
|
JCalExtractorFactory |
|
JSONLDExtractor |
Concrete implementation of Extractor.ContentExtractor handling
JSON-LD format.
|
JSONLDExtractorFactory |
|
JSONLDWriter |
Implementation of JSON-LD TripleWriter .
|
JSONLDWriterFactory |
|
JSONWriter |
Deprecated.
|
JSONWriterFactory |
Deprecated.
|
JsoupUtils |
|
LicenseExtractor |
|
LicenseExtractorFactory |
|
LocalCopyFactory |
A factory that creates local copies of DocumentSource s.
|
LoggingTripleHandler |
Triple handler decorator useful for logging purposes.
|
LogUtils |
Deprecated.
|
ManchesterSyntaxExtractor |
Concrete implementation of Extractor.ContentExtractor handling
Manchester Syntax format.
|
ManchesterSyntaxExtractorFactory |
|
MathUtils |
Mathematical utility functions.
|
MemCopyFactory |
Creates local copies of DocumentSource by reading them into an in-memory buffer.
|
MetaNameMisuseFix |
Fixes the misuse of the meta name attribute.
|
MetaNameMisuseRule |
Checks whether the meta attribute name is used to contain a property.
|
MicrodataExtractor |
Default implementation of Microdata extractor, based on
Extractor.TagSoupDOMExtractor .
|
MicrodataExtractorFactory |
|
MicrodataParser |
This class provides utility methods for handling Microdata nodes contained within a DOM document.
|
MicrodataParserException |
Defines an exception occurring while parsing Microdata.
|
MicrodataParserReport |
|
MicroformatExtractor |
|
Microformats2Prefixes |
|
MissingItemscopeAttributeValueFix |
|
MissingItemscopeAttributeValueRule |
This fixes missing attribute values for the 'itemscope' attribute Typically when such a snippet of XHTML is fed
through the RDFa11Extractor , and subsequently to Sesame's SesameRDFaParser,
it will result in the following behavior.
|
MissingOpenGraphNamespaceRule |
This rule detects the issue of missing Open Graph namespace.
|
NQuadsExtractor |
Concrete implementation of Extractor.ContentExtractor handling
N-Quads format.
|
NQuadsExtractorFactory |
|
NQuadsWriter |
Implementation of an N-Quads TripleWriter .
|
NQuadsWriterFactory |
|
NTriplesExtractor |
Concrete implementation of Extractor.ContentExtractor handling NTriples
NTriples format.
|
NTriplesExtractorFactory |
|
NTriplesWriter |
Implementation of an N-Triples TripleWriter .
|
NTriplesWriterFactory |
|
OpenGraphNamespaceFix |
This fixes the missing Open Graph protocol.
|
PopularPrefixes |
This class act as a container for various well-known and adopted RDF Vocabulary prefixes.
|
QuadTemplate |
This class models a NQuads template, that is a quadruple in which any component can be a variable.
|
RDFa11Extractor |
Extractor implementation for RDFa
1.1 specification.
|
RDFa11ExtractorFactory |
|
RDFa11Parser |
Deprecated.
|
RDFa11ParserException |
Deprecated.
|
RDFaExtractor |
Extractor implementation for RDFa
1.0 specification.
|
RDFaExtractorFactory |
|
RDFHandlerAdapter |
An RDFHandler that relays statements and prefix definitions to an ExtractionResult .
|
RDFParserFactory |
This factory provides a common logic for creating and configuring correctly any RDF parser used within the
library.
|
RDFSchemaUtils |
This class provides a set of methods for generating RDF Schema.
|
RDFUtils |
Basic class providing a set of utility methods when dealing with RDF.
|
RDFWriterTripleHandler |
A TripleHandler that writes triples to a Sesame RDFWriter , eg for serialization
using one of Sesame's writers.
|
RDFXMLExtractor |
Concrete implementation of Extractor.ContentExtractor able to perform the
extraction on RDF/XML documents.
|
RDFXMLExtractorFactory |
|
RDFXMLWriter |
RDF/XML TripleWriter implementation.
|
RDFXMLWriterFactory |
|
ReaderInputStream |
Adapts a Reader as an InputStream .
|
ReportingTripleHandler |
A TripleHandler that collects various information about the extraction process, such as the extractors used
and the total number of triples.
|
RepositoryWriter |
An RDF4J repository triple writer.
|
Rule |
Models a rule for an issue detection.
|
RuleContext<T> |
This context is used to pass data from a Rule to a Fix .
|
SerializationException |
|
SimpleExtractorFactory<T extends org.apache.any23.extractor.Extractor<?>> |
This class is a simple and default-like implementation of ExtractorFactory .
|
SingleDocumentExtraction |
This class acts as a facade where all extractors (for a given MIMEType) can be called on a single document.
|
SingleDocumentExtractionReport |
|
SpanCloserInputStream |
Extension of InputStream meant to detect and replace any occurrence of inline span:
|
SpeciesExtractor |
|
SpeciesExtractorFactory |
|
StreamUtils |
Contains general utility functions for handling streams.
|
StringDocumentSource |
String implementation of DocumentSource .
|
StringUtils |
This class provides a set of string utility methods.
|
TagSoupExtractionResult |
This interface models a specific ExtractionResult able to collect property roots generated by HTML
Microformat extractions.
|
TagSoupExtractionResult.PropertyPath |
Defines a property path object.
|
TagSoupExtractionResult.ResourceRoot |
Defines a property root object.
|
TagSoupParser |
|
TagSoupParser.ElementLocation |
Describes a DOM Element location.
|
TemplateGraph |
Represents an Quad graph IRI template.
|
TemplateObject |
Represents a Quad object template.
|
TemplateObject.Type |
Supported object types.
|
TemplatePredicate |
Represents a Quad predicate template.
|
TemplateSubject |
Represents a Quad subject template.
|
TemplateSubject.Type |
Supported subject types.
|
TemplateXPathExtractionRule |
|
TemplateXPathExtractionRuleImpl |
|
Term<T extends org.eclipse.rdf4j.model.Value> |
Represents a generic template term.
|
TitleExtractor |
Extracts the value of the <title> element of an HTML or XHTML page.
|
TitleExtractorFactory |
|
TripleWriterHandler |
This class connects a TripleHandler to a TripleWriter by writing received data.
|
TriXExtractor |
Concrete implementation of Extractor.ContentExtractor to perform extraction on
TriX documents.
|
TriXExtractorFactory |
|
TriXWriter |
TriX TripleWriter implementation.
|
TriXWriterFactory |
|
TurtleExtractor |
Concrete implementation of Extractor.ContentExtractor able to perform the
extraction on Turtle documents.
|
TurtleExtractorFactory |
|
TurtleHTMLExtractor |
Extractor for Turtle/N3 format embedded within HTML script tags.
|
TurtleHTMLExtractorFactory |
|
TurtleWriter |
N3 notation TripleWriter implementation.
|
TurtleWriterFactory |
|
URIListWriter |
This writer simply produces a list of unique IRI present in the subject or in the object of every single
extracted RDF Statement.
|
URIListWriterFactory |
|
URLUtils |
Contains general utility functions for handling URLs.
|
ValidationReport |
This class contains the report of a validation performed by the Validator class.
|
ValidationReport.Error |
An error occurred while performing the validation process.
|
ValidationReport.FixError |
An error occurred while executing a fix.
|
ValidationReport.Issue |
An issue found during the validation process.
|
ValidationReport.IssueLevel |
Defines the different issue levels.
|
ValidationReport.RuleActivation |
This class describes the activation of a rule.
|
ValidationReport.RuleError |
An error occurred while executing a rule.
|
ValidationReportBuilder |
The report interface is used to generate diagnostics about validation.
|
ValidationReportSerializer |
Defines a serializer for validation reports.
|
Validator |
The validator class allows to perform validation - correction of related to HTML Document
instances.
|
ValidatorException |
Models any exception raised by the Validator .
|
Variable |
|
WriterSettings |
This class encapsulates commonly supported settings for TripleWriter implementations.
|
XCalExtractor |
|
XCalExtractorFactory |
|
XFNExtractor |
Extractor for the XFN microformat.
|
XFNExtractorFactory |
|
XMLValidationReportSerializer |
|
XMLValidationReportSerializer.NodeName |
Allows to specify a custom node name.
|
XPathExtractionRule |
|
XPathExtractor |
Implementation of an Extractor.TagSoupDOMExtractor able to apply
XPathExtractionRule s and generate quads.
|
XPathExtractorFactory |
|
XSLTStylesheet |
An XSLT stylesheet loaded from an InputStream, can be applied to DOM trees and writes the result to a Writer .
|
XSLTStylesheetException |
|
YAMLExtractor |
|
YAMLExtractorFactory |
|
YAMLProcessorException |
|