Package | Description |
---|---|
org.apache.nutch.indexer |
Index content, configure and run indexing and cleaning jobs to
add, update, and delete documents from an index.
|
org.apache.nutch.metadata |
A Multi-valued Metadata container, and set
of constant fields for Nutch Metadata.
|
org.apache.nutch.net.protocols |
Helper classes related to the
Protocol
interface, sea also org.apache.nutch.protocol . |
org.apache.nutch.parse |
The
Parse interface and related classes. |
org.apache.nutch.protocol |
Classes related to the
Protocol interface,
see also org.apache.nutch.net.protocols . |
org.apache.nutch.protocol.http |
Protocol plugin which supports retrieving documents via the http protocol.
|
org.apache.nutch.protocol.httpclient |
Protocol plugin which supports retrieving documents via the HTTP and
HTTPS protocols, optionally with Basic, Digest and NTLM authentication
schemes for web server as well as proxy server.
|
Modifier and Type | Method and Description |
---|---|
Metadata |
NutchDocument.getDocumentMeta() |
Modifier and Type | Class and Description |
---|---|
class |
SpellCheckedMetadata
A decorator to Metadata that adds spellchecking capabilities to property
names.
|
Modifier and Type | Method and Description |
---|---|
Metadata |
MetaWrapper.getMetadata()
Get all metadata.
|
Constructor and Description |
---|
MetaWrapper(Metadata metadata,
Writable instance,
Configuration conf) |
Modifier and Type | Method and Description |
---|---|
Metadata |
Response.getHeaders()
Returns all the headers.
|
Modifier and Type | Method and Description |
---|---|
Metadata |
HTMLMetaTags.getGeneralTags()
Returns all collected values of the general meta tags.
|
Modifier and Type | Method and Description |
---|---|
java.util.Map<Outlink,Metadata> |
NutchSitemapParse.getOutlinkMap() |
Modifier and Type | Method and Description |
---|---|
void |
NutchSitemapParse.setOutlinks(java.util.Map<Outlink,Metadata> outlinkMap) |
Constructor and Description |
---|
NutchSitemapParse(java.util.Map<Outlink,Metadata> outlinkMap,
ParseStatus parseStatus) |
Modifier and Type | Method and Description |
---|---|
Metadata |
Content.getMetadata()
Other protocol-specific data.
|
Modifier and Type | Method and Description |
---|---|
void |
Content.setMetadata(Metadata metadata)
Other protocol-specific data.
|
Constructor and Description |
---|
Content(java.lang.String url,
java.lang.String base,
byte[] content,
java.lang.String contentType,
Metadata metadata,
Configuration conf) |
Content(java.lang.String url,
java.lang.String base,
byte[] content,
java.lang.String contentType,
Metadata metadata,
MimeUtil mimeTypes) |
Modifier and Type | Method and Description |
---|---|
Metadata |
HttpResponse.getHeaders() |
Modifier and Type | Method and Description |
---|---|
Metadata |
HttpResponse.getHeaders() |
Modifier and Type | Method and Description |
---|---|
HttpAuthentication |
HttpAuthenticationFactory.findAuthentication(Metadata header) |
Copyright © 2019 The Apache Software Foundation