web pages, host metadata) of data in abstracted storage.
Class | Description |
---|---|
Host | Host represents a store of webpages or other data which resides on a server or other computer so that it can be accessed over the Internet. |
Host.Builder | RecordBuilder for Host instances. |
Host.Tombstone | |
ParseStatus | A nested container representing parse status data captured from invocation of parsers on fetch of a WebPage. |
ParseStatus.Builder | RecordBuilder for ParseStatus instances. |
ParseStatus.Tombstone | |
ProtocolStatus | A nested container representing data captured from web server responses. |
ProtocolStatus.Builder | RecordBuilder for ProtocolStatus instances. |
ProtocolStatus.Tombstone | |
StorageUtils | Entry point to Gora store/mapreduce functionality. |
WebPage | WebPage is the primary data structure in Nutch representing crawl data for a given WebPage at some point in time. |
WebPage.Builder | RecordBuilder for WebPage instances. |
WebPage.Tombstone | |
WebTableCreator | |

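
The Builder entries above follow the record-builder pattern: a builder collects field values and `build()` returns the finished record. The sketch below is a minimal, dependency-free illustration of that pattern only. `PageRecord`, its two fields, and all of its methods are hypothetical stand-ins chosen for the example; they do not reproduce the actual `WebPage` or `WebPage.Builder` API.

```java
// Hypothetical stand-in for a generated record; NOT the real Nutch WebPage.
final class PageRecord {
    private final String baseUrl;
    private final int status;

    // Only the builder creates instances.
    private PageRecord(String baseUrl, int status) {
        this.baseUrl = baseUrl;
        this.status = status;
    }

    String getBaseUrl() { return baseUrl; }

    int getStatus() { return status; }

    // Static factory for a fresh builder (name chosen for illustration).
    static Builder newBuilder() { return new Builder(); }

    // Collects field values, then produces an immutable record.
    static final class Builder {
        private String baseUrl = "";
        private int status = 0;

        Builder setBaseUrl(String baseUrl) {
            // Reject null early so a built record is always fully populated.
            this.baseUrl = java.util.Objects.requireNonNull(baseUrl, "baseUrl");
            return this;
        }

        Builder setStatus(int status) {
            this.status = status;
            return this;
        }

        PageRecord build() { return new PageRecord(baseUrl, status); }
    }
}
```

A caller would chain the setters and finish with `build()`, for example `PageRecord.newBuilder().setBaseUrl("http://example.com/").setStatus(2).build()`. Fields that are never set keep the builder's defaults.
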

Enum | Description |
---|---|
Host.Field | Enum containing all of the data bean's fields. |
Mark | |
ParseStatus.Field | Enum containing all of the data bean's fields. |
ProtocolStatus.Field | Enum containing all of the data bean's fields. |
WebPage.Field | Enum containing all of the data bean's fields. |
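
Each Field enum listed above enumerates the fields of its data bean. As a rough illustration of how such an enum can be used, the sketch below assumes each constant pairs a positional index with a field name, and uses those names to select a subset of fields. `PageField`, its three constants, and the `namesOf` helper are hypothetical; they are not the actual contents of `WebPage.Field`.

```java
// Hypothetical field enum; the constants are illustrative only and are
// NOT the real WebPage.Field list.
enum PageField {
    BASE_URL(0, "baseUrl"),
    STATUS(1, "status"),
    FETCH_TIME(2, "fetchTime");

    private final int index;   // position of the field within the record
    private final String name; // name of the field as stored

    PageField(int index, String name) {
        this.index = index;
        this.name = name;
    }

    int getIndex() { return index; }

    String getName() { return name; }

    // Returns the names of the given fields in the order supplied,
    // e.g. to ask a store for only some columns of a record.
    static String[] namesOf(PageField... fields) {
        String[] names = new String[fields.length];
        for (int i = 0; i < fields.length; i++) {
            names[i] = fields[i].getName();
        }
        return names;
    }
}
```

Naming fields through an enum rather than raw strings lets the compiler catch misspelled field names before a query is ever issued.
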
Copyright © 2019 The Apache Software Foundation