Package | Description |
---|---|
org.apache.nutch.parse | The Parse interface and related classes. |
org.creativecommons.nutch | Sample plugins that parse and index Creative Commons metadata. |
Modifier and Type | Class and Description |
---|---|
class | ParserNotFound |
Modifier and Type | Method and Description |
---|---|
Parse | ParseUtil.parse(java.lang.String url, WebPage page) |
Modifier and Type | Method and Description |
---|---|
static void | CCParseFilter.Walker.walk(org.w3c.dom.Node doc, java.net.URL base, WebPage page, Configuration conf) Scan the document adding attributes to metadata. |
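
The idea of scanning a document and adding attributes to metadata can be illustrated with a minimal, JDK-only sketch. This is not the CCParseFilter.Walker source: the class name `MetadataWalkerSketch`, the `License-Url` key, and the choice to record `rel="license"` links are all assumptions made for illustration, and a plain `Map` stands in for the `WebPage` metadata.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

// Illustrative only: this is NOT the CCParseFilter.Walker implementation.
final class MetadataWalkerSketch {

    // Hypothetical metadata key, chosen for this example.
    static final String LICENSE_URL_KEY = "License-Url";

    /** Depth-first scan that records the first rel="license" anchor it finds. */
    static void walk(Node node, Map<String, String> metadata) {
        if (node.getNodeType() == Node.ELEMENT_NODE) {
            Element element = (Element) node;
            boolean isAnchor = "a".equalsIgnoreCase(element.getTagName());
            // getAttribute returns "" when the attribute is absent.
            if (isAnchor && "license".equalsIgnoreCase(element.getAttribute("rel"))) {
                metadata.putIfAbsent(LICENSE_URL_KEY, element.getAttribute("href"));
            }
        }
        // Recurse into every child, including children of the document node.
        for (Node child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            walk(child, metadata);
        }
    }

    /** Parses well-formed markup and returns the metadata collected by walk. */
    static Map<String, String> scan(String xhtml) {
        try {
            Node doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xhtml)));
            Map<String, String> metadata = new LinkedHashMap<>();
            walk(doc, metadata);
            return metadata;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse markup", e);
        }
    }

    public static void main(String[] args) {
        String page = "<html><body><p>Text</p>"
                + "<a rel=\"license\" href=\"https://creativecommons.org/licenses/by/4.0/\">CC BY</a>"
                + "</body></html>";
        System.out.println(scan(page));
    }
}
```

The sketch keeps the same shape as the listed method (a static walk over an `org.w3c.dom.Node` that writes into a metadata holder) but omits the base URL and configuration parameters, whose handling is not described on this page.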
Copyright © 2019 The Apache Software Foundation