public class IndexingFilters
extends java.lang.Object
IndexingFilter
implementing plugins.Modifier and Type | Field and Description |
---|---|
static java.lang.String |
INDEXINGFILTER_ORDER |
Constructor and Description |
---|
IndexingFilters(Configuration conf) |
Modifier and Type | Method and Description |
---|---|
NutchDocument |
filter(NutchDocument doc,
java.lang.String url,
WebPage page)
Run all defined filters.
|
java.util.Collection<WebPage.Field> |
getFields()
Gets all the fields for a given
WebPage Many datastores need to
setup the mapreduce job by specifying the fields needed. |
public static final java.lang.String INDEXINGFILTER_ORDER
public IndexingFilters(Configuration conf)
public NutchDocument filter(NutchDocument doc, java.lang.String url, WebPage page) throws IndexingException
IndexingException
public java.util.Collection<WebPage.Field> getFields()
WebPage
Many datastores need to
setup the mapreduce job by specifying the fields needed. All extensions
that work on WebPage are able to specify what fields they need.Copyright © 2019 The Apache Software Foundation