public class OPICScoringFilter extends java.lang.Object implements ScoringFilter
X_POINT_ID
Constructor and Description |
---|
OPICScoringFilter() |
Modifier and Type | Method and Description |
---|---|
void |
distributeScoreToOutlinks(java.lang.String fromUrl,
WebPage row,
java.util.Collection<ScoreDatum> scoreData,
int allCount)
Get cash on hand, divide it by the number of outlinks and apply.
|
float |
generatorSortValue(java.lang.String url,
WebPage row,
float initSort)
Use
WebPage.getScore() . |
Configuration |
getConf() |
java.util.Collection<WebPage.Field> |
getFields() |
float |
indexerScore(java.lang.String url,
NutchDocument doc,
WebPage row,
float initScore)
Dampen the boost value by scorePower.
|
void |
initialScore(java.lang.String url,
WebPage row)
Set to 0.0f (unknown value) - inlink contributions will bring it to a
correct level.
|
void |
injectedScore(java.lang.String url,
WebPage row)
Set an initial score for newly injected pages.
|
void |
setConf(Configuration conf) |
void |
updateScore(java.lang.String url,
WebPage row,
java.util.List<ScoreDatum> inlinkedScoreData)
Increase the score by a sum of inlinked scores.
|
public Configuration getConf()
getConf
in interface Configurable
public void setConf(Configuration conf)
setConf
in interface Configurable
public void injectedScore(java.lang.String url, WebPage row) throws ScoringFilterException
ScoringFilter
injectedScore
in interface ScoringFilter
url
- url of the pagerow
- new page. Filters will modify it in-place.ScoringFilterException
public void initialScore(java.lang.String url, WebPage row) throws ScoringFilterException
initialScore
in interface ScoringFilter
url
- url of the pageScoringFilterException
public float generatorSortValue(java.lang.String url, WebPage row, float initSort) throws ScoringFilterException
WebPage.getScore()
.generatorSortValue
in interface ScoringFilter
url
- url of the pagerow
- WebPage
object relative to the URLinitSort
- initial sort value, or a value from previous filters in chainScoringFilterException
public void updateScore(java.lang.String url, WebPage row, java.util.List<ScoreDatum> inlinkedScoreData)
updateScore
in interface ScoringFilter
url
- url of the pagerow
- WebPage
object relative to the URLinlinkedScoreData
- list of ScoreDatum
s for all inlinks pointing to
this URL.public void distributeScoreToOutlinks(java.lang.String fromUrl, WebPage row, java.util.Collection<ScoreDatum> scoreData, int allCount)
distributeScoreToOutlinks
in interface ScoringFilter
fromUrl
- url of the source pagescoreData
- A list of ScoreDatum
allCount
- number of all collected outlinks from the source pagepublic float indexerScore(java.lang.String url, NutchDocument doc, WebPage row, float initScore)
indexerScore
in interface ScoringFilter
url
- url of the pagedoc
- document. NOTE: this already contains all information collected by
indexing filters. Implementations may modify this instance, in
order to store/remove some information.initScore
- initial boost value for the Lucene document.public java.util.Collection<WebPage.Field> getFields()
getFields
in interface FieldPluggable
Copyright © 2019 The Apache Software Foundation