Package | Description |
---|---|
org.apache.nutch.scoring |
The
ScoringFilter interface. |
org.apache.nutch.scoring.link |
Scoring filter
|
org.apache.nutch.scoring.opic |
Scoring filter implementing a variant of the Online Page Importance Computation
(OPIC) algorithm.
|
org.apache.nutch.scoring.tld |
Top Level Domain Scoring plugin.
|
Modifier and Type | Method and Description |
---|---|
void |
ScoringFilter.distributeScoreToOutlinks(java.lang.String fromUrl,
WebPage page,
java.util.Collection<ScoreDatum> scoreData,
int allCount)
Distribute score value from the current page to all its outlinked pages.
|
void |
ScoringFilters.distributeScoreToOutlinks(java.lang.String fromUrl,
WebPage row,
java.util.Collection<ScoreDatum> scoreData,
int allCount) |
float |
ScoringFilter.generatorSortValue(java.lang.String url,
WebPage page,
float initSort)
This method prepares a sort value for the purpose of sorting and selecting
top N scoring pages during fetchlist generation.
|
float |
ScoringFilters.generatorSortValue(java.lang.String url,
WebPage row,
float initSort)
Calculate a sort value for Generate.
|
float |
ScoringFilter.indexerScore(java.lang.String url,
NutchDocument doc,
WebPage page,
float initScore)
This method calculates a Lucene document boost.
|
float |
ScoringFilters.indexerScore(java.lang.String url,
NutchDocument doc,
WebPage row,
float initScore) |
void |
ScoringFilter.initialScore(java.lang.String url,
WebPage page)
Set an initial score for newly discovered pages.
|
void |
ScoringFilters.initialScore(java.lang.String url,
WebPage row)
Calculate a new initial score, used when adding newly discovered pages.
|
void |
ScoringFilter.injectedScore(java.lang.String url,
WebPage page)
Set an initial score for newly injected pages.
|
void |
ScoringFilters.injectedScore(java.lang.String url,
WebPage row)
Calculate a new initial score, used when injecting new pages.
|
void |
ScoringFilter.updateScore(java.lang.String url,
WebPage page,
java.util.List<ScoreDatum> inlinkedScoreData)
This method calculates a new score during table update, based on the values
contributed by inlinked pages.
|
void |
ScoringFilters.updateScore(java.lang.String url,
WebPage row,
java.util.List<ScoreDatum> inlinkedScoreData) |
Modifier and Type | Method and Description |
---|---|
void |
LinkAnalysisScoringFilter.distributeScoreToOutlinks(java.lang.String fromUrl,
WebPage page,
java.util.Collection<ScoreDatum> scoreData,
int allCount) |
float |
LinkAnalysisScoringFilter.generatorSortValue(java.lang.String url,
WebPage page,
float initSort) |
float |
LinkAnalysisScoringFilter.indexerScore(java.lang.String url,
NutchDocument doc,
WebPage page,
float initScore) |
void |
LinkAnalysisScoringFilter.initialScore(java.lang.String url,
WebPage page) |
void |
LinkAnalysisScoringFilter.injectedScore(java.lang.String url,
WebPage page) |
void |
LinkAnalysisScoringFilter.updateScore(java.lang.String url,
WebPage page,
java.util.List<ScoreDatum> inlinkedScoreData) |
Modifier and Type | Method and Description |
---|---|
float |
OPICScoringFilter.generatorSortValue(java.lang.String url,
WebPage row,
float initSort)
Use
WebPage.getScore() . |
void |
OPICScoringFilter.initialScore(java.lang.String url,
WebPage row)
Set to 0.0f (unknown value) - inlink contributions will bring it to a
correct level.
|
void |
OPICScoringFilter.injectedScore(java.lang.String url,
WebPage row) |
Modifier and Type | Method and Description |
---|---|
void |
TLDScoringFilter.distributeScoreToOutlinks(java.lang.String fromUrl,
WebPage page,
java.util.Collection<ScoreDatum> scoreData,
int allCount) |
float |
TLDScoringFilter.generatorSortValue(java.lang.String url,
WebPage page,
float initSort) |
float |
TLDScoringFilter.indexerScore(java.lang.String url,
NutchDocument doc,
WebPage page,
float initScore) |
void |
TLDScoringFilter.initialScore(java.lang.String url,
WebPage page) |
void |
TLDScoringFilter.injectedScore(java.lang.String url,
WebPage page) |
void |
TLDScoringFilter.updateScore(java.lang.String url,
WebPage page,
java.util.List<ScoreDatum> inlinkedScoreData) |
Copyright © 2019 The Apache Software Foundation