public class HTMLLanguageParser extends java.lang.Object implements ParseFilter
X_POINT_ID
Constructor and Description |
---|
HTMLLanguageParser() |
Modifier and Type | Method and Description |
---|---|
Parse |
filter(java.lang.String url,
WebPage page,
Parse parse,
HTMLMetaTags metaTags,
org.w3c.dom.DocumentFragment doc)
Scan the HTML document looking at possible indications of content language
html lang attribute (http://www.w3.org/TR/REC-html40/struct/dirlang.html#h-8.1) meta dc.language (http://dublincore.org/documents/2000/07/16/usageguide/qualified -html.shtml#language) meta http-equiv (content-language) (http://www.w3.org/TR/REC-html40/struct/global.html#h-7.4.4.2) |
Configuration |
getConf() |
java.util.Collection<WebPage.Field> |
getFields() |
void |
setConf(Configuration conf) |
public Parse filter(java.lang.String url, WebPage page, Parse parse, HTMLMetaTags metaTags, org.w3c.dom.DocumentFragment doc)
filter
in interface ParseFilter
public void setConf(Configuration conf)
setConf
in interface Configurable
public Configuration getConf()
getConf
in interface Configurable
public java.util.Collection<WebPage.Field> getFields()
getFields
in interface FieldPluggable
Copyright © 2019 The Apache Software Foundation