Class RelTagParser
- java.lang.Object
-
- org.apache.nutch.microformats.reltag.RelTagParser
-
- All Implemented Interfaces:
Configurable,HtmlParseFilter,Pluggable
public class RelTagParser extends Object implements HtmlParseFilter
Adds microformat rel-tags of document if found.- See Also:
- http://www.microformats.org/wiki/rel-tag
-
-
Field Summary
Fields Modifier and Type Field Description static StringREL_TAG-
Fields inherited from interface org.apache.nutch.parse.HtmlParseFilter
X_POINT_ID
-
-
Constructor Summary
Constructors Constructor Description RelTagParser()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description ParseResultfilter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)Scan the HTML document looking at possible rel-tagsConfigurationgetConf()voidsetConf(Configuration conf)
-
-
-
Field Detail
-
REL_TAG
public static final String REL_TAG
- See Also:
- Constant Field Values
-
-
Method Detail
-
filter
public ParseResult filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)
Scan the HTML document looking at possible rel-tags- Specified by:
filterin interfaceHtmlParseFilter- Parameters:
content- theContentfor a given responseparseResult- the result of running on or moreParser's on the content.metaTags- a populatedHTMLMetaTagsobjectdoc- aDocumentFragment(DOM) which can be processed in the filtering process.- Returns:
- a filtered
ParseResult - See Also:
Parser.getParse(Content)
-
setConf
public void setConf(Configuration conf)
- Specified by:
setConfin interfaceConfigurable
-
getConf
public Configuration getConf()
- Specified by:
getConfin interfaceConfigurable
-
-