Uses of Class
org.apache.nutch.parse.ParseText
-
Packages that use ParseText Package Description org.apache.nutch.parse TheParseinterface and related classes.org.apache.nutch.segment A segment stores all data from on generate/fetch/update cycle: fetch list, protocol status, raw content, parsed content, and extracted outgoing links. -
-
Uses of ParseText in org.apache.nutch.parse
Methods in org.apache.nutch.parse that return ParseText Modifier and Type Method Description static ParseTextParseText. read(DataInput in)Methods in org.apache.nutch.parse with parameters of type ParseText Modifier and Type Method Description voidParseResult. put(String key, ParseText text, ParseData data)Store a result of parsing.voidParseResult. put(Text key, ParseText text, ParseData data)Store a result of parsing.Constructors in org.apache.nutch.parse with parameters of type ParseText Constructor Description ParseImpl(ParseText text, ParseData data)ParseImpl(ParseText text, ParseData data, boolean isCanonical) -
Uses of ParseText in org.apache.nutch.segment
Methods in org.apache.nutch.segment with parameters of type ParseText Modifier and Type Method Description booleanSegmentMergeFilter. filter(Text key, CrawlDatum generateData, CrawlDatum fetchData, CrawlDatum sigData, Content content, ParseData parseData, ParseText parseText, Collection<CrawlDatum> linked)The filtering method which gets all information being merged for a given key (URL).booleanSegmentMergeFilters. filter(Text key, CrawlDatum generateData, CrawlDatum fetchData, CrawlDatum sigData, Content content, ParseData parseData, ParseText parseText, Collection<CrawlDatum> linked)Iterates over allSegmentMergeFilterextensions and if any of them returns false, it will return false as well.
-