Class RelTagIndexingFilter
- java.lang.Object
- 
- org.apache.nutch.microformats.reltag.RelTagIndexingFilter
 
- 
- All Implemented Interfaces:
- Configurable,- IndexingFilter,- Pluggable
 
 public class RelTagIndexingFilter extends Object implements IndexingFilter AnIndexingFilterthat addtagfield(s) to the document.- Author:
- Jérôme Charron
- See Also:
- http://www.microformats.org/wiki/rel-tag
 
- 
- 
Field Summary- 
Fields inherited from interface org.apache.nutch.indexer.IndexingFilterX_POINT_ID
 
- 
 - 
Constructor SummaryConstructors Constructor Description RelTagIndexingFilter()
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description NutchDocumentfilter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)Adds fields or otherwise modifies the document that will be indexed for a parse.ConfigurationgetConf()voidsetConf(Configuration conf)
 
- 
- 
- 
Method Detail- 
filterpublic NutchDocument filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks) throws IndexingException Description copied from interface:IndexingFilterAdds fields or otherwise modifies the document that will be indexed for a parse. Unwanted documents can be removed from indexing by returning a null value.- Specified by:
- filterin interface- IndexingFilter
- Parameters:
- doc- document instance for collecting fields
- parse- parse data instance
- url- page url
- datum- crawl datum for the page (fetch datum from segment containing fetch status and fetch time)
- inlinks- page inlinks
- Returns:
- modified (or a new) document instance, or null (meaning the document should be discarded)
- Throws:
- IndexingException- if an error occurs during during filtering
 
 - 
setConfpublic void setConf(Configuration conf) - Specified by:
- setConfin interface- Configurable
 
 - 
getConfpublic Configuration getConf() - Specified by:
- getConfin interface- Configurable
 
 
- 
 
-