Package org.apache.nutch.parse.tika
Class HTMLMetaProcessor
- java.lang.Object
- 
- org.apache.nutch.parse.tika.HTMLMetaProcessor
 
- 
 public class HTMLMetaProcessor extends Object Class for parsing META Directives from DOM trees. This class handles specifically Robots META directives (all, none, nofollow, noindex), finding BASE HREF tags, and HTTP-EQUIV no-cache instructions. All meta directives are stored in a HTMLMetaTags instance.
- 
- 
Constructor SummaryConstructors Constructor Description HTMLMetaProcessor()
 - 
Method SummaryAll Methods Static Methods Concrete Methods Modifier and Type Method Description static voidgetMetaTags(HTMLMetaTags metaTags, Node node, URL currURL)Sets the indicators inrobotsMetato appropriate values, based on any META tags found under the givennode.
 
- 
- 
- 
Method Detail- 
getMetaTagspublic static final void getMetaTags(HTMLMetaTags metaTags, Node node, URL currURL) Sets the indicators inrobotsMetato appropriate values, based on any META tags found under the givennode.- Parameters:
- metaTags- a- HTMLMetaTagsto populate with tags discovered in the given Node
- node- a DOM- Nodeto process and extract metadata from
- currURL- the cononical URL associated with the metatags and Node
 
 
- 
 
-