Uses of Class
org.apache.nutch.indexer.IndexingException
- 
Packages that use IndexingException

Package: Description
org.apache.nutch.analysis.lang: Text document language identifier.
org.apache.nutch.indexer: Index content, configure and run indexing and cleaning jobs to add, update, and delete documents from an index.
org.apache.nutch.indexer.anchor: An indexing plugin for inbound anchor text.
org.apache.nutch.indexer.arbitrary: Indexing filter to add arbitrary document data to the index from the output of a user-specified class.
org.apache.nutch.indexer.basic: A basic indexing plugin, adds basic fields: url, host, title, content, etc.
org.apache.nutch.indexer.feed: Indexing filter to index metadata from RSS feeds.
org.apache.nutch.indexer.filter
org.apache.nutch.indexer.geoip: This plugin implements an indexing filter which takes advantage of the GeoIP2-java API.
org.apache.nutch.indexer.jexl: This plugin implements a dynamic indexing filter which uses JEXL expressions to allow filtering based on the page's metadata.
org.apache.nutch.indexer.links
org.apache.nutch.indexer.metadata: Indexing filter to add document metadata to the index.
org.apache.nutch.indexer.more: A "more" indexing plugin, adds "more" index fields: last modified date, MIME type, content length.
org.apache.nutch.indexer.replace: Indexing filter to allow pattern replacements on metadata.
org.apache.nutch.indexer.staticfield: A simple plugin called at indexing that adds fields with static data.
org.apache.nutch.indexer.subcollection: Indexing filter to assign documents to subcollections.
org.apache.nutch.indexer.tld: Top Level Domain Indexing plugin.
org.apache.nutch.indexer.urlmeta: URL Meta Tag Indexing Plugin.
org.apache.nutch.microformats.reltag: A microformats Rel-Tag Parser/Indexer/Querier plugin.
org.creativecommons.nutch: Sample plugins that parse and index Creative Commons metadata.
- 
- 
Uses of IndexingException in org.apache.nutch.analysis.lang

Methods in org.apache.nutch.analysis.lang that throw IndexingException

Modifier and Type: NutchDocument
Method: LanguageIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.apache.nutch.indexer

Methods in org.apache.nutch.indexer that throw IndexingException

Modifier and Type: NutchDocument
Method: IndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
Description: Adds fields or otherwise modifies the document that will be indexed for a parse.

Modifier and Type: NutchDocument
Method: IndexingFilters.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
Description: Run all defined filters.
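
The two methods above describe a filter chain in which any filter may fail with the checked IndexingException. The following is a minimal sketch of that pattern, not Nutch code: `Doc`, `Filter`, `HOST`, `runAll`, and the nested `IndexingException` are simplified, hypothetical stand-ins so that the example compiles without Nutch on the classpath, and the stand-in `filter` takes only a document and a URL string. The rule that a null result discards the document is an assumption about how the chain behaves.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// All types below are simplified stand-ins, NOT the real Nutch classes.
class FilterChainSketch {

    /** Stand-in for org.apache.nutch.indexer.IndexingException (a checked exception). */
    static class IndexingException extends Exception {
        IndexingException(String message) { super(message); }
        IndexingException(String message, Throwable cause) { super(message, cause); }
    }

    /** Stand-in for NutchDocument: a map from field name to its values. */
    static class Doc {
        final Map<String, List<Object>> fields = new LinkedHashMap<>();

        void add(String name, Object value) {
            fields.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
        }
    }

    /** Stand-in for the filter contract, reduced to the document and its URL. */
    interface Filter {
        Doc filter(Doc doc, String url) throws IndexingException;
    }

    /** Hypothetical filter: adds a "host" field, or fails if the URL is unusable. */
    static final Filter HOST = (doc, url) -> {
        try {
            String host = new URI(url).getHost();
            if (host == null) {
                throw new IndexingException("No host in URL: " + url);
            }
            doc.add("host", host);
            return doc;
        } catch (URISyntaxException e) {
            // Translate the low-level failure into the indexing-level exception.
            throw new IndexingException("Bad URL: " + url, e);
        }
    };

    /** Runs every filter in order; assumes a null result means "discard the document". */
    static Doc runAll(List<Filter> filters, Doc doc, String url) throws IndexingException {
        for (Filter f : filters) {
            doc = f.filter(doc, url);
            if (doc == null) {
                return null;
            }
        }
        return doc;
    }

    public static void main(String[] args) throws IndexingException {
        Doc d = runAll(List.of(HOST), new Doc(), "http://example.org/a");
        System.out.println(d.fields.get("host")); // prints [example.org]
    }
}
```

Because the exception is checked, every caller of `runAll` has to either handle it or declare it, which is why each `filter` method on this page appears in a "throws" listing.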
- 
Uses of IndexingException in org.apache.nutch.indexer.anchor

Methods in org.apache.nutch.indexer.anchor that throw IndexingException

Modifier and Type: NutchDocument
Method: AnchorIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
Description: The AnchorIndexingFilter filter object which supports boolean configuration settings for the deduplication of anchors.
- 
Uses of IndexingException in org.apache.nutch.indexer.arbitrary

Methods in org.apache.nutch.indexer.arbitrary that throw IndexingException

Modifier and Type: NutchDocument
Method: ArbitraryIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
Description: The ArbitraryIndexingFilter filter object uses reflection to instantiate the configured class and invoke the configured method.
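
The description above mentions instantiating a configured class and invoking a configured method through reflection. The following is a rough sketch of that general technique using only the JDK, and it is not the plugin's actual code: `computeValue`, `Greeter`, and the nested `IndexingException` are hypothetical stand-ins, and the sketch assumes a no-argument constructor and a no-argument public method.

```java
import java.lang.reflect.Method;

// Illustrative only: the names below are stand-ins, not the Nutch plugin's API.
class ReflectionSketch {

    /** Stand-in for org.apache.nutch.indexer.IndexingException (a checked exception). */
    static class IndexingException extends Exception {
        IndexingException(String message, Throwable cause) { super(message, cause); }
    }

    /** Hypothetical user-specified class whose method output would be indexed. */
    public static class Greeter {
        public Greeter() {}

        public String greet() { return "hello"; }
    }

    /**
     * Loads the named class, creates an instance with its no-argument
     * constructor, and calls the named no-argument public method.
     * Any reflective failure is wrapped in IndexingException.
     */
    static Object computeValue(String className, String methodName) throws IndexingException {
        try {
            Class<?> cls = Class.forName(className);
            Object instance = cls.getDeclaredConstructor().newInstance();
            Method method = cls.getMethod(methodName);
            return method.invoke(instance);
        } catch (ReflectiveOperationException e) {
            throw new IndexingException("Cannot call " + className + "." + methodName, e);
        }
    }

    public static void main(String[] args) throws IndexingException {
        Object value = computeValue(Greeter.class.getName(), "greet");
        System.out.println(value); // prints hello
    }
}
```

Catching `ReflectiveOperationException` covers a missing class, a missing method, and an exception thrown by the invoked method, so callers see one exception type for all of these failures.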
- 
Uses of IndexingException in org.apache.nutch.indexer.basic

Methods in org.apache.nutch.indexer.basic that throw IndexingException

Modifier and Type: NutchDocument
Method: BasicIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
Description: The BasicIndexingFilter filter object which supports a few configuration settings for adding basic searchable fields.
- 
Uses of IndexingException in org.apache.nutch.indexer.feed

Methods in org.apache.nutch.indexer.feed that throw IndexingException

Modifier and Type: NutchDocument
Method: FeedIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
Description: Extracts the relevant fields (FEED_AUTHOR, FEED_TAGS, FEED_PUBLISHED, FEED_UPDATED, FEED) and sends them to the Indexer for indexing within the Nutch index.
- 
Uses of IndexingException in org.apache.nutch.indexer.filter

Methods in org.apache.nutch.indexer.filter that throw IndexingException

Modifier and Type: NutchDocument
Method: MimeTypeIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)

Modifier and Type: static void
Method: MimeTypeIndexingFilter.main(String[] args)
Description: Main method for invoking this tool.
- 
Uses of IndexingException in org.apache.nutch.indexer.geoip

Methods in org.apache.nutch.indexer.geoip that throw IndexingException

Modifier and Type: NutchDocument
Method: GeoIPIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.apache.nutch.indexer.jexl

Methods in org.apache.nutch.indexer.jexl that throw IndexingException

Modifier and Type: NutchDocument
Method: JexlIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.apache.nutch.indexer.links

Methods in org.apache.nutch.indexer.links that throw IndexingException

Modifier and Type: NutchDocument
Method: LinksIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.apache.nutch.indexer.metadata

Methods in org.apache.nutch.indexer.metadata that throw IndexingException

Modifier and Type: NutchDocument
Method: MetadataIndexer.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.apache.nutch.indexer.more

Methods in org.apache.nutch.indexer.more that throw IndexingException

Modifier and Type: NutchDocument
Method: MoreIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.apache.nutch.indexer.replace

Methods in org.apache.nutch.indexer.replace that throw IndexingException

Modifier and Type: NutchDocument
Method: ReplaceIndexer.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.apache.nutch.indexer.staticfield

Methods in org.apache.nutch.indexer.staticfield that throw IndexingException

Modifier and Type: NutchDocument
Method: StaticFieldIndexer.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
Description: The StaticFieldIndexer filter object which adds fields as per configuration setting.
- 
Uses of IndexingException in org.apache.nutch.indexer.subcollection

Methods in org.apache.nutch.indexer.subcollection that throw IndexingException

Modifier and Type: NutchDocument
Method: SubcollectionIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.apache.nutch.indexer.tld

Methods in org.apache.nutch.indexer.tld that throw IndexingException

Modifier and Type: NutchDocument
Method: TLDIndexingFilter.filter(NutchDocument doc, Parse parse, Text urlText, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.apache.nutch.indexer.urlmeta

Methods in org.apache.nutch.indexer.urlmeta that throw IndexingException

Modifier and Type: NutchDocument
Method: URLMetaIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
Description: Takes the metatags that you have listed in your "urlmeta.tags" property and looks for them inside the CrawlDatum object.
- 
Uses of IndexingException in org.apache.nutch.microformats.reltag

Methods in org.apache.nutch.microformats.reltag that throw IndexingException

Modifier and Type: NutchDocument
Method: RelTagIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
- 
Uses of IndexingException in org.creativecommons.nutch

Methods in org.creativecommons.nutch that throw IndexingException

Modifier and Type: NutchDocument
Method: CCIndexingFilter.filter(NutchDocument doc, Parse parse, Text url, CrawlDatum datum, Inlinks inlinks)
 